Wednesday, November 21, 2012

Yelp Transit Maps

Yelp-rated routes in Boston. Click to embiggen.
I noticed a while ago—as did some other folks—that Yelp users have been, for some time, rating transit lines. I was intrigued. Here was really interesting data about how people felt about different transit lines, distilled in to a simple 1-to-5 rating. While not every line was ranked in Boston (my first search) there were plenty that were, and I compiled a list of routes, star-ratings and the number of Yelps.

In Boston, only some routes were rated, and they were, not surprisingly, centered in the more student- and hipster-centric part of the city. For instance, no bus line in Dorchester or Mattapan got Yelped, but most in Cambridge and Somerville have many reviews. I figured the best way to show these data was on a map, and after some machinations (especially in resorting the shapefile so the thinner lines would display on top of the thicker ones) I got the map above. It's pretty cool—click it to enlarge. (Here's the full MBTA system map if you're not familiar with the lines; I left off route numbers for clarity.)

But I realized that pretty cool wasn't cool enough. There was probably a much richer data set out there. Boston has about 750 Yelp reviews, 450 of those for rail lines. Was there city with a wired-in community, lots of bus and transit routes, and high transit ridership? Did I just describe San Francisco to a T? And, voila, there are nearly 2000 Yelp reviews of transit lines in San Francisco, at least one (and usually many more) for nearly every line Muni runs (see exceptions below). (Here's the Muni system map.)
Muni Lines, as reviewed by Yelp. Click to Embiggen.
San Francisco inset. Click to embiggen.
That. Is. Sexy. The N-Judah has nearly 200 reviews. Wow. And in case the downtown area is too clustered for you, there's an inset to the right.

I also realized that I had a pretty fun data set here, too. I went to a talk by Jarrett Walker the other day at MIT where he mentioned, amongst other things, that we should not focus on the technology used for transit, but whether if fulfills the mission of getting people from one place to another. In San Francisco, we have a jumble of buses, trolleybuses, streetcars and even cable cars and we have a pretty good way of quantifying whether they are accomplishing the job of transit. (In Boston, even though the B Line serves tens of thousands of passengers a day it manages a 1.36 Yelp rating—remarkable as the lowest possible rating is 1. None of its 34 raters give it a 4 or a 5. Still, it moves a lot of people marginally faster than they could walk.)

First, I averaged the ratings by technology type. Trolleybus route get more reviews than bus routes, probably because they are more heavily used. The average rating for these, however, is quite similar. (The average is a straight average of each line, the weighted average weighs more frequently-rated lines by the number of ratings). Cable cars and PCCs (F-Marked and Wharves) have higher ratings but many are likely by tourists. Light rail lines, however, are frequently rated, and given low ratings, significantly lower than the bus routes.

Vehicle typeRoutesAvg ReviewsAvg StarsWeighted Avg
Cable Car3643.813.87
Light Rail5652.312.43

A forthcoming post will compare local and express bus routes. (Hint: people like riding expresses more than locals.)

I am so interested in San Francisco's Yelp bus ratings that I've tabled the whole of the network.
LineVehicleStars# RatingsLineVehicleStars# Ratings
33Trolleybus3.2433CaliforniaCable Car4.1369
36Bus2.110FPCC Streetcar3.42114
37Bus3.4212JLight Rail2.4945
38Bus2.45119KTLight Rail2.1323
41Trolleybus2.8217LLight Rail2.1338
43Bus2.8228MLight Rail2.2331
44Bus2.8324NLight Rail2.55189
45Trolleybus2.6221Powell-HydeCable Car3.7899
47Bus2.1323Powell-MasonCable Car3.5225

The only lines not Yelped are the 35-Eureka and 56-Rutland. These lines have 30-minute headways (as does the 17-Parkmerced, see this route service chart with headways for all lines) while most lines in San Francisco have service every 15 minutes or better.

Next up: New York's subways. And beyond.


  1. The train ratings match my experience well. Let's put the lower-ridership Blue Line aside, since I've only used it once and it serves a different part of the city. Of the other three, the Red Line is the best in terms of frequency, train modernity, and station quality (at least from South Station north; south of South Stations it's probably not as nice, but fewer people use it so it doesn't weigh down the ratings too much). The Orange is better than the Green in terms of rolling stock quality, and also in terms of frequency unless you're using the Green Line's shared section. Really the problem is that in the off-off-peak, the Green Line branches and the Orange Line drop to really bad frequencies, like 13 or 16 minutes; the Red Line branches do the same, but most ridership is north of JFK-UMass so it's less of a problem.

    Now if only the MBTA turned those off-peak headways into 15-minute clockface patterns, posted clearly at each station...

  2. Great post, lot of good data in here.

    We think Yelp isn't the ideal way for people to rate and review their MBTA rides so we're building a site just for riders and commuters. Check us out at

    Our data so far from Twitter has shown that the REd Line is the most reviewed line and also among the poorest. Daily delays and breakdowns. The Green Line is right behind it while the Orange Line ranks higher.

  3. @ratethembta, that's a great idea. I think this is helpful in the sense that these are ratings from a wider population; RateTheMBTA will probably pull from a different subset than Yelpers.

    @Alon, agree with most points. The OL frequencies are pretty awful off-peak, and it would be helpful if they were clockface, although the real time prediction apps (and new signage at most stations) help it. The GL can be unpredictable and there's no way to know where the trains are; they really need some technology there. The one saving grace is that inbound during the evenings the D Line at least runs on a pretty good schedule although, again, it's nowhere near clockface (every 13 minutes or something silly).