Misinformation

  • ComLex: An emotional and topical lexicon of 300 clusters, generated from user comments on social media.
    Only 56 named clusters are human evaluated.
    [ Download ]

  • Fact-Checked Posts: A dataset of 5K+ social media posts fact-checked by Snopes or PolitiFact.
    [ Download ]

  • User Comments: A dataset of 2.6M+ user comments on social media for above posts.
    [ Facebook | Twitter | YouTube ]

Partisan Bias

  • PolarShare: Visualization of media bias by polarized sharing on Twitter.
    Available at: https://polarshare.shanjiang.me

  • Data: The complete dataset for 10K+ websites is available upon requests.

Ridesharing

  • TNCsToday: Visualization of Uber and Lyft drivers in San Francisco.
    Available at: https://tncstoday.sfcta.org

  • Data: Unfortunately, due to Uber’s and Lyft’s Terms of Service, we cannot make the data from the study publicly available.