Content Moderation

  • Data: YouTube comments with moderation outcome, video misinformation, channel partisanship, linguistic and social engagement controls.
    [ Download ]


  • ComLex: An emotional and topical lexicon of 300 categories from user comments on social media.
    Only 56 named categories are human evaluated.
    [ Download ]

  • Fact-Checked Posts: A dataset of 5K+ social media posts fact-checked by Snopes or PolitiFact.
    [ Download ]

  • User Comments: A dataset of 2.6M+ user comments on social media for above posts.
    [ Facebook | Twitter | YouTube ]


  • TNCsToday: Visualization of Uber and Lyft drivers in San Francisco.
    Available at:

  • Data: Unfortunately, due to Uber’s and Lyft’s Terms of Service, we cannot make the data from the study publicly available.