Text as Data
Big data and computational power are growing fast, and it’s not just the big tech companies making use of it. Social sciences are also benefiting from these advances. Natural Language Processing (NLP) has come a long way, and now researchers can analyze text data in ways that were impossible just a few decades ago. Social media posts, transcripts of political debates, and parliamentary speeches are now valuable data sources for studying human behavior, political discourse, and societal trends. This opens up a bunch of new questions and gives us new ways to study them.
Our “Text-as-Data” project uses NLP methods to analyze how politicians communicate across social media and other channels. We investigate the dynamics of disinformation, examining how false or misleading information spreads and evolves within political communication. Understanding these patterns is crucial in an age of eroding trust in democratic institutions and polarized societies. We also study how different groups of citizens are addressed, shedding light on patterns of political inequality in messaging. This is important for assessing whether political communication reflects or exacerbates existing social divides, and for ensuring that all voices are equally valued in public discourse. Additionally, we analyze whether there has been an ideological shift in political discourse in recent years, helping us understand how political narratives adapt to societal changes and shape public opinion. By addressing these challenges, our research aims to contribute to a deeper understanding of contemporary political dynamics and the role of communication in shaping democracy.
To answer these questions, we are creating new methods to combine data from different sources. This lets us find connections and trends across different types of communication. This approach helps us understand political communication better. It also gives us new tools and frameworks for analyzing complex social phenomena.