The Digital Methods School

Empowering researchers to unlock closed societies through open data

The Digital Methods School advances the study of Russia and other semi-closed societies by equipping scholars and experts with accessible digital research methods. We provide curated databases, innovative tools, and hands-on training to turn digital traces into reliable empirical evidence—without requiring specialized technical skills. By democratizing access to cutting-edge methodologies, we foster rigorous, data-driven research that illuminates societies otherwise difficult to reach.

Sign up to join us

DIGITAL RESEARCH TRAINING ON RUSSIA

Designed for research in the social sciences and humanities, the program equips a broad community of scholars, policy analysts, and business experts with practical tools to collect, analyze, and interpret data on Russia — helping them generate evidence-based insights into a semi-closed society. The trainings are structured to be accessible to participants with no prior experience in digital research, while also offering advanced applications for experienced researchers.

Participants will gain hands-on experience with a wide range of methods, including text data analysis, administrative data research, integration of AI and large language models into the research process, and survey data design and evaluation under the specific constraints of Russia. These skills can be applied to pressing research problems such as analyzing political, social, and economic attitudes under authoritarian conditions from tracing the internalization of state narratives within populations to examining Russia’s strategic decision-making. More broadly, they open new pathways for uncovering empirical evidence in contexts where conventional fieldwork is restricted or impossible.

ORGANIZATION

All training sessions are conducted online and are free of charge. We hold one training block per semester, with each topic announced alongside a call for applications. Each training includes 15-25 hours of lectures and dedicated workshops. Lectures within each block are open to an unlimited number of participants, but require applications to be submitted by the stated deadline. Workshops, by contrast, have limited enrollment and focus on personalized work—either guiding students through practical tasks or supporting them in advancing their own research under the supervision of a lecturer.

Trainings are organized in a hybrid model:

Lectures distributed weekly
Workshops in intensive blocks

We strongly encourage the creation of knowledge commons and networking among participants. To foster this, we combine different platforms for communication and resource sharing: YouTube for tutorials and selected lectures, Discord for peer exchange, and Zoom for personalized interaction.

Curriculum

Introduction by Ivan Grek
Tuesday March 3 18:00 CET / 12:00 PM EST

Lectures: Every Wednesday March 4 - April 15 18:00 CET / 12:00 PM EST

Workshops: Every Thursday March 5 - April 16 18:00 CET / 12:00 PM EST

Sign up to join us.

March 3: Creative digital thinking, Introduction

by Ivan Grek, the director of the Russia Program

Creative Digital Thinking: Using Technology in Social Science and Humanities Research. We will explore how digital tools can transform research in the social sciences and humanities. This is an introduction to the field of digital research and an overview of practical solutions and analytical capabilities developed through the Russia Program’s studies.

Ivan Grek, PhD, is Director of the Russia Program at The George Washington University, where he focuses on the application of digital tools and innovative methodologies for scholarly and policy research on Russia. His research examines ideology, corporate structural power, and geopolitical transformation in the emerging world order, combining political theory with data-driven analysis.

March 4 and 5: Seeing the Invisible: Studying Russia through Alternative and Indirect Data Sources

by Alexander Keysut, Senior Researcher, Cedar

Since 2022, traditional methods of social science research in Russia — including fieldwork and surveys — have become increasingly difficult to implement. At the same time, a wide range of administrative, digital, and textual data remains available, allowing researchers to study social, political, and economic processes indirectly.

We will survey the ecosystem of data sources that remain available for studying contemporary Russia. The lecture will cover administrative registries, court and procurement records, business databases, digital platforms and social media, and the still-extensive official statistics. We will discuss what kinds of research questions each source enables, where and how these data can be accessed, and what biases and risks they involve. The goal is to provide a practical map of empirical evidence that researchers can still rely on under conditions of restricted access.

During the workshop on March 5 we will focus on working with Russian court decisions as a large-scale textual dataset. Participants will examine the use of large language models (LLMs) to extract entities and structured information from legal texts, with attention to practical workflows and validation issues.

Alexander Keysut is an economist and data analyst with many years of experience working on open data projects in Russia. He contributed to the development of a court decisions parser covering Russian courts — a tool downloaded several hundred times by researchers and independent journalists, and used as the basis for more than a dozen published investigations and analytical pieces. Alexander specializes in large-scale data processing, statistical analysis, and building data infrastructure for civic and research purposes, with deep expertise in working with Russian administrative and judicial datasets.

March 11 and 12: Working with Russian Data Using AI: A Crash Course

by Damir Malikov, AI Lead at Kronika

A practical introduction to large language models for researchers working with Russian-context data. What LLMs can and can't do, where to find relevant datasets, how to collect and process data at scale — from no-code tools to programmatic pipelines. The session ends with a hands-on workshop: entity extraction from a Russian text corpus. No prior AI experience required.

March 18 and 19: Prompt Engineering: Unlocking the Full Power of AI

by Alexey Sidorenko, the director of Teplitsa. Technologies for Social Good

Large language models (LLMs) are increasingly shaping research, policy analysis, and media work. Their effectiveness depends on how interaction with them is structured. Prompt engineering provides a methodological framework for producing reliable, analytically useful outputs across both GUI environments and API integrations.

This two-day lecture equips participants with the ability to:

Understand how LLMs process instructions, tokens, and context windows
Design structured prompts for research, analytical, and policy-oriented tasks
Apply various techniques to improve depth and precision
Use role definition, constraints, and stepwise reasoning to enhance output quality
Identify bias, hallucinations, and uncertainty in model responses
Evaluate and validate AI-generated content for academic and professional use
The session frames prompting as a disciplined approach to human–AI interaction, with emphasis on reproducibility, epistemic reliability, and analytical accountability in politically sensitive and information-dense contexts.

Alexey Sidorenko is director of Teplitsa: Technologies for Social Good, a capacity-building initiative supporting pro-democracy activists, civil society organizations, and independent media in authoritarian environments. For over a decade, he has helped journalists, activists, and NGOs build digital resilience through practical tools, security practices, and strategic thinking. He focuses on AI's impact on civil society and media — delivering AI training for journalists, running AI hackathons, advising newsrooms on responsible AI deployment, and overseeing the launch of factbutcher.com, a GPT-powered fact-checking chatbot. Alexey holds a Ph.D. in Geography (Moscow State University, 2010), and several AI certifications, including an IBM AI Product Manager certification, and Google's Responsible AI for Developers certification.

March 25 and 26: From the Kremlin to Pandas

by Dr Bartłomiej Gajos, historian of Russia, visiting Fellow at Harvard University

A short, practical course for students. This session will show you how a historian uses data science to work with large Russian-language source collections. It focuses on the real challenges of Slavic inflectional languages (especially Russian) demonstrating how you can turn messy historical materials into clean datasets ready to be analyzed.

The following questions will be addressed:

Why Russian (and other Slavic languages) are tricky for quantitative text analysis (many word forms, spelling and style variation),
How to make key preparation choices: normalisation, stemming vs lemmatisation, detecting duplicates and near-duplicates, and why these decisions change your results,
How to treat official sources critically (provenance, context, and what “the data” really represents),
How to go from PDF to dataset: extracting text from born-digital and scanned PDFs, cleaning Cyrillic text, adding metadata, and exporting to CSV / DataFrame.

We will work with pre-prepared real-world research datasets, including a substantial collection of materials from Kremlin.ru and a complex PDF example for the practical session. The course will demonstrate that data science in the humanities is not only about coding. Rather, it is about making methodological decisions with sources.

Dr Bartłomiej Gajos is a historian of Russia who combines archival research with computer-based analysis of large datasets (data science). He specialises in politics of memory and in how the Kremlin uses history for political purposes. He is a Visiting Fellow at Harvard University under a scholarship from the Kościuszko Foundation. He earned his PhD at the Tadeusz Manteuffel Institute of History, Polish Academy of Sciences, with a dissertation on the Bolsheviks’ politics of memory (1917–1920). His academic achievements have been recognised with Poland’s Prime Minister’s Award, a START scholarship from the Foundation for Polish Science, and a Diamond Grant from the Ministry of Science and Higher Education. He currently works at the Juliusz Mieroszewski Centre in Warsaw. Together with Ernest Wyciszkiewicz, he co-hosts the YouTube programme Polihistor 2.0 devoted to Russia, which reaches around 250,000 views per month.

April 1 and 2: Using AI for Russian Policy Analysis (Cybersecurity and Cybercensorship)

by Alexey Sidorenko, the director of Teplitsa. Technologies for Social Good

AI in Policy Analysis

Russian Policymaking System

Formal and Informal Institutions

Cybersecurity Policy

- Narrative and Discourse Analysis of Policy Documents and Speeches

Cybercensorship

- Introduction to Internet measurement domain

- Cybercensorship research approaches

Alexey Sidorenko is director of Teplitsa: Technologies for Social Good, a capacity-building initiative supporting pro-democracy activists, civil society organizations, and independent media in authoritarian environments. For over a decade, he has helped journalists, activists, and NGOs build digital resilience through practical tools, security practices, and strategic thinking. He focuses on AI's impact on civil society and media — delivering AI training for journalists, running AI hackathons, advising newsrooms on responsible AI deployment, and overseeing the launch of factbutcher.com, a GPT-powered fact-checking chatbot. Alexey holds a Ph.D. in Geography (Moscow State University, 2010), and several AI certifications, including an IBM AI Product Manager certification, and Google's Responsible AI for Developers certification.

April 8 and 9: Computational Text Analysis for Russian-Language Media

by Raushan Zhandayeva, PhD, Postdoctoral Associate at the Institute for Data, Democracy, and Politics, GWU

This lecture covers text-as-data: from acquiring and preprocessing text, to building document-feature matrices, to the unique challenges of Russian morphology. It then introduces three topic modeling approaches (LDA, STM, and KeyATM) and two sentiment analysis methods: dictionary-based and transformer-based emotion detection. The examples are drawn from a 125,000+ article corpus of Russian and Western state-affiliated media coverage of the war on Ukraine.

April 15 and 16: Monitoring narratives and identifying bot farms on social networks

by Pavel Bannikov, OSINT specialist

This compact module focuses on how to track propaganda narratives and detect coordinated inauthentic behavior specifically on Telegram and VK, using advanced search techniques, alternative search, and practical AI‑assisted workflows. Participants will learn to move from ad‑hoc channel browsing to systematic, reproducible analysis of large Telegram ecosystems relevant to Russia’s war, sanctions, foreign and domestic politics.

Core focus

Telegram as a key battlefield for pro‑Kremlin and anti‑Kremlin narratives, including semi‑closed ecosystems, comment floods, and anonymous political channels.
Deep search workflows: from keyword strategies and Boolean queries to using specialized OSINT tools and web-interfaces.
AI tools for narrative and bot‑network analysis: clustering posts, detecting emerging frames, spotting likely bots via content similarity, timing, and cross‑channel patterns.

Lecture

How does AI change propaganda worldwide?
Telegram channels, groups, and comment sections are used in Russian information operations and counter‑narratives.
Typical signatures of inauthentic behavior on social networks.
Tools and tips for monitoring Telegram, VK, and OK.
Integrating AI into OSINT: using LLMs and ML‑based tools for narrative labeling, topic clustering, sentiment, and anomaly detection, with validation and bias checks.
OPSEC: why you have to stay invisible and how.

How to work with LLMs locally without CLI.

Workshop

Working with real Telegram data (curated or brought by participants), small groups will:

Define the first appearance of narratives/viral posts.
Build a focused monitoring query around one narrative (e.g., “referendums,” “mobilization,” “Russophobia”) and collect posts/comments using deep search and open‑source tools.
Build and use AI‑assisted workflows to cluster messages, highlight recurrent frames, and flag suspected bot comments or channels.
OPTIONAL | Produce a short analytic note that combines narrative evolution, basic quantitative indicators, and a preliminary map of likely inauthentic activity.

Pavel Bannikov is a linguist, fact-checker, OSINT specialist, and trainer with extensive experience in investigative journalism, media literacy, and digital security across Central Asia, Eastern Europe, the Baltics, the Caucasus, and Russia. He is Co-founder and CPO at Tesari AI, an AI copilot for investigative intelligence, and an editor at Provereno Media. Since 2017, Pavel has trained over 1,500 journalists, fact-checkers, activists, and OSCE scholarship holders. He has developed courses on media literacy, cybersecurity, statistics, and OSINT for institutions including The Moscow School of Social and Economic Sciences and the Faculty of Liberal Arts and Sciences Montenegro. He co-authored the Factcheck.Academy online course and Kazakhstan's first media literacy textbook, with research focused on hate speech, anti-vaccination disinformation, and propaganda narratives.

Subscribe to our newsletter to stay up to date with the Russia Program’s latest work

CONTACT US

INSTITUTE FOR EUROPEAN, RUSSIAN AND EURASIAN STUDIES 1957 E St NW Washington, DC 20052

1957 E St., NW, Suite 412,
Washington, DC 20052

russiaprogram@gwu.edu
+1 (202) 9946340

Campus
Advisories

EO/Nondiscrimination
Policy (PDF)

Privacy Notice

Accessibility

CONTACT US

1957 E St., NW, Suite 412,
Washington, DC 20052

russiaprogram@gwu.edu
+1 (202) 9946340

Campus
Advisories

Privacy Notice

EO/Nondiscrimination
Policy (PDF)

Accessibility

The Digital Methods School

Empowering researchers to unlock closed societies through open data

Sign up to join us

DIGITAL RESEARCH TRAINING ON RUSSIA

ORGANIZATION

Trainings are organized in a hybrid model:

Lectures distributed weekly

Workshops in intensive blocks

We strongly encourage the creation of knowledge commons and networking among participants. To foster this, we combine different platforms for communication and resource sharing: YouTube for tutorials and selected lectures, Discord for peer exchange, and Zoom for personalized interaction.

Curriculum

Introduction by Ivan Grek Tuesday March 3 18:00 CET / 12:00 PM EST

Lectures: Every Wednesday March 4 - April 15 18:00 CET / 12:00 PM EST

Workshops: Every Thursday March 5 - April 16 18:00 CET / 12:00 PM EST

Sign up to join us.March 3: Creative digital thinking, Introduction

by Ivan Grek, the director of the Russia Program

March 4 and 5: Seeing the Invisible: Studying Russia through Alternative and Indirect Data Sources

by Alexander Keysut, Senior Researcher, Cedar

March 11 and 12: Working with Russian Data Using AI: A Crash Course

by Damir Malikov, AI Lead at Kronika

March 18 and 19: Prompt Engineering: Unlocking the Full Power of AI

by Alexey Sidorenko, the director of Teplitsa. Technologies for Social Good

Understand how LLMs process instructions, tokens, and context windows

Design structured prompts for research, analytical, and policy-oriented tasks

Apply various techniques to improve depth and precision

Use role definition, constraints, and stepwise reasoning to enhance output quality

Identify bias, hallucinations, and uncertainty in model responses

Evaluate and validate AI-generated content for academic and professional use

The session frames prompting as a disciplined approach to human–AI interaction, with emphasis on reproducibility, epistemic reliability, and analytical accountability in politically sensitive and information-dense contexts.

March 25 and 26: From the Kremlin to Pandas

by Dr Bartłomiej Gajos, historian of Russia, visiting Fellow at Harvard University

The following questions will be addressed:

Why Russian (and other Slavic languages) are tricky for quantitative text analysis (many word forms, spelling and style variation),

How to make key preparation choices: normalisation, stemming vs lemmatisation, detecting duplicates and near-duplicates, and why these decisions change your results,

How to treat official sources critically (provenance, context, and what “the data” really represents),

How to go from PDF to dataset: extracting text from born-digital and scanned PDFs, cleaning Cyrillic text, adding metadata, and exporting to CSV / DataFrame.

April 1 and 2: Using AI for Russian Policy Analysis (Cybersecurity and Cybercensorship)

by Alexey Sidorenko, the director of Teplitsa. Technologies for Social Good

AI in Policy Analysis

Russian Policymaking System

Formal and Informal Institutions

Cybersecurity Policy

- Narrative and Discourse Analysis of Policy Documents and Speeches

Cybercensorship

- Introduction to Internet measurement domain

- Cybercensorship research approaches

April 8 and 9: Computational Text Analysis for Russian-Language Media

by Raushan Zhandayeva, PhD, Postdoctoral Associate at the Institute for Data, Democracy, and Politics, GWU

April 15 and 16: Monitoring narratives and identifying bot farms on social networks

by Pavel Bannikov, OSINT specialist

Core focus

Telegram as a key battlefield for pro‑Kremlin and anti‑Kremlin narratives, including semi‑closed ecosystems, comment floods, and anonymous political channels.

Deep search workflows: from keyword strategies and Boolean queries to using specialized OSINT tools and web-interfaces.

AI tools for narrative and bot‑network analysis: clustering posts, detecting emerging frames, spotting likely bots via content similarity, timing, and cross‑channel patterns.

Lecture

How does AI change propaganda worldwide?

Telegram channels, groups, and comment sections are used in Russian information operations and counter‑narratives.

Typical signatures of inauthentic behavior on social networks.

Tools and tips for monitoring Telegram, VK, and OK.

Integrating AI into OSINT: using LLMs and ML‑based tools for narrative labeling, topic clustering, sentiment, and anomaly detection, with validation and bias checks.

OPSEC: why you have to stay invisible and how.

How to work with LLMs locally without CLI.

Workshop

Working with real Telegram data (curated or brought by participants), small groups will:

Define the first appearance of narratives/viral posts.

Build a focused monitoring query around one narrative (e.g., “referendums,” “mobilization,” “Russophobia”) and collect posts/comments using deep search and open‑source tools.

Build and use AI‑assisted workflows to cluster messages, highlight recurrent frames, and flag suspected bot comments or channels.

OPTIONAL | Produce a short analytic note that combines narrative evolution, basic quantitative indicators, and a preliminary map of likely inauthentic activity.

Subscribe to our newsletter to stay up to date with the Russia Program’s latest work

SUBSCRIBE

CONTACT US

SEND

CONTACT US

SEND

Introduction by Ivan Grek
Tuesday March 3 18:00 CET / 12:00 PM EST

Sign up to join us.

March 3: Creative digital thinking, Introduction