Skip to content

A tool for analyzing how leaders of great powers discuss foreign policy. It scrapes official cables and provides analytical tools for comparing rhetorical trends over time and across regions.

Notifications You must be signed in to change notification settings

ccoop129/speech-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

speech-analysis

A project scraping, analyzing, and visualizing speeches from the Russian and Chinese foreign ministries by Chris Cooper, Jack Lennon, and Mandy Tao.

Datasets

Using the china-speech-scraper and russia-speech-scraper, we gathered china_speeches.csv, Lavrov_2026_2014_D4P.csv, and Lavrov_Speeches_D4P.json from the Russian and Chinese Ministries of Foreign Affairs.

We used keywords.py to make viz_cache.json, which is what generates the interactable chart on the Insights page.

We used R to join the two datasets into China_Russia_Speeches.csv and .json.

We used word_count.py to make CH_RU_processed_lemmatized.json

We used count_noun_chunks_entities.py to make noun_chunks_entities_count.json, top_1000_words_combined.json, china_top_1000_words.json and russia_top_1000_words.json.

We used tfidf_analysis.py to make tfidf_rsults.json, tfidf_results_china_edited.json and tfidif_results_russia_edited.json.

About

A tool for analyzing how leaders of great powers discuss foreign policy. It scrapes official cables and provides analytical tools for comparing rhetorical trends over time and across regions.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published