<codeBook xmlns="ddi:codebook:2_5" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="ddi:codebook:2_5 https://ddialliance.org/Specification/DDI-Codebook/2.5/XMLSchema/codebook.xsd" version="2.5"><docDscr><citation><titlStmt><titl>Linked Multi-Model Data on Russian Domestic and Foreign Policy Speeches</titl><IDNo agency="DOI">doi:10.7910/DVN/SGI0VK</IDNo></titlStmt><distStmt><distrbtr source="archive">Harvard Dataverse</distrbtr><distDate>2026-01-27</distDate></distStmt><verStmt source="archive"><version date="2026-01-27" type="RELEASED">1</version></verStmt><biblCit>Blinova, Daria; Gayathri Emuru; Rakesh Emuru; Kushagradheer Shridheer Srivastava; Rulis, Mina; Sunita Chandrasekaran; Bagozzi, Benjamin, 2026, "Linked Multi-Model Data on Russian Domestic and Foreign Policy Speeches", https://doi.org/10.7910/DVN/SGI0VK, Harvard Dataverse, V1</biblCit></citation></docDscr><stdyDscr><citation><titlStmt><titl>Linked Multi-Model Data on Russian Domestic and Foreign Policy Speeches</titl><IDNo agency="DOI">doi:10.7910/DVN/SGI0VK</IDNo></titlStmt><rspStmt><AuthEnty affiliation="University of Delaware">Blinova, Daria</AuthEnty><AuthEnty affiliation="University of Delaware">Gayathri Emuru</AuthEnty><AuthEnty affiliation="University of Delaware">Rakesh Emuru</AuthEnty><AuthEnty affiliation="University of Delaware">Kushagradheer Shridheer Srivastava</AuthEnty><AuthEnty affiliation="University of Pennsylvania">Rulis, Mina</AuthEnty><AuthEnty affiliation="University of Delaware">Sunita Chandrasekaran</AuthEnty><AuthEnty affiliation="University of Delaware">Bagozzi, Benjamin</AuthEnty></rspStmt><prodStmt/><distStmt><distrbtr source="archive">Harvard Dataverse</distrbtr><contact affiliation="University of Delaware" email="bagozzib@udel.edu">Bagozzi, Benjamin</contact><depositr>Bagozzi, Benjamin</depositr><depDate>2026-01-13</depDate></distStmt><holdings URI="https://doi.org/10.7910/DVN/SGI0VK"/></citation><stdyInfo><subject><keyword xml:lang="en">Computer and Information Science</keyword><keyword xml:lang="en">Social Sciences</keyword></subject><abstract>This Dataverse entry incldues a dataset of interlinked multimodal political communications from the Russian government, addressing  persistent deficiencies in the availability of social text- and image-based data for authoritarian politics contexts. The dataset comprises two large corpora of official speeches delivered by senior actors within the Kremlin and the Russian Ministry of Foreign Affairs over multiple decades. For each speech, we provide Russian- and English-language texts, associated images and captions where available, and harmonized metadata including (e.g.) dates, speakers, (geo)locations, and official government content tags. Unique identifiers link images to speeches and align Russian and English versions of the same communication texts. We further augment these linked datasets with validated topical annotations for both speech texts and speech images, which are generated via transformer-based multimodal topic modeling and refined by a Russian politics expert. The resulting data resources support multimodal, multilingual, temporal, and/or spatial analyses of (authoritarian) political communication and offer a valuable testbed for social science research and large language model (LLM) applications in political domains.</abstract><sumDscr/></stdyInfo><method><dataColl><sources/></dataColl><anlyInfo/></method><dataAccs><setAvail/><useStmt/><notes type="DVN:TOU" level="dv">&lt;a href="http://creativecommons.org/publicdomain/zero/1.0">CC0 1.0&lt;/a></notes></dataAccs><othrStdyMat/></stdyDscr><otherMat ID="f13403176" URI="https://dataverse.harvard.edu/api/access/datafile/13403176" level="datafile"><labl>kremlin_english_images.zip</labl><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">application/zip</notes></otherMat><otherMat ID="f13402391" URI="https://dataverse.harvard.edu/api/access/datafile/13402391" level="datafile"><labl>kremlin_mid_en_ru_auxiliary_files.zip</labl><txt>Auxiliary outputs from BERTopic topic modeling for Kremlin &amp; MID corpora (EN/RU): interactive HTML topic explorers and long-format topic-probability files for text and images</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">application/zip</notes></otherMat><otherMat ID="f13402362" URI="https://dataverse.harvard.edu/api/access/datafile/13402362" level="datafile"><labl>kremlin_mid_en_ru_final_csvs.zip</labl><txt>Final curated CSVs for all four corpora (Kremlin EN/RU, MID EN/RU), including metadata + curated text/image topic IDs, labels, groups, and topic probabilities.</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">application/zip</notes></otherMat><otherMat ID="f13403151" URI="https://dataverse.harvard.edu/api/access/datafile/13403151" level="datafile"><labl>kremlin_russian_images.zip</labl><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">application/zip</notes></otherMat><otherMat ID="f13403125" URI="https://dataverse.harvard.edu/api/access/datafile/13403125" level="datafile"><labl>mid_english_images.zip</labl><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">application/zip</notes></otherMat><otherMat ID="f13403126" URI="https://dataverse.harvard.edu/api/access/datafile/13403126" level="datafile"><labl>mid_russian_images.zip</labl><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">application/zip</notes></otherMat><otherMat ID="f13405778" URI="https://dataverse.harvard.edu/api/access/datafile/13405778" level="datafile"><labl>README.txt</labl><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/plain</notes></otherMat></codeBook>