Open Discourse (doi:10.7910/DVN/FIKIBO)

View:

Part 1: Document Description
Part 2: Study Description
Part 3: Data Files Description
Part 4: Variable Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description

Citation

Title:

Open Discourse

Identification Number:

doi:10.7910/DVN/FIKIBO

Distributor:

Harvard Dataverse

Date of Distribution:

2020-12-23

Version:

4

Bibliographic Citation:

Richter, Florian; Koch, Philipp; Franke, Oliver; Kraus, Jakob; Kuruc, Fabrizio; Thiem, Anja; Högerl, Judith; Heine, Stella; Schöps, Konstantin, 2020, "Open Discourse", https://doi.org/10.7910/DVN/FIKIBO, Harvard Dataverse, V4, UNF:6:GwkT9AA6a4VvjWm0LumTqw== [fileUNF]

Study Description

Citation

Title:

Open Discourse

Subtitle:

The first fully Comprehensive and Annotated Corpus of the Parliamentary Protocols of the German Bundestag

Identification Number:

doi:10.7910/DVN/FIKIBO

Authoring Entity:

Richter, Florian (Limebit GmbH)

Koch, Philipp (Limebit GmbH)

Franke, Oliver (Limebit GmbH)

Kraus, Jakob (Limebit GmbH)

Kuruc, Fabrizio (Limebit GmbH)

Thiem, Anja (Limebit GmbH)

Högerl, Judith (Limebit GmbH)

Heine, Stella (Limebit GmbH)

Schöps, Konstantin (Limebit GmbH)

Producer:

Limebit GmbH

Distributor:

Harvard Dataverse

Access Authority:

Richter, Florian

Depositor:

Richter, Florian

Date of Deposit:

2020-12-18

Holdings Information:

https://doi.org/10.7910/DVN/FIKIBO

Study Scope

Keywords:

Computer and Information Science, Social Sciences, Text data mining, Natural Language Processing, Bundestag, Plenary Minutes, Political Science, Speech, Parliament, Corpus Liguistics, Germany

Topic Classification:

Plenary Minutes

Abstract:

Data files of the Open Discourse corpus in different formats

Time Period:

1949-09-12-2020-11-06

Methodology and Processing

Sources Statement

Data Access

Notes:

<a href="http://creativecommons.org/publicdomain/zero/1.0">CC0 1.0</a>

Other Study Description Materials

File Description--f6544724

File: contributions_extended.tab

  • Number of cases: 2546212

  • No. of variables per record: 9

  • Type of File: text/tab-separated-values

Notes:

UNF:6:1pppZ31Cl92EZOp/lvVRFg==

File Description--f6544754

File: electoral_terms.tab

  • Number of cases: 20

  • No. of variables per record: 3

  • Type of File: text/tab-separated-values

Notes:

UNF:6:PYUgVQelbAOy7Gzm7Gc3EQ==

File Description--f6544758

File: factions.tab

  • Number of cases: 28

  • No. of variables per record: 3

  • Type of File: text/tab-separated-values

Notes:

UNF:6:EyUZbdzPXuQu4zsMv6qrHg==

File Description--f6544762

File: politicians.tab

  • Number of cases: 4386

  • No. of variables per record: 11

  • Type of File: text/tab-separated-values

Notes:

UNF:6:tj+g4C8hJvxYs+CFLdH3CA==

Variable Description

List of Variables:

Variables

id

f6544724 Location:

Summary Statistics: Mean 1273105.5; StDev 735028.236144988; Valid 2546212.0; Max. 2546211.0; Min. 0.0

Variable Format: numeric

Notes: UNF:6:N4h7XRFkDvHo763fW9vhHw==

type

f6544724 Location:

Variable Format: character

Notes: UNF:6:CzekgQFBjY+g77KrA6MMSA==

first_name

f6544724 Location:

Variable Format: character

Notes: UNF:6:TXT5UxyX1q9k8boq5wzXoQ==

last_name

f6544724 Location:

Variable Format: character

Notes: UNF:6:4eP1HtiGbj+RAzL+9MAz4g==

politician_id

f6544724 Location:

Summary Statistics: StDev 5139764.746207702; Mean 3540095.2148548886; Min. -1.0; Valid 2546212.0; Max. 1.1005301E7;

Variable Format: numeric

Notes: UNF:6:HqKF1F4Z4DcieS3F/YAgUQ==

content

f6544724 Location:

Variable Format: character

Notes: UNF:6:YHDyRa9ZhjZoLeE70CbN6w==

speech_id

f6544724 Location:

Summary Statistics: Valid 2546212.0; Mean 591047.5561371424; StDev 275924.8534132814; Max. 1072836.0; Min. 6.0;

Variable Format: numeric

Notes: UNF:6:4XuEL1GMuZgCqNT6kN+TNQ==

text_position

f6544724 Location:

Summary Statistics: Max. 495.0; Min. 0.0; Valid 2546212.0; Mean 7.472024324715805; StDev 11.395229801209377

Variable Format: numeric

Notes: UNF:6:N8xr25ImuMqiBtjOgHVMug==

faction_id

f6544724 Location:

Summary Statistics: StDev 8.60514583549721; Mean 11.291282108464268; Valid 2546212.0; Max. 26.0; Min. -1.0;

Variable Format: numeric

Notes: UNF:6:sz+5XanLifMXxHxBivAs/A==

id

f6544754 Location:

Summary Statistics: Min. 1.0; Max. 20.0; Mean 10.5; StDev 5.916079783099616; Valid 20.0;

Variable Format: numeric

Notes: UNF:6:/FIOZM/29oC3TK/IE52m2A==

start_date

f6544754 Location:

Summary Statistics: StDev 6.972244663232499E8; Min. -6.411744E8; Valid 20.0; Max. 1.6352928E9; Mean 4.9894272E8

Variable Format: numeric

Notes: UNF:6:tQWsbP0YEnj9MKH6isp18Q==

end_date

f6544754 Location:

Summary Statistics: Max. 1.761696E9; Valid 20.0; StDev 6.974579449041003E8; Min. -5.125248E8; Mean 6.1900416E8

Variable Format: numeric

Notes: UNF:6:kqhh8iw8JbY+yRPd6343CQ==

id

f6544758 Location:

Summary Statistics: StDev 8.225975119502044; Valid 28.0; Min. -1.0; Mean 12.5; Max. 26.0;

Variable Format: numeric

Notes: UNF:6:Gzx8DR6hu7/paArES61Y7w==

abbreviation

f6544758 Location:

Variable Format: character

Notes: UNF:6:dqnOdNYHts5X9AyFwTWoCw==

full_name

f6544758 Location:

Variable Format: character

Notes: UNF:6:3cfWdBgqKXDrsN4x/Q9GLw==

id

f6544762 Location:

Summary Statistics: Min. -1.0; StDev 166139.8386295936; Valid 4386.0; Mean 1.0999927752621979E7; Max. 1.1005308E7

Variable Format: numeric

Notes: UNF:6:Q2kn1H/8+6jSPYilpTz7XA==

first_name

f6544762 Location:

Variable Format: character

Notes: UNF:6:Rmih9CcMCssrgQe9TPJ6hw==

last_name

f6544762 Location:

Variable Format: character

Notes: UNF:6:zP2QPiUAdYNptpoUpnhxyw==

birth_place

f6544762 Location:

Variable Format: character

Notes: UNF:6:jF78M7n+LNGB2KnTWZpYFA==

birth_country

f6544762 Location:

Variable Format: character

Notes: UNF:6:cVoDvrVAmq++bAIDZKtr7A==

birth_date

f6544762 Location:

Variable Format: character

Notes: UNF:6:7kiCjvGhdnWnYzw+7UNWBQ==

death_date

f6544762 Location:

Variable Format: character

Notes: UNF:6:DSUU9uJr7AjqoqFGNoCtdg==

gender

f6544762 Location:

Variable Format: character

Notes: UNF:6:OmG0Lp+rXUGxZFMh4RsxMQ==

profession

f6544762 Location:

Variable Format: character

Notes: UNF:6:LO+lu+H9E+487wvSZS1Adw==

aristocracy

f6544762 Location:

Variable Format: character

Notes: UNF:6:n8C2DS34D52bBeTVI0I9DQ==

academic_title

f6544762 Location:

Variable Format: character

Notes: UNF:6:XnGRuIXICwD8EL/B4sLaIQ==

Other Study-Related Materials

Label:

contributions_simplified.csv

Text:

Notes:

text/csv

Other Study-Related Materials

Label:

speeches.csv

Text:

Notes:

text/csv

Other Study-Related Materials

Label:

contributions_extended.feather

Text:

Notes:

application/octet-stream

Other Study-Related Materials

Label:

contributions_simplified.feather

Text:

Notes:

application/octet-stream

Other Study-Related Materials

Label:

electoral_terms.feather

Text:

Notes:

application/octet-stream

Other Study-Related Materials

Label:

factions.feather

Text:

Notes:

application/octet-stream

Other Study-Related Materials

Label:

politicians.feather

Text:

Notes:

application/octet-stream

Other Study-Related Materials

Label:

speeches.feather

Text:

Notes:

application/octet-stream

Other Study-Related Materials

Label:

contributions_extended.pkl

Text:

Notes:

application/octet-stream

Other Study-Related Materials

Label:

contributions_simplified.pkl

Text:

Notes:

application/octet-stream

Other Study-Related Materials

Label:

electoral_terms.pkl

Text:

Notes:

application/octet-stream

Other Study-Related Materials

Label:

factions.pkl

Text:

Notes:

application/octet-stream

Other Study-Related Materials

Label:

politicians.pkl

Text:

Notes:

application/octet-stream

Other Study-Related Materials

Label:

speeches.pkl

Text:

Notes:

application/octet-stream

Other Study-Related Materials

Label:

contributions_extended.RDS

Text:

Notes:

application/gzip

Other Study-Related Materials

Label:

contributions_simplified.RDS

Text:

Notes:

application/gzip

Other Study-Related Materials

Label:

electoral_terms.RDS

Text:

Notes:

application/gzip

Other Study-Related Materials

Label:

factions.RDS

Text:

Notes:

application/gzip

Other Study-Related Materials

Label:

politicians.RDS

Text:

Notes:

application/gzip

Other Study-Related Materials

Label:

speeches.RDS

Text:

Notes:

application/gzip