10.26180/5c844c7a81768 Gede Primahadi Wijaya Rajeg Gede Primahadi Wijaya Rajeg R codes and dataset for <i>Visualisation of Diachronic Constructional Change using Motion Chart</i> Monash University 2019 linguistic motion charts motion charts GoogleVis constructional change diachronic corpus linguistics data visualisations data visualizations data visualisation Corpus of Historical American English COHA R programming language reshape2 English future constructions collocational change Programming Languages Digital Humanities Data Communications English Language Language Studies not elsewhere classified Language in Time and Space (incl. Historical Linguistics, Dialectology) Natural Language Processing Linguistic Structures (incl. Grammar, Phonology, Lexicon, Semantics) Computational Linguistics Linguistics not elsewhere classified Linguistics 2019-03-10 06:18:48 Dataset https://bridges.monash.edu/articles/dataset/R_codes_and_dataset_for_i_Visualisation_of_Diachronic_Constructional_Change_using_Motion_Chart_i_/7823807 <b>Publication</b><div><b><br></b><div>Primahadi Wijaya R., Gede. 2014. Visualisation of diachronic constructional change using Motion Chart. In Zane Goebel, J. Herudjati Purwoko, Suharno, M. Suryadi & Yusuf Al Aried (eds.). <i>Proceedings: International Seminar on Language Maintenance and Shift IV (LAMAS IV)</i>, 267-270. Semarang: Universitas Diponegoro. doi: <a href="https://doi.org/10.4225/03/58f5c23dd8387">https://doi.org/10.4225/03/58f5c23dd8387</a></div></div><div><br></div><div><b>Description of R codes and data files in the repository</b></div><div><b><br></b></div><div>This repository is imported from its <a href="https://github.com/gederajeg/motion-charts-futurate">GitHub repo</a>. Versioning of this figshare repository is associated with the GitHub repo's <a href="https://github.com/gederajeg/motion-charts-futurate/releases">Release</a>. 
So, check the <a href="https://github.com/gederajeg/motion-charts-futurate/releases">Releases</a> page for updates (the next version is planned to include a unified version of the code from the first release, written with the <a href="https://www.tidyverse.org">tidyverse</a>).</div><div><br></div><div>The raw input data consist of two files: <code><a href="https://github.com/gederajeg/motion-charts-futurate/blob/master/will_INF.txt">will_INF.txt</a></code> and <code><a href="https://github.com/gederajeg/motion-charts-futurate/blob/master/go_INF.txt">go_INF.txt</a></code>. They contain the co-occurrence frequencies of the top-200 infinitival collocates for <i>will</i> and <i>be going to</i> respectively across the twenty decades of the <a href="https://www.english-corpora.org/coha/">Corpus of Historical American English</a> (from the 1810s to the 2000s).</div><div><br></div><div>These two input files are used in the R code file <code><a href="https://github.com/gederajeg/motion-charts-futurate/blob/master/1-script-create-input-data-raw.r">1-script-create-input-data-raw.r</a></code>. The code preprocesses and combines the two files into a long-format data frame consisting of the following columns: (i) <code>decade</code>, (ii) <code>coll</code> (for "collocate"), (iii) <code>BE going to</code> (for the frequency of the collocates with <i>be going to</i>) and (iv) <code>will</code> (for the frequency of the collocates with <i>will</i>); the resulting data frame is available in <code><a href="https://github.com/gederajeg/motion-charts-futurate/blob/master/input_data_raw.txt">input_data_raw.txt</a></code>. 
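<div><br></div><div>The combining step described above can be sketched in base R as follows. This is only an illustration, not the code in <code>1-script-create-input-data-raw.r</code>: the toy tables, their assumed layout (one row per collocate, one column per decade), the frequencies and the helper <code>to_long()</code> are all hypothetical, and the real input files may be organised differently.</div>

```r
# Toy stand-ins for will_INF.txt and go_INF.txt.
# ASSUMPTION: one row per collocate, one column per decade; values are invented.
will_wide <- data.frame(coll = c("be", "have"),
                        `1810` = c(120, 40),
                        `1820` = c(150, 55),
                        check.names = FALSE)
go_wide <- data.frame(coll = c("be", "have"),
                      `1810` = c(2, 0),
                      `1820` = c(5, 1),
                      check.names = FALSE)

# Hypothetical helper: stack the decade columns into (decade, coll, value) rows.
to_long <- function(wide, value_name) {
  decades <- setdiff(names(wide), "coll")
  long <- data.frame(
    decade = rep(decades, each = nrow(wide)),
    coll = rep(wide$coll, times = length(decades)),
    freq = unlist(wide[decades], use.names = FALSE),
    stringsAsFactors = FALSE
  )
  names(long)[names(long) == "freq"] <- value_name
  long
}

# Join the two long tables on decade and collocate, giving the four columns
# named in the description: decade, coll, "BE going to", will.
input_data_raw <- merge(to_long(go_wide, "BE going to"),
                        to_long(will_wide, "will"),
                        by = c("decade", "coll"))
print(input_data_raw)
```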
</div><div><br></div><div>Then, the script <code><a href="https://github.com/gederajeg/motion-charts-futurate/blob/master/2-script-create-motion-chart-input-data.R">2-script-create-motion-chart-input-data.R</a></code> reads <code>input_data_raw.txt</code> and normalises the co-occurrence frequencies of the collocates to frequencies per million words (the COHA size and normalising base frequency are available in <code><a href="https://github.com/gederajeg/motion-charts-futurate/blob/master/coha_size.txt">coha_size.txt</a></code>). The output from the second script is <code><a href="https://github.com/gederajeg/motion-charts-futurate/blob/master/input_data_futurate.txt">input_data_futurate.txt</a></code>.</div><div><br></div><div>Finally, <code>input_data_futurate.txt</code> is the input for generating (i) the static motion chart that appears as an image <a href="https://monash.figshare.com/articles/Visualisation_of_Diachronic_Constructional_Change_Using_Motion_Chart/4884254/2">plot in the publication</a> (using the script <code><a href="https://github.com/gederajeg/motion-charts-futurate/blob/master/3-script-create-motion-chart-plot.R">3-script-create-motion-chart-plot.R</a></code>), and (ii) the <a href="https://primahadiwijaya.blogspot.com/2014/09/motion-chart-for-futurate-constructions.html">dynamic motion chart</a> (using the script <code><a href="https://github.com/gederajeg/motion-charts-futurate/blob/master/4-script-motion-chart-dynamic.R">4-script-motion-chart-dynamic.R</a></code>).</div><div><br></div><div>The repository adopts the <a href="https://www.tidyverse.org/articles/2017/12/workflow-vs-script/">project-oriented workflow</a> in RStudio; double-click on the <code>Future Constructions.Rproj</code> file to open an RStudio session whose working directory is set to the root of this repository.</div>
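<div><br></div><div>The per-million-word normalisation mentioned for the second script can be illustrated with a minimal sketch. It assumes the usual formula (raw frequency divided by corpus size, multiplied by one million) and does not reproduce the repository's script. The corpus sizes and frequencies below are invented for the example and are not the real COHA figures from <code>coha_size.txt</code>; the column names <code>size</code> and <code>will_pmw</code> are likewise hypothetical.</div>

```r
# HYPOTHETICAL corpus sizes (words per decade); not the real COHA values.
coha_size <- data.frame(decade = c("1810", "1820"),
                        size = c(1000000, 5000000),
                        stringsAsFactors = FALSE)

# Toy raw frequencies of one collocate with "will" in two decades (invented).
raw <- data.frame(decade = c("1810", "1820"),
                  coll = c("be", "be"),
                  will = c(120, 150),
                  stringsAsFactors = FALSE)

# Attach each decade's corpus size, then scale to a per-million-word rate.
norm <- merge(raw, coha_size, by = "decade")
norm$will_pmw <- norm$will / norm$size * 1e6
print(norm)
```

<div>In this toy data the raw count rises from 120 to 150, but the normalised rate falls because the second decade's corpus is five times larger, which is why the charts are built on normalised rather than raw frequencies.</div>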