From Big Data to Argument Analysis and Automated Extraction: A Selective Study of Argument in the Philosophy of Animal Psychology from the Volumes of the Hathi Trust Collection

Project report


McAlister, Simon, Allen, Colin, Ravenscroft, A., Reed, Chris, Bourget, David, Lawrence, John, Börner, Katy and Light, Robert 2014. From Big Data to Argument Analysis and Automated Extraction: A Selective Study of Argument in the Philosophy of Animal Psychology from the Volumes of the Hathi Trust Collection. Digging by Debating.
AuthorsMcAlister, Simon, Allen, Colin, Ravenscroft, A., Reed, Chris, Bourget, David, Lawrence, John, Börner, Katy and Light, Robert
TypeProject report
Abstract

The Digging by Debating (DbyD) project aimed to identify, extract, model, map and visualise philosophical arguments in very large text repositories such as the Hathi Trust. The project has: 1) developed a method for visualizing points of contact between philosophy and the sciences; 2) used topic modeling to identify the volumes, and pages within those volumes, which are ‘rich’ in a chosen topic; 3) used a semiformal discourse analysis technique to
manually identify key arguments in the selected pages; 4) used the OVA argument mapping tool to represent and map the key identified arguments and provide a framework for
comparative analysis; 5) devised and used a novel analysis framework applied to the mapped arguments covering role, content and source of propositions, and the importance, context and meaning of arguments; 6) created a prototype tool for identifying propositions, using naive Bayes classifiers, and for identifying argument structure in chosen texts, using propositional similarity; 7) created tools to apply topic modeling to tasks of rating similarity of papers in the PhilPapers repository. The methods from 1 to 5 above, have enabled us to locate and extract the key arguments from each text. It is significant that, in applying the methods, a nonexpert with limited or no domain knowledge of philosophy has both identified the volumes of interest from a key ‘Big Data Set’ (Hathi Trust) AND identified key arguments within these texts. This provided several key insights about the nature and form of arguments
in historical texts, and is a proofofconcept design for a tool that will be usable by scholars. We have further created a dataset with which to train and test prototype tools for both proposition and argument extraction. Though at an early stage, these preliminary results are
promising given the complexity of the task. Specifically, we have prototyped a set of tools and methods that allow scholars to move between macroscale, global views of the distributions of philosophical themes in such repositories, and microscale analyses of the arguments appearing on specific pages in texts belonging to the repository. Our approach spans bibliographic analysis, science mapping, and LDA topic modeling conducted at Indiana University and machineassisted argument markup into Argument Interchange Format (AIF) using the OVA (Online Visualization of Argument) tool from the University of Dundee, where the latter has been used to analyse and represent arguments by the team based at the University of East London, who also performed a detailed empirical analysis of arguments in selected texts. This work has been articulated as a proof of concept tool – linked to the repository PhilPapers – designed by members linked to the University of London. This project is showing for the first time how big data text processing techniques can be combined with deep structural analysis to provide researchers and students with navigation and interaction tools for engaging with the large and
rich resources provided by datasets such as the Hathi Trust and PhilPapers. Ultimately our efforts show how the computational humanities can bridge the gulf between the “big data” perspective of firstgeneration digital humanities and the close readings of text that are the
“bread and butter” of more traditional scholarship in the humanities.

Year2014
PublisherDigging by Debating
Web address (URL)http://diggingbydebating.org/wp-content/uploads/2014/04/DiggingbyDebating-FinalReport2.pdf
Publication dates
PrintApr 2014
Publication process dates
Deposited11 Oct 2017
Copyright information© 2014 The authors
Publisher's version
Permalink -

https://repository.uel.ac.uk/item/85q66

Download files

  • 278
    total views
  • 147
    total downloads
  • 7
    views this month
  • 2
    downloads this month

Export as

Related outputs

New Deal for Young People Mentoring Research Report
Sharpe, D., Canitrot, D., Morocza, N., Ravenscroft, A., Mesa, L. P. G., Hanafiah, A. and Narayan, V. 2023. New Deal for Young People Mentoring Research Report. Institute for Connected Communities, University of East London.
Education 4.0: Is Characterising and Harmonising Intelligences a Way of Thinking about a Pedagogy 4.0 for Higher Education?
Bunce, M., Ravenscroft, A. and Richards, P. 2022. Education 4.0: Is Characterising and Harmonising Intelligences a Way of Thinking about a Pedagogy 4.0 for Higher Education? in: Malloch, M., Cairns, L., Evans, k. and O'Connor, B. N. (ed.) The SAGE Handbook of Learning and Work SAGE Publications. pp. 653-669
Addressing the Safety and Criminal Exploitation of Vulnerable Young People: Before, During and After COVID-19 and Lockdown
Ravenscroft, A., Salisbury, C., Voela, A. and Watts, P. 2021. Addressing the Safety and Criminal Exploitation of Vulnerable Young People: Before, During and After COVID-19 and Lockdown. in: Ellis, D. and Voela, A. (ed.) After Lockdown, Opening Up: Psychosocial Transformation in the Wake of COVID-19 Palgrave Macmillan. pp. 151-171
Participatory internet radio (RadioActive101) as a social innovation and co-production methodology for engagement and non-formal learning amongst socially excluded young people
Ravenscroft, A. 2020. Participatory internet radio (RadioActive101) as a social innovation and co-production methodology for engagement and non-formal learning amongst socially excluded young people. International Journal of Inclusive Education. 26 (6), pp. 541-558. https://doi.org/10.1080/13603116.2019.1700312
Finding and Interpreting Arguments: An Important Challenge for Humanities Computing and Scholarly Practice
Ravenscroft, A. and Allen, C. 2019. Finding and Interpreting Arguments: An Important Challenge for Humanities Computing and Scholarly Practice. Digital Humanities Quarterly. 13 (4).
Innovative psychoeducation interventions for ‘at-risk’ and socially excluded young people
Ravenscroft, A. 2019. Innovative psychoeducation interventions for ‘at-risk’ and socially excluded young people. Researching Education and Mental Health: Where are we now?. University of West London, UK. 12 Jul 2019 BERA: British Educational Research Association.
Politics, Public Pedagogy and Action: Beyond a Pedagogy of Hope
Ravenscroft, A. and Maisuria, A. 2018. Politics, Public Pedagogy and Action: Beyond a Pedagogy of Hope. Chrysochou, Polina and Hill, Dave (ed.) The 8th International Conference on Critical Education. Stratford, London, UK 25 - 28 Jul 2018 International Conference on Critical Education.
International Participatory Radio for the Inclusion and Non-formal Learning of Socially Excluded Young People
Ravenscroft, A., Dellow, J., Brites, Maria José, Jorge, Ana and Catalão, Daniel 2018. International Participatory Radio for the Inclusion and Non-formal Learning of Socially Excluded Young People. BERA Annual Conference 2018. Newcastle, UK 11 - 13 Sep 2018 British Educational Research Association.
RadioActive101-Learning through radio, learning for life: an international approach to the inclusion and non-formal learning of socially excluded young people
Ravenscroft, A., Dellow, J., Brites, M. J., Jorge, A. and Catalão, D. 2018. RadioActive101-Learning through radio, learning for life: an international approach to the inclusion and non-formal learning of socially excluded young people. International Journal of Inclusive Education. 24 (9), pp. 997-1018. https://doi.org/10.1080/13603116.2018.1503739
Multi-level computational methods for interdisciplinary research in the HathiTrust Digital Library
Podobnik, Boris, Murdock, Jaimie, Allen, Colin, Börner, Katy, Light, Robert, McAlister, Simon, Ravenscroft, A., Rose, Robert, Rose, Doori, Otsuka, Jun, Bourget, David, Lawrence, John and Reed, Chris 2017. Multi-level computational methods for interdisciplinary research in the HathiTrust Digital Library. PLoS ONE. 12 (9), p. e0184188. https://doi.org/10.1371/journal.pone.0184188
Engagement before Ownership: Reflections on Participatory Radio as a Learning Intervention with Disenfranchised Groups
Ravenscroft, A., Rainey, C. and Dellow, J. 2016. Engagement before Ownership: Reflections on Participatory Radio as a Learning Intervention with Disenfranchised Groups. in: Proceedings of Online Educa Berlin (OEB) ICWE GmbH. pp. 124-127
Metodologias Participativas: Os media e a educação Participatory Methodologies: Media and education
Ravenscroft, A., Rainey, C., Brites, Maria José, Santos, Sílvio Correia, Dahn, Ingo and Dellow, J. 2015. Metodologias Participativas: Os media e a educação Participatory Methodologies: Media and education. in: Brites, Maria José, Santos, Sílvio Correia and Jorge, Ana (ed.) Participatory Methodologies: Media and education Covilhã, Portugal Livros LabCom. pp. 37-45
RadioActive101 Practices
Brites, Maria José, Ravenscroft, A., Dellow, J., Rainey, C., Jorge, Ana, Santos, Sílvio Correia, Rees, Angela, Auwärter, Andreas, Catalão, Daniel, Balica, Magdalena and Camilleri, Anthony F. 2015. RadioActive101 Practices. Porto, Portugal ePublished.
RadioActive Europe: promoting engagement, informal learning and employability of at risk and excluded people across Europe through internet radio and social media (RadioActive101)
Ravenscroft, A., Rainey, C., Dellow, J., Brites, Maria José, Auwärter, Andreas, Balica, Magdalena, Rees, Angela, Camilleri, Anthony F., Jorge, Ana, Dahn, Ingo and Fenech, Justin 2015. RadioActive Europe: promoting engagement, informal learning and employability of at risk and excluded people across Europe through internet radio and social media (RadioActive101). Education, Audiovisual & Culture Executive Agency and European Commission.
Deep Learning Design for Social Innovation: Participatory Radio for Developing 21C Skills with Disenfranchised Learners
Ravenscroft, A., Rainey, C. and Dellow, J. 2015. Deep Learning Design for Social Innovation: Participatory Radio for Developing 21C Skills with Disenfranchised Learners. in: Garreta-Domingo, Muriel, Sloep, Peter, Stoyanov, Slavi, Hernández-Leo, Davinia and Mor, Yishay (ed.) Proceedings of the workshop “Design for Learning in Practice”, EC-TEL, Toledo, Sept. 18, 2015 Heerlen, Nederland HANDS-ON. pp. 23-28
RadioActive101: Using internet radio to break-down the boundaries for inclusion into smart cities
Ravenscroft, A., Rainey, C., Brites, Maria José, Correia Santos, Silvio and Dellow, J. 2014. RadioActive101: Using internet radio to break-down the boundaries for inclusion into smart cities. International Workshop on Smart City Learning, European Conference on Technology Enhanced Learning 2014. Graz, Austria 16 Sep 2014
RadioActive101: Implementation and Evaluation of Internet Radio as an Educational Intervention for Inclusion, Informal Learning and Employability, as part of Addressing Inequality: Is ICT a Silver Bullet?
Ravenscroft, A. 2014. RadioActive101: Implementation and Evaluation of Internet Radio as an Educational Intervention for Inclusion, Informal Learning and Employability, as part of Addressing Inequality: Is ICT a Silver Bullet? Online-Educa. Berlin 02 - 04 Dec 2014
Reflections on the acceptance and success of RadioActive101: Motivation through problematisation, improved well-being,emancipation and extreme learning
Ravenscroft, A., Rainey, C., Brites, Maria José, Correia, Silvio Santos, Catalão, Daniel, Dahn, Ingo and Dellow, J. 2015. Reflections on the acceptance and success of RadioActive101: Motivation through problematisation, improved well-being,emancipation and extreme learning. in: Holocher-Ertl, Teresa, Kunzmann, Christine, Müller, Lars, Rivera-Pelayo, Verónica, Schmidt, Andreas P. and Wolf, Carmen (ed.) Motivational and Affective Aspects in Technology Enhanced Learning (MATEL) : Proceedings of the MATEL Workshop 2013-2014 Karlsruhe, Germany Karlsruhe Institute of Technology.
Implementing and Evaluating the ‘space' of Participatory Radio as an Educational Intervention for Inclusion, Self-efficacy and Informal Learning
Ravenscroft, A., Balica, Magdelena, Fartusnic, Ciprian and Rainey, C. 2015. Implementing and Evaluating the ‘space' of Participatory Radio as an Educational Intervention for Inclusion, Self-efficacy and Informal Learning. British Educational Research Association’s Annual Conference. Belfast, UK 15 - 17 Sep 2015 pp. 1-1
RadioActive: inclusive informal learning and employability through international internet radio
Ravenscroft, A. 2013. RadioActive: inclusive informal learning and employability through international internet radio. UEL Research and Knowledge Exchange Conference 2013. University of East London, London 26 Jun 2013 London University of East London.