Our systems are now restored following recent technical disruption, and we’re working hard to catch up on publishing. We apologise for the inconvenience caused. Find out more

Recommended product

Popular links

Popular links


Programming for Corpus Linguistics with Python and Dataframes

Programming for Corpus Linguistics with Python and Dataframes

Programming for Corpus Linguistics with Python and Dataframes

Daniel Keller , Western Kentucky University
June 2024
Available
Paperback
9781108822589

Looking for an examination copy?

This title is not currently available for examination. However, if you are interested in the title for your course we can consider offering an examination copy. To register your interest please contact [email protected] providing details of the course you are teaching.

    This Element offers intermediate or experienced programmers algorithms for Corpus Linguistic (CL) programming in the Python language using dataframes that provide a fast, efficient, intuitive set of methods for working with large, complex datasets such as corpora. This Element demonstrates principles of dataframe programming applied to CL analyses, as well as complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is presented including methods for tokenizing, part-of-speech tagging, and lemmatizing using spaCy. This Element provides a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.

    Product details

    June 2024
    Paperback
    9781108822589
    114 pages
    230 × 150 × 5 mm
    0.18kg
    Available

    Table of Contents

    • 1. Data frame corpora
    • 2. Python basics for corpus linguistics
    • 3. Working with data frames
    • 4. Algorithms for common corpus linguistic tasks
    • 5. Creating data frame corpora
    • 6. Conclusion
    • References.
    Resources for
    Type
    Keller - Supp Mat
    Size: 359.8 MB
    Type: application/zip
      Author
    • Daniel Keller , Western Kentucky University