Skip to main content
Home

Search

16 results found.
  • Automatic Text Recognition (ATR) - Layout Analysis

    EN
    Discover the subtleties of region and line segmentation and learn about the purpose of layout analysis for Automatic Text Recognition!
    Authors, editors, and contributors
    • Alix Chagué
    • Hugo Scheithauer
    • Anne Baillot
  • Understanding and Creating Word Embeddings

    EN
    Word embeddings allow you to analyze the usage of different terms in a corpus of texts by capturing information about their contextual usage. Through a primarily theoretical lens, this lesson will teach you how to prepare a corpus and train a word embedding model. You will explore how word vectors work, how to interpret them, and how to answer humanities research questions using them.
    Authors, editors, and contributors
    • Avery Blankenship
    • Sarah Connell
    • Quinn Dombrowski
  • Automatic Text Recognition (ATR) - Getting Started

    EN
    Kick off your journey into Automatic Text Recognition (ATR) with our introductory tutorial video. This is the first video of a tutorial series dedicated to extracting full text from scanned images.
    Authors, editors, and contributors
    • Ariane Pinche
    • Pauline Spychala
    • Anne Baillot
  • Clustering and Visualising Documents Using Word Embeddings

    EN
    This lesson uses word embeddings and clustering algorithms in Python to identify groups of similar documents in a corpus of approximately 9,000 academic abstracts. It will teach you the basics of dimensionality reduction for extracting structure from a large corpus and how to evaluate your results.
    Authors, editors, and contributors
    • Jonathan Reades
    • Jennie Williams
    • Alex Wermer-Colan
  • Creating Deep Convolutional Neural Networks for Image Classification

    EN
    This lesson provides a beginner-friendly introduction to convolutional neural networks (CNNs) for image classification. The tutorial provides a conceptual understanding of how neural networks work by using Google's Teachable Machine to train a model on paintings from the ArtUK database. This lesson also demonstrates how to use Javascript to embed the model in a live website.
    Authors, editors, and contributors
    • Nabeel Siddiqui
    • Scott Kleinman
  • Interrogating a National Narrative with GPT-2

    EN
    In this lesson, you will learn how to apply a Generative Pre-trained Transformer language model to a large-scale corpus so that you can locate broad themes and trends within written text.
    Authors, editors, and contributors
    • Chantal Brousseau
    • John R Ladd
    • Tiago Sousa Garcia
  • The CLS INFRA Survey of Methods in Computational Literary Studies

    EN
    This resource from the CLS INFRA project offers an introduction to several research areas and issues that are prominent withinComputational Literary Studies (CLS), including authorship attribution, literary history, literary genre, gender in literature, and canonicity/prestige, as well as to several key methodological concerns that are of importance when performing research in CLS.
    Authors, editors, and contributors
    • Christof Schöch
    • Julia Dudar
    • Evegniia Fileva
  • Computer Vision for the Humanities: An Introduction to Deep Learning for Image Classification (Part 2)

    EN
    This is the second of a two-part lesson introducing deep learning based computer vision methods for humanities research. This lesson digs deeper into the details of training a deep learning based computer vision model. It covers some challenges one may face due to the training data used and the importance of choosing an appropriate metric for your model. It presents some methods for evaluating the performance of a model.
    Authors, editors, and contributors
    • Daniel van Strien
    • Kaspar Beelen
    • Melvin Wevers
  • Word Embeddings

    EN
    Natural language processing is one of the most powerful concepts in modern linguistics and computer science, bridging the understanding of language from human to machine, and in turn programming machines so they can perform complex linguistic tasks on their own. This short video introduces learners to the key concepts of word embeddings and how they can be used in digital humanities projects.
    Authors, editors, and contributors
    • Joseph Flanagan
  • What 300-Dimensional Fridges Can Tell Us about Language

    EN
    In this lecture from the Austrian Centre for Digital Humanities and Cultural Heritage (ACDH-CH), Dirk Hovy gives an introduction to the method called embeddings, and showcases several applications of it. Hovy shows how they capture regional variation at an intra- and interlingual level, how they distinguish varieties and linguistic resources, and how they allow for the assessment of changing societal norms and associations.
    Authors, editors, and contributors
    • Dirk Hovy
    • Laura Still
    • Florian Wiencek