Category Archives: Workflow

Cursive Japanese and OCR: Using KuroNet

The Center for Open Data in the Humanities’ KuroNet Kuzushiji Ninshiki Sābisu (KuroNetくずし字認識サービス) launched late last year. KuroNet is a free OCR (Optical Character Recognition) platform which allows users to convert images of documents written in cursive Japanese into printed

Cursive Japanese and OCR: Using KuroNet

The Center for Open Data in the Humanities’ KuroNet Kuzushiji Ninshiki Sābisu (KuroNetくずし字認識サービス) launched late last year. KuroNet is a free OCR (Optical Character Recognition) platform which allows users to convert images of documents written in cursive Japanese into printed

Photographing Archival Material at the Cadbury Research Library: Some Reflections

I am indebted to both the Cadbury Research Library at the University of Birmingham and the Church Mission Society for providing me permission to print the images used herein. In the summer of 2019, I spent one month at the

Photographing Archival Material at the Cadbury Research Library: Some Reflections

I am indebted to both the Cadbury Research Library at the University of Birmingham and the Church Mission Society for providing me permission to print the images used herein. In the summer of 2019, I spent one month at the

Using Kraken to Train your own OCR Models

This is a contribution by Christine Roughan of NYU. Connect with her on Twitter @cmroughan Over the summer of 2019, inspired by the promising results in articles like Romanov et al. 2017, I set out to use the Kraken OCR software on a variety of texts. Kraken, see their website or their repository, is open-source command line software that is capable

Using Kraken to Train your own OCR Models

This is a contribution by Christine Roughan of NYU. Connect with her on Twitter @cmroughan Over the summer of 2019, inspired by the promising results in articles like Romanov et al. 2017, I set out to use the Kraken OCR software on a variety of texts. Kraken, see their website or their repository, is open-source command line software that is capable

Making a Basic Textual Analysis program in Python

Whether we are involved in Japanese Studies or Islamic Studies, Near Eastern Studies or African Studies, we are all likely to interact with historical texts written in Romance and Germanic Languages, and for our research, we may want or need

Making a Basic Textual Analysis program in Python

Whether we are involved in Japanese Studies or Islamic Studies, Near Eastern Studies or African Studies, we are all likely to interact with historical texts written in Romance and Germanic Languages, and for our research, we may want or need

Creating a “Launcher” Extension for Google Chrome

Users of Google Chrome have long been able to customize their browsing experiences by installing extensions; small pieces of software that ‘enable users to tailor Chrome functionality and behaviour to individual needs or preferences’ (Google). Extensions are written in HTML,

Creating a “Launcher” Extension for Google Chrome

Users of Google Chrome have long been able to customize their browsing experiences by installing extensions; small pieces of software that ‘enable users to tailor Chrome functionality and behaviour to individual needs or preferences’ (Google). Extensions are written in HTML,

Google Translate with One Click (Mac)

Yes we all know; Google translator is best described as “quick and dirty”. Nonetheless, we all use it because it is very convenient and helpful. Just like the title suggests, this trick will save you the time of copying the

Google Translate with One Click (Mac)

Yes we all know; Google translator is best described as “quick and dirty”. Nonetheless, we all use it because it is very convenient and helpful. Just like the title suggests, this trick will save you the time of copying the

Announcing a Handbook for DH and Manuscript Studies

Millions of documents have been scanned and stored as images of pages. Now what? For the past 1,5 years I have been writing about this exact question. In a nutshell, I am preparing a handbook for bringing together Digital Humanities and

Announcing a Handbook for DH and Manuscript Studies

Millions of documents have been scanned and stored as images of pages. Now what? For the past 1,5 years I have been writing about this exact question. In a nutshell, I am preparing a handbook for bringing together Digital Humanities and