Skip to content

The Digital Orientalist

Practical examples and theoretical reflections on the do's and don'ts of using digital tools for your study and research in African and Asian Studies.

Primary Navigation

  • About
    • About Us
    • Team
    • Hall of Fame
    • Our Organizational Structure
    • Newsletter
  • Topics
    • African Studies
    • AI
    • Ancient Near Eastern Studies
    • Archiving
    • Between Legal and Illegal
    • Buddhist Studies
    • Central Asian Studies
    • Chinese Language
    • Coding
    • DH in General
    • DH in Practice
    • Digital Cartography
    • Digitization
    • Equipment
    • Events & Conferences
    • Hardware
    • Housekeeping
    • Indian Studies
    • Iranian Studies
    • Islamic Studies
    • Korean Studies
    • Japanese Studies
    • Mongolian Studies
    • OCR
    • Online Resources
    • Ottoman Studies
    • Sinology
    • Social Media
    • Software
    • South Asian Studies
    • Southeast Asian Studies
    • Syriac Studies
    • Tangut Studies
    • Teaching
    • Textual Analysis
    • Theory
    • Tibetan Studies
    • Turkic Studies
    • Visualization
    • Workflow
  • Submissions
    • Submission Guidelines
  • Publications
  • Conferences
    • 2025 – “AI and the Digital Humanities”
      • Titles and Abstracts
      • Conference Proceedings
    • 2023 – “Sustainability in the DH”
      • Conference Proceedings
    • 2022 – “Infrastructures”
      • Titles and Abstracts
    • 2021 – The Digital Orientalist’s Virtual Conference
      • Titles
    • 2020 – “Digital Orientalisms 2020”
    • 2019 – “Digital Orientalisms 2019”
  • Feedback and Inquiries
  • Donate
  • Search

Social Navigation

  • Facebook
  • Instagram
  • Bluesky
  • X
  • LinkedIn
  • YouTube

Category: OCR

Train Your Own OCR/HTR Models with Kraken, part 2
DH in Practice, Digitization, HTR, OCR, Online Resources, Software, Textual Analysis, Workflow

Train Your Own OCR/HTR Models with Kraken, part 2

Learn about Kraken’s segmentation model and the process of training our own custom segmentation models for layout analysis tasks. Continue reading Train Your Own OCR/HTR Models with Kraken, part 2

Current approaches on Automatic Recognition of Ethiopic script
African Languages, African Studies, DH in Practice, HTR, OCR, Online Resources, Textual Analysis

Current approaches on Automatic Recognition of Ethiopic script

The script used by the Christian cultures of Ethiopia is ancient, developing from the Sabaean script in the first centuries … Continue reading Current approaches on Automatic Recognition of Ethiopic script

Train Your Own OCR/HTR Models with Kraken, part 1
DH in Practice, Digitization, HTR, OCR, Online Resources, Software, Textual Analysis, Workflow

Train Your Own OCR/HTR Models with Kraken, part 1

How to train custom OCR/HTR models in Kraken Continue reading Train Your Own OCR/HTR Models with Kraken, part 1

eScriptorium: Digital Text Production for Urdu, Hindi, and Bengali Print, part 3
DH in Practice, Digitization, HTR, Indian Studies, OCR, Online Resources, Software, South Asian Studies, Workflow

eScriptorium: Digital Text Production for Urdu, Hindi, and Bengali Print, part 3

OCR of historical printing in Bengali using segmentation and recognition models trained in Kraken from an annotated dataset of Bengali texts published between 1860 and 1940. Continue reading eScriptorium: Digital Text Production for Urdu, Hindi, and Bengali Print, part 3

Ancient Near Eastern Studies, Biblical Studies, Coding, DH in General, DH in Practice, OCR, Textual Analysis, Theory, Visualization, Workflow

Accessibility of Texts and Tools in Ancient Studies: Reframing the Discussion

This is a guest post by Sarah Blake LaRose. As a scholar of biblical studies who is blind, I often … Continue reading Accessibility of Texts and Tools in Ancient Studies: Reframing the Discussion

eScriptorium: Digital Text Production for Urdu, Hindi, and Bengali Print, part 2
DH in Practice, Digitization, HTR, Indian Studies, Islamic Studies, OCR, Online Resources, Software, South Asian Studies, Workflow

eScriptorium: Digital Text Production for Urdu, Hindi, and Bengali Print, part 2

In part 1 of this series, I provided a quick introduction to eScriptorium and the workflow associated with it. This … Continue reading eScriptorium: Digital Text Production for Urdu, Hindi, and Bengali Print, part 2

<strong>The Toyo Bunko Archive: a source of joy and torment</strong>
Buddhist Studies, DH in General, Digitization, New Post, OCR, Online Resources

The Toyo Bunko Archive: a source of joy and torment

As promised previously, in this post I am leading you in a deep dive into a major digital archive I … Continue reading The Toyo Bunko Archive: a source of joy and torment

eScriptorium: Digital Text Production for Urdu, Hindi, and Bengali Print, part 1
DH in Practice, Digitization, HTR, Indian Studies, OCR, Online Resources, Software, South Asian Studies, Workflow

eScriptorium: Digital Text Production for Urdu, Hindi, and Bengali Print, part 1

State-of-the-art OCR engines use trainable models to perform two consecutive tasks that produce machine-actionable transcriptions. They first segment the position … Continue reading eScriptorium: Digital Text Production for Urdu, Hindi, and Bengali Print, part 1

The Japanese Diaspora in Digital Sources: The Hoji Shinbun Digital Collection
DH in General, Japanese Studies, OCR, Online Resources

The Japanese Diaspora in Digital Sources: The Hoji Shinbun Digital Collection

All scholars engaged in the study of the Japanese diaspora can profit from the treasure trove of resources on the … Continue reading The Japanese Diaspora in Digital Sources: The Hoji Shinbun Digital Collection

Practicing Reading Cursive Japanese with Miwo
Japanese Studies, OCR, Software, Teaching

Practicing Reading Cursive Japanese with Miwo

Most people involved in Japanese studies with access to a smartphone or tablet will be aware of the kuzushiji (cursive … Continue reading Practicing Reading Cursive Japanese with Miwo

Posts navigation

Older posts
Newer posts
Powered by WordPress.com.
The Digital Orientalist | ISSN: 2772-8374

Contact: digitalorientalist@gmail.com

The Digital Orientalist
Proudly powered by WordPress Theme: Publication.

Loading Comments...