Skip to content

The Digital Orientalist

Practical examples and theoretical reflections on the do's and don'ts of using digital tools for your study and research in African and Asian Studies.

Primary Navigation

  • About
    • About The Digital Orientalist
    • Team
    • Hall of Fame
    • Newsletter
  • Topics
    • African Studies
    • African Languages
    • Ancient Near Eastern Studies
    • Archiving
    • Between Legal and Illegal
    • Buddhist Studies
    • Chinese Language
    • Coding
    • DH in General
    • DH in Practice
    • Digital Cartography
    • Digitization
    • Equipment
    • Events & Conferences
    • Hardware
    • Housekeeping
    • Indian Studies
    • Islamic Studies
    • Iranian Studies
    • Islamic Languages
    • Korean Studies
    • Japanese Studies
    • Mongolian Studies
    • OCR
    • Online Resources
    • Ottoman Studies
    • Sinology
    • Social Media
    • Software
    • Syriac Studies
    • Teaching
    • Textual Analysis
    • Theory
    • Using Real Paper
    • Visualization
    • Workflow
  • Submissions
    • Submission Guidelines
  • Publications
  • The Digital Orientalist’s Conferences
    • 2025 – “AI and the Digital Humanities”
      • Titles and Abstracts
      • Conference Proceedings
    • 2023 – “Sustainability in the DH”
      • Conference Proceedings
    • 2022 – “Infrastructures”
      • Titles and Abstracts
    • 2021 – The Digital Orientalist’s Virtual Conference
      • Titles
    • 2020 – “Digital Orientalisms 2020”
  • Donate
  • Search
  • ISSN: 2772-8374

Social Navigation

  • X
  • Facebook
  • Instagram
  • YouTube
  • BlueSky
  • LinkedIn

Category: OCR

Creating the largest Juren Dataset with ChatGPT: A Journey through Digital Humanities. (Part Two)
AI, DH in Practice, New Post, OCR, Online Resources, Sinology

Creating the largest Juren Dataset with ChatGPT: A Journey through Digital Humanities. (Part Two)

This is a guest post by Jiajun Zou. See bio at the end of this post. Find part one here. Part … Continue reading Creating the largest Juren Dataset with ChatGPT: A Journey through Digital Humanities. (Part Two)

Creating the largest Juren Dataset with ChatGPT: A Journey through Digital Humanities. (Part One)
AI, DH in Practice, New Post, OCR, Sinology, Visualization, Workflow

Creating the largest Juren Dataset with ChatGPT: A Journey through Digital Humanities. (Part One)

This is a guest post by Jiajun Zou. See bio at the end of this post. Part I Tianyige Ming … Continue reading Creating the largest Juren Dataset with ChatGPT: A Journey through Digital Humanities. (Part One)

An Experiment with Gemini Pro LLM for Chinese OCR and Metadata Extraction
AI, Chinese Language, Digitization, New Post, OCR

An Experiment with Gemini Pro LLM for Chinese OCR and Metadata Extraction

This is a guest post by Eric H. C. Chow. For more information, see at the end of this post. … Continue reading An Experiment with Gemini Pro LLM for Chinese OCR and Metadata Extraction

An Interview on DASH: Digital Analysis of Syriac Handwriting
Apps, DH in Practice, HTR, OCR, Online Resources, Syriac Studies, Textual Analysis

An Interview on DASH: Digital Analysis of Syriac Handwriting

In previous years I have posted some interviews with different projects to trace the history of the Syriac Digital Humanities. … Continue reading An Interview on DASH: Digital Analysis of Syriac Handwriting

The Digitization of a Large Latin-Chinese Dictionary
Chinese Language, DH in Practice, Digitization, OCR, Software

The Digitization of a Large Latin-Chinese Dictionary

This is a guest post by Christopher Francese, Asbury J. Clarke Professor of Classical Studies, Dickinson College francese@dickinson.edu In 2016 … Continue reading The Digitization of a Large Latin-Chinese Dictionary

Transkribus in the Classroom
Apps, DH in Practice, HTR, OCR, Online Resources, Teaching, Textual Analysis

Transkribus in the Classroom

While others have discussed the possibilities in utilizing Transkribus and other HTR resources in research, I want to briefly discuss … Continue reading Transkribus in the Classroom

Train Your Own OCR/HTR Models with Kraken, part 2
DH in Practice, Digitization, HTR, OCR, Online Resources, Software, Textual Analysis, Workflow

Train Your Own OCR/HTR Models with Kraken, part 2

Learn about Kraken’s segmentation model and the process of training our own custom segmentation models for layout analysis tasks. Continue reading Train Your Own OCR/HTR Models with Kraken, part 2

Current approaches on Automatic Recognition of Ethiopic script
African Languages, African Studies, DH in Practice, HTR, OCR, Online Resources, Textual Analysis

Current approaches on Automatic Recognition of Ethiopic script

The script used by the Christian cultures of Ethiopia is ancient, developing from the Sabaean script in the first centuries … Continue reading Current approaches on Automatic Recognition of Ethiopic script

Train Your Own OCR/HTR Models with Kraken, part 1
DH in Practice, Digitization, HTR, OCR, Online Resources, Software, Textual Analysis, Workflow

Train Your Own OCR/HTR Models with Kraken, part 1

How to train custom OCR/HTR models in Kraken Continue reading Train Your Own OCR/HTR Models with Kraken, part 1

eScriptorium: Digital Text Production for Urdu, Hindi, and Bengali Print, part 3
DH in Practice, Digitization, HTR, Indian Studies, OCR, Online Resources, Software, South Asian Studies, Workflow

eScriptorium: Digital Text Production for Urdu, Hindi, and Bengali Print, part 3

OCR of historical printing in Bengali using segmentation and recognition models trained in Kraken from an annotated dataset of Bengali texts published between 1860 and 1940. Continue reading eScriptorium: Digital Text Production for Urdu, Hindi, and Bengali Print, part 3

Posts navigation

Older posts
Newer posts
Website Powered by WordPress.com.
The Digital Orientalist
Website Powered by WordPress.com.
  • Subscribe Subscribed
    • The Digital Orientalist
    • Join 334 other subscribers
    • Already have a WordPress.com account? Log in now.
    • The Digital Orientalist
    • Subscribe Subscribed
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar
 

Loading Comments...