Recently, Google updated its engine on many smartphones in order to add the new technology of Google Lens to its … Continue reading Google Lens for Syriac: Something Groundbreaking?
Recently, I have been experimenting to see how well Google Docs can transcribe and perform OCR on Syriac books in … Continue reading Brief Notes on OCR and the Automated Transcription of Syriac Books
This article presents the project of digitization of a large amount of handwritten material that is part of the Fondo Daniélou at Cini Foundation (San Giorgio, Venice). Continue reading Digitizing Indian Music
When we cannot find a digitized version on the internet, we photograph or scan a book or article ourselves. We … Continue reading ScanTailor: Installation Instructions and Impressions
Recently, as a result of my current research project —an edition and translation of al-Maqrizi’s fifteenth-century chronicle al-Suluk for the Ayyubid period … Continue reading Some Thoughts about Arabic-Script OCR
The Center for Open Data in the Humanities’ KuroNet Kuzushiji Ninshiki Sābisu (KuroNetくずし字認識サービス) launched late last year. KuroNet is a … Continue reading Cursive Japanese and OCR: Using KuroNet
This is a contribution by Christine Roughan of NYU. Connect with her on Twitter @cmroughan Over the summer of 2019, inspired by the promising results in articles like Romanov et al. 2017, I set out to use the … Continue reading Using Kraken to Train your own OCR Models