«Arabic OCR»: الفرق بين المراجعتين

من ويكي عربآيز
اذهب إلى: تصفح، ابحث
(Current Status of Arabic OCR software)
(Software)
سطر 30: سطر 30:
 
===Software===
 
===Software===
 
* [http://www.irislink.com/c2-532/OCR-Software---Product-list.aspx Readiris] - Supports Arabic and Persian
 
* [http://www.irislink.com/c2-532/OCR-Software---Product-list.aspx Readiris] - Supports Arabic and Persian
* [http://www.novodynamics.com NovoDynamics VERUS] - Focuses on high-performance OCR and image enhancement for Arabic-based scripts, including Arabic, Persian, Pashto, Urdu.
+
* [http://www.novodynamics.com NovoDynamics VERUS] - High-performance Optical Character Recognition and image enhancement for Arabic-based scripts, including Farsi, Pashto, Urdu and Arabic OCR.
   
 
'''FOSS''' "no Arabic support yet"
 
'''FOSS''' "no Arabic support yet"

نسخة 19:40، 27 فبراير 2007

Optical Character Recognition

OCR is the ability to scan a document (or grab a PDF file) and run an OCR program on it and it will generate, based on optical recognition and approximation, an editable text file. For an idea about OCR see http://www.students.cs.uu.nl/people/mjkammer/Work/intro_2_OCR.html

Current Status of Open Source Arabic OCR software

I (MuhammadAlkarouri) know of no actually working Arabic OCR software that is open source. Any additions are certainly welcome.

Siragi-OCR

SIRAGI is an open source software designed to help blind and partially sighted people working with their computer. Visually impaired people can use this program to "listen" the content of their screen under windows or Linux/KDE. The main advantage of using SIRAGI is the support of arabic language for braille language and for speech synthesis in arabic.

As a part of This, Siragi's developer started devoloping a FOSS OCR that should support Arabic.

Siragi's Arabeyes Page

Resources

Arabic OCR Links

Papers

Software

  • Readiris - Supports Arabic and Persian
  • NovoDynamics VERUS - High-performance Optical Character Recognition and image enhancement for Arabic-based scripts, including Farsi, Pashto, Urdu and Arabic OCR.

FOSS "no Arabic support yet"

  • Tesseract is an open source OCR, initially developed by HP, and released under the Apache License.
  • OOCR OOCR is an OCR program still in development, under the GPL.
  • GOCR - included in Debian and other distributions.
  • GNU Ocrad "is an OCR [...] program based on a feature extraction method".

Other Links