IDHMC Lead Programmer Matt Christy has been working with Gamera over the past few weeks, as we’ve chosen three open-source OCR engines to test our theory for improving the OCR of early modern texts. We hope you’ll visit the post for a look at the lessons we’ve learned and the progress we’ve made.
Especially illuminating is Matt’s description of Gamera, the software’s OCR toolkit, and the collaborative effort it has taken for eMOP to come to certain conclusions about the effectiveness of the software for large data sets.
Matt’s post can be found on the eMOP site at this address.
Make sure to bookmark the eMOP software page for current and future posts by the team! The IDHMC’s eMOP team will continuing to post in the coming weeks, as we approach some major milestones in our Mellon grant schedule.