Seems to me, then, that Adobe would be the best free answer.
Smart Score can do what you're talking about (including automatically identifying/recognizing words in the scanned document. But, like I said, it ain't free and I can't find the demo.
Ransac, cpa trash man