The part of SmartDoc that does the extraction is Tika, but SmartDoc does more than extract text. It might be overkill if text extraction is all you need, but if I were you I’d check it out. I am sure that you’ll find it very useful, and it can also serve you well in many other solutions.