JPedal offers both automated tools and a GUI viewer for converting PDF to text. It handles the encoding issues for you and returns plain text, or XML with font information if required.
Text can be extracted from an entire PDF document, from a single PDF page, from a region defined by page co-ordinates, or from tables. PDF font information and PDF metadata can also be extracted. If a PDF contains text, JPedal can extract it.
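To make the idea of extracting text "from within page co-ordinates" concrete, here is a minimal Java sketch of the concept. It does not use JPedal's API: the `RegionTextSketch` class, the `TextFragment` record and the `textInRegion` method are hypothetical names invented for illustration, and the sketch assumes each piece of text has already been located on the page. It follows the standard PDF convention that the origin is at the bottom-left of the page and y increases upwards. Records require Java 16 or later.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

public class RegionTextSketch {

    /** Hypothetical fragment: a string plus the lower-left corner of its box, in PDF points. */
    record TextFragment(String text, double x, double y) {}

    /**
     * Returns the text of every fragment whose anchor point lies inside the
     * rectangle [x1, x2] x [y1, y2], ordered top to bottom, then left to right.
     */
    static String textInRegion(List<TextFragment> fragments,
                               double x1, double y1, double x2, double y2) {
        List<TextFragment> hits = new ArrayList<>();
        for (TextFragment f : fragments) {
            if (f.x() >= x1 && f.x() <= x2 && f.y() >= y1 && f.y() <= y2) {
                hits.add(f);
            }
        }
        // PDF y grows upwards, so sort on -y to put higher lines first.
        hits.sort(Comparator.comparingDouble((TextFragment f) -> -f.y())
                .thenComparingDouble(TextFragment::x));

        StringBuilder sb = new StringBuilder();
        for (TextFragment f : hits) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(f.text());
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        List<TextFragment> page = List.of(
                new TextFragment("Total:", 72, 100),   // near the bottom of the page
                new TextFragment("Invoice", 72, 750),  // near the top
                new TextFragment("42", 130, 750));

        // Select only the band near the top of the page.
        System.out.println(textInRegion(page, 50, 700, 300, 800)); // prints "Invoice 42"
    }
}
```

A real extractor also has to deal with fragment widths, rotated text and word spacing, which this sketch deliberately ignores.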
PDF text can be extracted as plain text or as XML that includes font, colour and spacing information. JPedal offers a large range of PDF-to-text examples.
JPedal also provides functions for extracting structured content when a PDF contains this optional metadata. For files that include it, this improves accessibility (for example, for US Section 508 compliance) and the presentation of the extracted data.