【记录】用Python从pdf文件中提取文字数据信息

PDFMiner is a tool for extracting information from PDF documents. Unlike other PDF-related tools, it focuses entirely on getting and analyzing text data. PDFMiner allows one to obtain the exact location of text in a page, as well as other information such as fonts or lines. It includes a PDF converter that can transform PDF files into other text formats (such as HTML). It has an extensible PDF parser that can be used for other purposes than text analysis.

有空就可以用其继续去折腾了。

转载请注明：在路上 » 【记录】用Python从pdf文件中提取文字数据信息

Post Views: 1,166

【记录】用Python从pdf文件中提取文字数据信息

What’s It?

与本文相关的文章

Hi，您需要填写昵称和邮箱！