Is there a way to extract all content from mupdf? For example the following code using the GetText() method will extract all text in html format:
using MuPDF.NET
var document = new Document("path-to-doc.pdf")
for (int i = 0; i < document.PageCount; i++) {
var htmlContent = page.GetText("html");
}
this will not necessairly include form fields, vector graphics e.t.c. How would i get all of these and their relative positions within the PDF?
发布者:admin,转转请注明出处:http://www.yc00.com/questions/1745117917a4612241.html
评论列表(0条)