How to use Apache Tika Server with password protected files? - Stack Overflow

I'm running Apache Tika Server in a docker container and trying to extract the text from PDFs cont

I'm running Apache Tika Server in a docker container and trying to extract the text from PDFs contained in a password protected ZIP file.

I've tried passing the password in the HTTP header as 'Password' and 'X-Tika-Password', however all it does is list the files in the ZIP folder without extracting the text.

If I remove the password from the ZIP file then it extracts the text from the PDFs perfectly.

I've tried this:

curl --location --request PUT '127.0.0.1:9998/tika' \
--header 'Accept: text/plain' \
--header 'Password: 123456' \
--header 'Content-Type: application/zip' \
--data-binary '@file/path/to.zip'

And just get back plain text with:

Name Of First File.pdf
Name of Second FIle.pdf

发布者:admin,转转请注明出处:http://www.yc00.com/questions/1745594513a4635028.html

相关推荐

发表回复

评论列表(0条)

  • 暂无评论

联系我们

400-800-8888

在线咨询: QQ交谈

邮件:admin@example.com

工作时间:周一至周五,9:30-18:30,节假日休息

关注微信