add note on how to use tika

This commit is contained in:
Uwe Steinmann 2021-07-20 16:31:44 +02:00
parent 048a9d8b82
commit b0804f5562

View File

@ -27,6 +27,12 @@ text/html
Many office formats
unoconv -d document -f txt --stdout '%s'
Apache Tika is another option for creating plain text from various document
types. Just use curl to send the document to your tika server and get the
plain text in return.
curl -s -T '%s' http://localhost:9998/tika --header 'Accept: text/plain'
Conversion to pdf for pdf preview
==================================