Description
The attached file was created by the following steps:
Download http://www.gutenberg.org/files/6688/6688-0.txt from Project Gutenberg
Discard the first 151 lines: tail -n +151 6688-0.txt > 6688-1.txt
Running org.apache.tika.detect.DefaultDetector on this file returns “video/quicktime.” I am unsure as to why.
f = org.apache.tika.io.TikaInputStream.get(java.io.FileInputStream(fullname));
detector = org.apache.tika.detect.DefaultDetector;
md = org.apache.tika.metadata.Metadata;
fulltype = string(detector.detect(f,md));