By Chris A. Mattmann and Jukka L. Zitting
In this article, based on chapter 4 of Tika in Action, the authors discuss the Shared MIME-info Database specification, which, among other things, defines an XML format for media type information.