IBM Watson™ Ideas

Welcome to the IBM Watson™ Ideas Portal


We welcome and appreciate your feedback on IBM Watson™ Products to help make them even better than they are today!


If you are looking for troubleshooting help or wondering how to use our products and services, please check the IBM Watson™ documentation. Please do not use the Ideas Portal for reporting bugs - we ask that you report bugs or issues with the product by contacting IBM support.


Before you submit an idea, please perform a search first as a similar idea may have already been reported in the portal.


If a related idea is not yet listed, please create a new idea and include with it a description which includes expected behavior as well as why having this feature would improve the service and how it would address your use case.

Support for additional DOCUMENT TYPES in Watson Discovery

We're dealing with many different document types and our ability to focus on Watson is limited but its support for only basic document/artifact types in Watson Discovery document injestion.

We encounter many but by percentage here are our top 10:

PDF, HTML, DOC(X), PPT(X)(S), XLS(X), JSON, TXT, RTF, CSV, EPUB, 

We have seen ODT, ODP, ODS, TEX and their relatives mostly when we encounter government clients as well.

While we don't expect Watson to specifically deal with ZIP files it would be nice to have a simple way to package and minimize the size/time/cost of the transfer of artifacts if possible along with other file compression formats.

Eventually we fully expect to encounter more and we want to minimize our efforts, costs and transforms in analyzing them through Watson along with potential for OCR.

  • Guest
  • Dec 12 2017
  • Planned
Why is it useful?
Who would benefit from this IDEA? As a user of analysis tools I would gain insight into a broader range of document formats
How should it work?
Idea Priority
Priority Justification
Customer Name
Submitting Organization
Submitter Tags
  • Attach files
  • Guest commented
    December 12, 2017 05:34

    The most annoying one is that TEXT files are not a supported document type, considering how those are the easiest files to read.

    My customer has these unsupported formats:

    Plain text (txt)

    MS Outlook messages and templates (msg, olt)

    Excel spreadsheets (xlsx, xls)

  • Vijay Gupta commented
    May 16, 2018 15:56

    VMWare is looking to zip up all their JSON files and upload it to discovery. They don't have to deal with all the issues of uploading, retrying, keeping track of the individual files that need to be done otherwise with a sophisticated scripts. Customer Request.