IBM Watson™ Ideas

Welcome to the IBM Watson™ Ideas Portal


We welcome and appreciate your feedback on IBM Watson™ Products to help make them even better than they are today!


If you are looking for troubleshooting help or wondering how to use our products and services, please check the IBM Watson™ documentation. Please do not use the Ideas Portal for reporting bugs - we ask that you report bugs or issues with the product by contacting IBM support.


Before you submit an idea, please perform a search first as a similar idea may have already been reported in the portal.


If a related idea is not yet listed, please create a new idea and include with it a description which includes expected behavior as well as why having this feature would improve the service and how it would address your use case.

Ability to Split Documents At Ingest Time

It would be useful to be able to provide Discovery logic that would allow it to split one ingested document into multiple indexed documents.  For example, make every paragraph or page a single Discovery document.

  • Phil Anderson
  • Jun 5 2017
  • Shipped
  • Attach files
  • Senthil B commented
    June 30, 2017 07:01

    +1

  • Admin
    Phil Anderson commented
    June 30, 2017 12:29

    Hi Senthil, no need to type +1, just ensure you click the vote button, which actually gives this a plus one :)

  • Lalit Agarwalla commented
    July 20, 2017 15:31

    Along with splitting the document, there is also a need to have HTML version of the text in another field (can be made optional). When we split using Document Conversion as answer units, everything becomes plain text. So even if there is a table or list, it all becomes mixed up.

    Idea is to having a "html" field along with "text" field in the json, just like it is having when we upload html file.

     

     

  • Percy Shi commented
    July 25, 2017 04:46

    have we got some update on this requirement?

    thanks!

  • Admin
    Phil Anderson commented
    October 03, 2017 16:35

    This is now in Production (in beta)

  • Percy Shi commented
    October 03, 2017 16:41

    @James Anderson

    Do we have some document/info about this feature?

     

    thanks!

  • Admin
    Phil Anderson commented
    October 03, 2017 16:48

    Yes, you can read the docs here: https://console.bluemix.net/docs/services/discovery/building.html#doc-segmentation and the announcement here https://apps.na.collabserv.com/blogs/152f58a2-3bb3-4992-86a7-c56ad4bbd21c/entry/Document_Splitting_answer_units_Beta_Released?lang=en_us

  • Percy Shi commented
    October 03, 2017 16:53

    thanks, @James Anderson !

  • Guest commented
    December 12, 2017 05:37

    Is there an expectation to add configuring of splitting into the Discovery Tool?  So setting up splitting would happen in the UI rather than using the API...?