IBM Watson™ Ideas

Welcome to the IBM Watson™ Ideas Portal

We welcome and appreciate your feedback on IBM Watson™ Products to help make them even better than they are today!

If you are looking for troubleshooting help or wondering how to use our products and services, please check the IBM Watson™ documentation. Please do not use the Ideas Portal for reporting bugs - we ask that you report bugs or issues with the product by contacting IBM support.

Before you submit an idea, please perform a search first as a similar idea may have already been reported in the portal.

If a related idea is not yet listed, please create a new idea and include with it a description which includes expected behavior as well as why having this feature would improve the service and how it would address your use case.

Stop words should not impact phrases in Discovery Query Language

WDS enables you to define your own set of stop words. We recently noticed that the stop words are removed from searches even when using phrases with the Discovery Language syntax. For example if we search for the phrase "we the people" and we and the are part of the stop words then they will be removed from the search. We understand stop words being ignored by the Natural Language syntax but when stop words are combined in a particular order within a phrase they can be very useful to find relevant content. We think that phrases should not be impacted by stop words. Phrases are meant to be considered as string literals and Discovery should not modify them.  

  • Guest
  • Mar 7 2019
  • Future Consideration
Why is it useful?
Who would benefit from this IDEA? As a customer I want to be able to search for phrases without being modified by the Discovey Language parser
How should it work?
Idea Priority
Priority Justification
Customer Name
Submitting Organization
Submitter Tags
  • Attach files
  • Michael McCawleuy commented
    8 Mar, 2019 12:43pm

    Actually, I don't think this is possible as written.  I'm posting as a customer, not as a Watson engineer here, but  stopwords are indextime directives, so this means these terms are removed from the document to simplify it BEFORE it's indexed.  If the words are gone, no fiddling with query interpretation could put them back.

    I totally agree with the need, though.  In technical or commerce domains, you often have search goals about products that have odd names, or technical jargon.  Finding Volvos with "i Drive" is impossible.

    Something better might be the Common Terms feature of elastic, such as described here:

    and then we can both remove stopwords from body, but enrich documents with common terms they should respond to in separate fields.  Would this work for you?