- Find tokens which are not part of a longer token sequence:
"tobacco" not in "tobacco product"
- All words in the string need to be found:
"Scottish Ministers make the following Regulations" all words
- Any word contained in the string need to be found:
"Scottish Ministers make the following Regulations" any word
- All strings need to be found:
{"Scottish Ministers", "make", "Regulations"} all
- Words are found in the specified order and results are returned if there are at most eight
words between tobacco and regulations.
'tobacco regulations' all words ordered distance at most 8 words
- The
occurs
keyword comes into play when more than one occurrence of a token is
to be found. Varius range modifiers are available: exactly
, at
least
, at most
, and from ... to ....
"act" occurs at least 4 times
- The keywords
ftand
, ftor
and ftnot
can also be used to
combine multiple query terms.
{ 'Medical', 'regulations' } all ordered distance at most 3 words
- The
window
keyword accepts those texts in which all keyword occur within the
specified number of tokens.
{ 'tobacco', 'regulations' } all window 7 words
- Sometimes it is interesting to only select texts in which all searched terms occur in the
same sentence
or paragraph
.
{ 'sale of tobacco', 'regulations' } all words same sentence
- If case is insensitive, no distinction is made between characters in upper and lower case. By
default, the option is
insensitive
; it can also be set to
sensitive
.
{ 'sale of tobacco', 'regulations' } all words using case sensitive same sentence
- If
stemming
is activated, words are shortened to a base form by a
language-specific stemmer.
"saling tobacco" all words using stemming same paragraph
- The wildcards option facilitates search operations similar to simple regular expressions:
.
matches a single arbitrary character.
.?
matches either zero or one character.
.*
matches zero or more characters.
.+
matches one or more characters.
.{min,max}
matches min–max number of characters.
"Regulations 2017 and come into force on .* April 2017" using wildcards