Jump to content

Stopwords and Other languages

Recommended Posts

It would be nice if you could define more than one "other language" in connection with stopwords: Other language 1 ;Other language 2 etc.

Link to post
Share on other sites
  • 1 month later...

What exactly do you mean here?

You can update stopwords in as many languages as you want (I'd recommend keeping a test list for each language in a multilingual project saved in a separate folder in the project). The catch is that you can only have the project set to one language at a time (and thus only be able to use one stopwords list).

I haven't had a chance to try yet, but I suspect you could also manually make a multilingual stopwords list for word frequency queries by combining two or more lists, as long as no more than one of the languages has no spaces between words (like Chinese or Japanese). When I use Japanese, it still can detect word boundaries for English words in the project because of spaces, but if I had Chinese as well, it wouldn't have both dictionaries simultaneously. This wouldn't work for autocoding themes because it wouldn't be able to parse noun phrases in multiple languages.

So if you mean being able to query multiple languages in a single project, there is definitely the alternating by changing the project language option. And there is likely a manually combined stopwords list option as well that would work at least for word frequency queries.

Link to post
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

  • Create New...

Important Information

Privacy Policy