
Sorin, I tried it out and found the indexed terms improved as expected. Also confirmed that the user dictionary was loaded, which prevented the critical words from being missed. A simple but rather critical issue in the user experience at this moment: nwSearchFnt.js expects 3 or more characters, while the Japanese words usually consist of 2 or more characters. As a quick fix for testing purposes, I reduced the following threshold to 1 when indexerLanguage=='ja'.
if (finalArray[x].length > 2 || useCJKTokenizing){
if (word.length > 2) {
There are a few other minor things that could be improved; I'll report them in due course. The experience as a whole is very promising. Thank you so much! Thanks, T. Hatanaka -----Original Message----- From: oxygen-user-bounces@oxygenxml.com [mailto:oxygen-user-bounces@oxygenxml.com] On Behalf Of Support Oxygen XML Editor (Sorin Ristache) Sent: Thursday, April 30, 2015 3:15 PM To: T. Hatanaka; oXygen-user@oxygenxml.com Subject: Re: [oXygen-user] Feature request: Improvement of Japanese search for WebHelp Hi, Please test the search in WebHelp pages Japanese content with the following Oxygen 17.0 beta build: http://www.oxygenxml.com/userFiles/oxygen-AllPlatforms-17.0-beta-build-20150... You have to set the Japanese language in your DITA map (by adding the xml:lang="ja-jp" parameter) or in the args.default.language parameter and also follow the instructions at the end of the listing with DITA transformation output messages for adding the Kuromoji analyzer runtime file. After you add the runtime file the DITA transformation will detect it and call it to index the Japanese content. The search terms should be separated by spaces as in European languages but I think you already know that better than us because you are the one who suggested we can assume that for a Japanese search string entered in a web page. A Japanese user dictionary for the Kuromoji analyzer is optional and can be set in the webhelp.search.japanese.dictionary parameter. Best regards, Sorin <oXygen/> XML Editor http://www.oxygenxml.com On 4/30/2015 8:47 AM, T. Hatanaka wrote:
Super! It'll be a great bonus to 17.0 whose beta itself is already amazing. I'll definitely test it out as soon as released. If I can help in any way at this moment, please let me know.
Thanks, T. Hatanaka
-----Original Message----- From: oxygen-user-bounces@oxygenxml.com [mailto:oxygen-user-bounces@oxygenxml.com] On Behalf Of Sorin Ristache Sent: Tuesday, April 28, 2015 11:16 PM To: T. Hatanaka; oXygen-user@oxygenxml.com Subject: Re: [oXygen-user] Feature request: Improvement of Japanese search for WebHelp
Hi,
We implemented indexing of Japanese content in the WebHelp pages with the Kuromoji analyzer. It will go in the upcoming Oxygen 17.0 release. The Kuromoji analyzer will not be included in the Oxygen install kit, but a detailed INFO message in the DITA transformation console view will tell you where you can download the analyzer runtime file and where you have to copy it in the Oxygen install directory in order to enable indexing of Japanese content.
Also a new parameter was added in the transformation for setting a user dictionary to the Kuromoji analyzer. It is an optional parameter, so the search results should be greatly improved on Japanese content even without setting a user dictionary.
Regards, Sorin
<oXygen/> XML Editor
On 4/17/2015 2:34 PM, T. Hatanaka wrote:
Thank you for hearing. Analyzer + user dic will be a huge plus in the relevant cultural area like us.
oXygen-user mailing list oXygen-user@oxygenxml.com http://www.oxygenxml.com/mailman/listinfo/oxygen-user