Internationalization

concerning Java and XML

Ritzberger Fritz
2006-01-06, revised 2017-11-11

Introduction

Normally two types of strings exist in applications:

internal strings that need no translation into a locale language, as they do not appear on any user interface, and
strings that have to be translated into the locale language on application startup because they will be shown on some user interface.

The traditional way to internationalize Java applications is to use some utility class like "Language" that stores the actual translations of any application-defined language-neutral string. This requires the wrapping of any string into some translation-call like "Language.get(neutralString)", and resource-loading of property files that contain the translations (see "Traditional" example below).

When you build a GUI from some XML specification (e.g. XUL), that XML also will hold language-specific strings which have to be translated by the building source-code. This requires hardcoding the structure of the GUI specification: element and attribute names have to be duplicated as source-code strings to be able to access the XML.

In other words:

We have language-specific strings not only in Java code, but also in XML documents!

But these XML files do not only hold language-specific texts, normally there is a lot of semantic, too. In case the schema (structure) of that XML changes, also the processing Java code has to be adapted. This is a classical maintainance problem. The more XML is used to externalize things, the more this aspect gets important.

Wouldn't it be nice to have internationalization on XML level, and maybe use XML instead of property files and resource bundles? Besides, the Java Properties class does not support encodings, so eastern languages have to be written using escape-sequences, which is not very readable. XML supports an encoding declaration, so you can get rid of such cryptography.

By the means of an XSLT processor (contained in JDK since 1.4), an internationalization as pure XML solution (without Java!) is possible, see code below. This concept uses a separate XML file that contains only translations.

Translation texts find their target by using an "id" attribute in the language-neutral XML file (so the latter has to provide such).

The translation is done by an XSLT stylesheet that works generically, that means it does not depend on a certain document type, only the "id" attribute must be present and unique.

Launching the XSLT transformation is the only part that has to be coded in some programming language. Thus, such a solution can be used with any programming-language that provides libraries for XML and XSLT processing.

The following Java solution provides the same naming conventions as Java resource bundles do: a file named strings.xml will be translated by searching for strings_de.xml, strings_fr.xml, ..., according to the platform locale. Each of these translation XML files can have its own encoding. This eases a translation by some third party: you send the party e.g. an English file strings_en.xml, and you get back the Chinese translation strings_ch.xml, without having to care about the encoding they use. Just be aware that not any GUI font can render Chinese letters!

Traditional way to internationalize applications (using Java):

strings.properties:

Cancel=Default text for Cancel

strings_de.properties:

Cancel=Abbrechen

Application.java:

String
neutralButtonLabel = "Cancel";

JButton button = new
JButton(Language.singleton.get(neutralButtonLabel));

toolbar.add(button);

XML internationalization via XSLT:

import java.util.*;
import java.io.*;
import java.net.URL;
import javax.xml.parsers.*;
import javax.xml.transform.*;
import javax.xml.transform.stream.*;
import javax.xml.transform.sax.*;
import org.xml.sax.*;
import org.xml.sax.helpers.*;

/**
* Read strings.xml,
* transform it using translate.xsl that reads strings_de.xml,
* process the translated result and put it into some Map.
*/
public class Language
{
    public static final Language singleton = new Language("strings.xml", Locale.getDefault());

    private Properties translations = new Properties();

    private Language(String fileName, Locale locale) throws Exception    {
       parse(translate(fileName, locale));
    }

    public String get(String neutralString)   {
       return translations.getProperty(neutralString);
    }

    private void parse(byte [] xml) throws Exception    {
        SAXParserFactory factory = SAXParserFactory.newInstance();
        SAXParser saxParser = factory.newSAXParser();
        saxParser.parse(new ByteArrayInputStream(xml), new IdTextHandler());
    }

    private byte [] translate(String fileName, Localelocale) throws Exception    {
        InputStream styleSheetStream = null;
        InputStream inputStream = null;
        ByteArrayOutputStream outputStream = null;
        try    {
          // load the stylesheet
          styleSheetStream = getClass().getResourceAsStream("translate.xsl");
            StreamSource transformSource = new StreamSource(styleSheetStream);
            Transformer transformer = TransformerFactory.newInstance().newTransformer(transformSource);

          // build the translations URL and pass it as stylesheet parameter
          String baseName = fileName.substring(0, fileName.length() - ".xml".length());
          URL translationsUrl = getClass().getResource(baseName+"_"+locale.getLanguage()+".xml");
            transformer.setParameter("translations", translationsUrl);

          // transform the language-neutral XML file (= translate it)
            inputStream = getClass().getResourceAsStream(fileName);
            outputStream = new ByteArrayOutputStream();

            // now start XSLT processing
           transformer.transform(new StreamSource(inputStream), new StreamResult(outputStream));

           outputStream.close();    // flush
           return outputStream.toByteArray();
        }
        finally    {
           try    { styleSheetStream.close(); }   catch (Exception e)    {}
           try    { inputStream.close(); }    catch(Exception e)    {}
        }
    }

    // SAX callback handler that fills the translations Map
    private class IdTextHandler extends DefaultHandler
    {
       private String id;
       private String currentText;

       public void startElement(String uri,String localName, String qName, Attributes attributes)   {
          id = attributes.getValue("id");
          currentText = "";
       }

       public void characters(

char[] ch,int start,
int length

)   {
          currentText = new String(ch, start, length);
       }

       public void endElement(String uri,String localName, String qName)   {
          if (id != null && qName.equals("string"))
             translations.setProperty(id, currentText);
       }
    }

}

<?xml version="1.0"?>



<xsl:transform
    version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
>
    <xsl:output method="xml" encoding="ISO-8859-1" indent="yes" />

    <xsl:param name="translations" />
    <xsl:variable name="translation-map" select="document($translations)" />

    
    <xsl:template match="node()">
        <xsl:choose>
           <xsl:when test="name()">    
               <xsl:call-template name="translate">
                   <xsl:with-param name="node" select="." />
                   <xsl:with-param name="is-attribute" select="false()" />
               </xsl:call-template>
           </xsl:when>

           <xsl:otherwise>    
               <xsl:copy />    
           </xsl:otherwise>
        </xsl:choose>
    </xsl:template>

    
    <xsl:template match="@*">
        <xsl:call-template name="translate">
           <xsl:with-param name="node" select="." />
           <xsl:with-param name="is-attribute" select="true()" />
        </xsl:call-template>
    </xsl:template>

    

    
    <xsl:template name="translate">
        <xsl:param name="node"/>
        <xsl:param name="is-attribute" />

        <xsl:variable name="node-name" select="name($node)" />
        <xsl:variable name="node-id" select="$node/@id" />
        <xsl:variable name="parent-id" select="$node/../@id" />

        <xsl:choose>
           <xsl:when test="$node-id">
               <xsl:call-template name="look-for-translation">
                   <xsl:with-param name="node" select="$node" />
                   <xsl:with-param name="node-id" select="$node-id" />
                   <xsl:with-param name="target" select="$node-name" />
                   <xsl:with-param name="is-attribute" select="$is-attribute" />
               </xsl:call-template>
           </xsl:when>

           <xsl:when test="$parent-id and $node-name != 'id'">
               <xsl:call-template name="look-for-translation">
                   <xsl:with-param name="node" select="$node" />
                   <xsl:with-param name="node-id" select="$parent-id" />
                   <xsl:with-param name="target" select="$node-name" />
                   <xsl:with-param name="is-attribute" select="$is-attribute" />
               </xsl:call-template>
           </xsl:when>

           <xsl:otherwise>
               <xsl:call-template name="found-no-translation">
                   <xsl:with-param name="node" select="$node" />
               </xsl:call-template>
           </xsl:otherwise>
        </xsl:choose>
    </xsl:template>

    
    <xsl:template name="look-for-translation">
        <xsl:param name="node"/>
        <xsl:param name="node-id"/>
        <xsl:param name="target"/>
        <xsl:param name="is-attribute" />

        <xsl:variable name="translation-text" select="$translation-map//translation[
          @idref = $node-id and (not(@target) or @target = $target)]" />

        <xsl:choose>
           <xsl:when test="$translation-text">
               <xsl:choose>
                   <xsl:when test="$is-attribute">    
                       <xsl:attribute name="{ name($node) }">
                           <xsl:value-of select="$translation-text" />
                       </xsl:attribute>
                   </xsl:when>

                   <xsl:otherwise>    
                       <xsl:element name="{ name($node) }">
                           
                           <xsl:for-each select="$node/@*">
                               <xsl:apply-templates select="$node/@*" />
                           </xsl:for-each>

                           
                           <xsl:value-of select="$translation-text" />

                           
                           <xsl:apply-templates select="$node/*[not(text())]" />
                       </xsl:element>
                   </xsl:otherwise>
               </xsl:choose>
           </xsl:when>

           <xsl:otherwise>
               <xsl:call-template name="found-no-translation">
                   <xsl:with-param name="node" select="$node" />
               </xsl:call-template>
           </xsl:otherwise>
        </xsl:choose>
    </xsl:template>

    
    <xsl:template name="found-no-translation">
        <xsl:param name="node"/>

        <xsl:copy>
           <xsl:apply-templates select="$node/@* | $node/node()" />
        </xsl:copy>
    </xsl:template>

</xsl:transform>

Internationalization

Introduction

Traditional way to internationalize applications (using Java):

XML internationalization via XSLT:

Java Example Source

XML Example