Add integer properties to a numeric index definition

This procedure describes how to add integer properties to a numeric index definition.

If you do not add integer properties to a numeric index definition, TEXTML Server assumes that all numeric strings are integers. The <integerindexproperties> element defines the range of values indexed, how surrounding non-numeric characters are handled, and which characters are interpreted as thousands separators.
Note: If your numeric index definition applies to monetary units, use decimal properties instead (see Add decimal properties to a numeric index definition).

To add integer properties to a numeric index definition:

  1. Create a numeric index definition. (See Create a numeric index definition.)
  2. Add an <integerindexproperties> element.
  3. To limit the range of values indexed, add an <interval> element. Use the <start> and <end> elements to specify the bounds of the range; the INCLUSIVE attribute indicates whether each bound is itself part of the range.

    In the example below, numbers from 11 to 999,999 inclusive are indexed:

    <integerindexproperties>
       <interval>
          <start INCLUSIVE="True">
             <number>11</number>
          </start>
          <end INCLUSIVE="True">
             <number>999999</number>
          </end>
       </interval>
    </integerindexproperties>
  4. To ignore non-numeric content around numbers, add a <contenttrim> element. Set its VALUE attribute to one of the following:
    • Left: TEXTML Server ignores non-numeric characters to the left of the number.
    • Right: TEXTML Server ignores non-numeric characters to the right of the number.
    • Both: TEXTML Server ignores non-numeric characters on both sides of the number.
    • None: TEXTML Server does not index numeric strings that include non-numeric characters.

    In the example below, non-numeric characters on both sides of the number are ignored:

    <integerindexproperties>
       <contenttrim VALUE="Both"/>
    </integerindexproperties>
  5. To specify which symbol is interpreted as a thousands separator, add a <thousandsymbol> element. Set its VALUE attribute to one of the following:
    • Comma: as in 12,000.
    • Space: as in 12 000.
    • Period: as in 12.000.
    • All: all separators (comma, space, and period) are ignored, so each of the examples above is indexed as 12000.
    • None: TEXTML Server does not recognize any symbol as a thousands separator, and numbers are interpreted exactly as written.

    In the example below, the comma is interpreted as the thousands separator:

    <integerindexproperties>
       <thousandsymbol VALUE="Comma"/>
    </integerindexproperties>
    Note: The value "Both" is listed in the DTD for <thousandsymbol> but is not used; it is retained only for compatibility with pre-4.X versions of the TEXTML Server DTD.
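
The three settings described in steps 3 to 5 can also be combined in a single <integerindexproperties> element. The following is a sketch only: it reuses values taken from the steps above, and the order of the child elements is an assumption (it simply follows the order of the steps), so check it against the TEXTML Server DTD before use.

    <integerindexproperties>
       <!-- Assumed child order: interval, contenttrim, thousandsymbol -->
       <interval>
          <start INCLUSIVE="True">
             <number>11</number>
          </start>
          <end INCLUSIVE="True">
             <number>999999</number>
          </end>
       </interval>
       <!-- Ignore non-numeric characters on both sides of the number -->
       <contenttrim VALUE="Both"/>
       <!-- Interpret the comma as the thousands separator -->
       <thousandsymbol VALUE="Comma"/>
    </integerindexproperties>

Going by the descriptions above, a string such as "qty 12,000 units" would be expected to be indexed as 12000 with these settings, while a value such as 5 would fall outside the interval and not be indexed.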