Last date modified: 2026-May-05

Searching for symbols and emojis

This topic describes how to make symbols and emojis searchable in a dtSearch index and how to use regular expressions to search for certain symbols reserved as search operators, such as the % sign.

Searching for symbols

There are two ways to make a character searchable, depending on whether the hexadecimal character code is less than 007F or greater than 0080.

Also refer to:

For hexadecimal characters less than 007F

To search for symbols with hexadecimal characters less than 007F, perform the following steps:

  1. Create a dtSearch index.
  2. Update the Alphabet file to include the % sign as an indexed character. For more information on the Alphabet list, see Alphabet list.
    1. Enter the following under [Letters] // Original letter, lower case, upper case, unaccented:
      [Space] [%] [Space] [%] [Space] [%] [Space] [%]
      You must have a leading space. You cannot have a trailing space.
      Alphabet file with symbols
    2. Delete the % sign from the [Ignore] section. Take care not to remove the gray boxes, which represent non-printable characters. Removing them may break the index.
      Before removing the %:
      The ignore settings on the alphabet file showing the percent symbol that you should delete.
      After removing the %:
      The ignore settings on the alphabet file after you delete the percent symbol.
    3. Repeat these steps for any other symbols that you need to include in an index. Delete the appropriate symbols from the [Spaces], [Ignore], or [Hyphens] sections. Please note, if you edit the these sections, you must keep the leading space. Removal of leading spaces will cause errors in Relativity.
  3. Perform a full build of the dtSearch index.

Example: Searching for percentage (%) terms in documents

An example would be searching for documents that contain the term 75%. Since the % symbol is a reserved dtSearch operator used for fuzzy searching, dtSearch interprets % as a search operator rather than a literal character, even after you have indexed it in the Alphabet file. This is why you must use the regular expression syntax (e.g., "##75\u0025") to search for the literal % symbol. Typing 75% directly into the search box will trigger a fuzzy search instead of looking for the literal text "75%," which may return false hits such as $75. See Fuzzy searching for more information on using the percentage sign for fuzzy searches. To search for documents that contain the term 75%, do the following:

  1. Update the Alphabet file to include the % symbol as an indexed character. See Alphabet list for more information.
  2. Delete the % sign from the [Ignore] section.
  3. Perform a build of the dtSearch index. Enter the following regular expression in your search box, ensuring you select the proper dtSearch index:
Copy
"##75\u0025"

To break down this regular expression into its parts:

  • ## signals to Relativity to treat the string as a regular expression.
  • 75 indicates the percentage amount for the search.
  • \u indicates a search using a character's Unicode value which follows in the next four characters.
  • 0025 indicates the hexadecimal Unicode value for the % symbol.
  • Remove the "\u" before the four character value in the Unicode Entity field when adding to your Alphabet file.
  • For most symbols, once you have indexed the character in the Alphabet file, you can type them directly into the dtSearch box without using regular expressions. For example, to search for 30!, enter 30! into the dtSearch box after you have made the exclamation point (!) an indexed character. You only need to use regular expressions for certain symbols. For more information on regular expressions, see Using regular expressions with dtSearch.

For hexadecimal characters greater than 0080

For characters outside of the ASCII code range, characters greater than 0080, such as § and £, you cannot make them searchable by adding them to the [Letters] section. Instead, create an AdditionalLetters section at the bottom (very last line) of the Alphabet file and insert the characters' Unicode value. For example: AdditionalLetters = 20AC 00A3 00A5. This would make the Euro symbol (20AC), British Pound (00A3), and Japanese Yen (00A5) searchable.

The additional letters settings on the alphabet file.

Be sure to remove the "\u" before the four character value in the Unicode Entity field when adding to your Alphabet file.

Commonly requested searchable symbols

Below is a list of Unicode hex values necessary to make some commonly requested symbols searchable:

  • 00A3 — £ (British Pound)
  • 00A5 — ¥ (Yen half-width)
  • FFE5 — ¥ (Yen full-width)
  • 00A7 — § (Section break)
  • 00A9 — © (Copyright)
  • 00AE — ® (Registered Trademark)
  • 00B6 — ¶ (Paragraph marker)
  • 20AC — € (Euro)

Searching for emojis

Relativity supports searching for emojis with dtSearch indexes.

Note the following about emojis:

  • Emoji search works with file formats that store emojis as Unicode characters, such as Word documents and emails.
    • File formats that store emojis in other formats, such as storing the emoji name, are currently not supported.
    • Some examples include messages from Slack and Teams.

To run emoji searches:

  1. Enter an emoji in the search box.
    You can copy and paste an existing emoji, or use the appropriate shortcut keys for your operating system. For example, in Windows, click the Windows Key Windows Key + period (.) to open the emoji sampler.
    The user interface for running emoji searches.
  2. Click Enter.

Your search results appear in the document list.

Additional emojis

If you need to search for emojis specific to your organization, you can add them to the bottom of your Alphabet file using "AdditionalLetters32." After adding them to your Alphabet file, you must run a full index build.

If you currently have codes listed for AdditionalLetters32, add any new codes to the end of the existing listing. For example, if you currently have "AdditionalLetters32 = 231A 231B" and want to add "23EA" and "23EB," then add them after 231B like this: "AdditionalLetters32 = 231A 231B 23EA 23EB". Do not add another "AdditionalLetters 32" line.

In the Alphabet file, add the unicode for those emojis.

Return to top of the page
Feedback