TextNormalizer

esc.normalization.TextNormalizer

Attributes

Supertypes
class Object
trait Matchable
class Any

Members list

Value members

Concrete methods

def isSingleScript(input: String): Boolean

Returns true if all letters in the given text belong to the same Unicode block, which suggests that they come from the same alphabet. Only letters are checked; other characters are ignored. Also returns true if the string is empty or contains no letters.

Value parameters

input

String representing the text to check.

Attributes

Returns

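true if all letters belong to the same Unicode block, or if the input is empty or contains no letters; false otherwise.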
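The check described above can be sketched with the JDK's `Character.UnicodeBlock`. This is a minimal illustration, not the library's implementation: the object `SingleScriptSketch` and the helper `allLettersInOneBlock` are hypothetical names, and the sketch assumes a strict per-block comparison of letters only.

```scala
import java.lang.Character.UnicodeBlock

object SingleScriptSketch {
  // Hypothetical helper for illustration; not the code behind isSingleScript.
  def allLettersInOneBlock(input: String): Boolean = {
    // Work on code points so that supplementary characters are handled correctly.
    val blocks = input.codePoints().toArray.toSeq
      .filter(cp => Character.isLetter(cp)) // ignore digits, spaces, punctuation
      .map(cp => UnicodeBlock.of(cp))       // Unicode block of each letter
      .distinct
    // Zero blocks (empty input or no letters) or exactly one block counts as true.
    blocks.size <= 1
  }
}
```

Under this strict per-block reading, a string such as "café" yields false, because "é" lies in the Latin-1 Supplement block while the other letters are Basic Latin. Whether TextNormalizer behaves the same way is not stated in this documentation.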

def normalize(text: String): String

Default normalize method.

Value parameters

text

The string, e.g. a full name, to normalize.

Attributes

Returns

The normalized string.

def normalizeForSimpleSimilarity(text: String): String

Removes all whitespace and hyphens (" ", "-") from the text.

Value parameters

text

The string, e.g. a full name, to normalize.

Attributes

Returns

The normalized string.
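The removal of whitespace and hyphens can be sketched as a single character filter. The names `SimpleSimilaritySketch` and `stripSpacesAndHyphens` are hypothetical, and the sketch assumes that every whitespace character is dropped and that nothing else (such as letter case) is changed; the actual method may do more.

```scala
object SimpleSimilaritySketch {
  // Hypothetical stand-in for illustration; not the library's code.
  // Drops every whitespace character and every ASCII hyphen, keeps the rest as-is.
  def stripSpacesAndHyphens(text: String): String =
    text.filterNot(c => c.isWhitespace || c == '-')
}
```

With this sketch, "Anna-Maria Schmidt" and "Annamaria Schmidt" reduce to "AnnaMariaSchmidt" and "AnnamariaSchmidt", so a later comparison no longer depends on how the name was spaced or hyphenated.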

def normalizeNameElement(nameElement: String): String

Normalizes a single name element, not a full name. Currently it applies some normalizations for Russian and Chinese names.

Value parameters

nameElement

String representing the name element.

Attributes

Returns

The normalized string.

def normalizeWithLegalForm(text: String): String

Special normalization method for organisation names. It handles legal forms that consist of more than one word or name element, for example GmbH & Co. KG. The method first calls normalize internally.

Value parameters

text

The string, e.g. a full name, to normalize.

Attributes

Returns

The normalized string.
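One way to picture the handling of multi-word legal forms is to normalize the text first and then collapse each known legal form into a single token, so that it is no longer split into several name elements. The following is only a sketch under stated assumptions: the lookup table, the target spellings, and the placeholder `baseNormalize` are all hypothetical and do not reflect the rules TextNormalizer actually applies.

```scala
import java.util.Locale

object LegalFormSketch {
  // Hypothetical table: multi-word legal forms and a single-token spelling for each.
  // Both the entries and the target spellings are illustrative assumptions.
  private val multiWordForms: Seq[(String, String)] = Seq(
    "gmbh & co. kg" -> "gmbh_co_kg",
    "ag & co. kg"   -> "ag_co_kg"
  )

  // Placeholder for the base normalization; the real rules are not documented here.
  private def baseNormalize(text: String): String =
    text.trim.toLowerCase(Locale.ROOT).replaceAll("\\s+", " ")

  // Normalize first, then replace each known legal form with its single token.
  def collapseLegalForms(text: String): String =
    multiWordForms.foldLeft(baseNormalize(text)) { case (acc, (form, token)) =>
      acc.replace(form, token)
    }
}
```

In this sketch, "Example  GmbH & Co. KG" becomes "example gmbh_co_kg": two tokens instead of five, with the legal form kept together.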