Packages

c

esc.configuration

SimilarityConfig

case class SimilarityConfig(normOrgLegalformWeight: Double = 0.25, normOrgCountryWeight: Double = 0.5, nameElementSimilarityForHit: Double = 0.9, matchSelectionMode: Int = 0, checkDateForSearchHit: Boolean = true, dateComparisonMethod: Int = 0, maxDateYearDifferenceForHit: Int = 2, checkCountryForSearchHit: Boolean = true, similarityValueForSearchHit: Double = 0.9, numberOfHitsForSearchHit: Int = 2, maxNumberOfCandidatesFromSearch: Int = 10000, searchEntityGroupMode: Int = 0, allowOneLetterAbbreviation: Boolean = false, oneLetterAbbreviationWeight: Double = 0.5) extends Product with Serializable

Class for the configuration of the normalizing and similarity stuff. Important: Make sure you use the same configuration for indexing and searching/comparing. Otherwise there may be unwanted side effects.

normOrgLegalformWeight

Weight (reduction) of a legal form match (recommended: < 1, default is 0.25).

normOrgCountryWeight

Weight (reduction) of a country match (recommended: < 1, default is 0.5).

nameElementSimilarityForHit

Minimum similarity to mark as hit. Default is 0.9.

matchSelectionMode

Method of how a match is to be determined: 0 = Based on simialrity. 1 = Based on nofHits (number of hits). Default is 0.

checkDateForSearchHit

Defines whether the date should be taken into account. Default is true.

maxDateYearDifferenceForHit

Defines the uncertainty/tolerance in the annual comparison in number of years (+/-). Default is 2.

checkCountryForSearchHit

Defines whether the country should be considered. Default is true.

similarityValueForSearchHit

Value of the similarity from which the comparison is classified as a hit. Default is 0.9.

numberOfHitsForSearchHit

Value of the nofHits (number of hits) from which the comparison is classified as a hit. Default is 2.

maxNumberOfCandidatesFromSearch

Defines the maximum number of candidates to be considered by the IR search, from which hits are then determined. Default is 10000

searchEntityGroupMode

Defines the field by which the hits are to be grouped. Depends which value is unique: 0 = externalId, 1 = Id. Default is 0.

allowOneLetterAbbreviation

Defines whether abbreviations with a letter are taken into account. With true, for example, Benjamin is a hit with B. Default is false.

oneLetterAbbreviationWeight

If abbreviations are taken into account, this value defines the weight (reduction) of such a hit. Default is 0.5.

Linear Supertypes
Serializable, Product, Equals, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. SimilarityConfig
  2. Serializable
  3. Product
  4. Equals
  5. AnyRef
  6. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. Protected

Instance Constructors

  1. new SimilarityConfig(normOrgLegalformWeight: Double = 0.25, normOrgCountryWeight: Double = 0.5, nameElementSimilarityForHit: Double = 0.9, matchSelectionMode: Int = 0, checkDateForSearchHit: Boolean = true, dateComparisonMethod: Int = 0, maxDateYearDifferenceForHit: Int = 2, checkCountryForSearchHit: Boolean = true, similarityValueForSearchHit: Double = 0.9, numberOfHitsForSearchHit: Int = 2, maxNumberOfCandidatesFromSearch: Int = 10000, searchEntityGroupMode: Int = 0, allowOneLetterAbbreviation: Boolean = false, oneLetterAbbreviationWeight: Double = 0.5)

    normOrgLegalformWeight

    Weight (reduction) of a legal form match (recommended: < 1, default is 0.25).

    normOrgCountryWeight

    Weight (reduction) of a country match (recommended: < 1, default is 0.5).

    nameElementSimilarityForHit

    Minimum similarity to mark as hit. Default is 0.9.

    matchSelectionMode

    Method of how a match is to be determined: 0 = Based on simialrity. 1 = Based on nofHits (number of hits). Default is 0.

    checkDateForSearchHit

    Defines whether the date should be taken into account. Default is true.

    maxDateYearDifferenceForHit

    Defines the uncertainty/tolerance in the annual comparison in number of years (+/-). Default is 2.

    checkCountryForSearchHit

    Defines whether the country should be considered. Default is true.

    similarityValueForSearchHit

    Value of the similarity from which the comparison is classified as a hit. Default is 0.9.

    numberOfHitsForSearchHit

    Value of the nofHits (number of hits) from which the comparison is classified as a hit. Default is 2.

    maxNumberOfCandidatesFromSearch

    Defines the maximum number of candidates to be considered by the IR search, from which hits are then determined. Default is 10000

    searchEntityGroupMode

    Defines the field by which the hits are to be grouped. Depends which value is unique: 0 = externalId, 1 = Id. Default is 0.

    allowOneLetterAbbreviation

    Defines whether abbreviations with a letter are taken into account. With true, for example, Benjamin is a hit with B. Default is false.

    oneLetterAbbreviationWeight

    If abbreviations are taken into account, this value defines the weight (reduction) of such a hit. Default is 0.5.

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. val allowOneLetterAbbreviation: Boolean
  5. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  6. val checkCountryForSearchHit: Boolean
  7. val checkDateForSearchHit: Boolean
  8. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.CloneNotSupportedException]) @native()
  9. val dateComparisonMethod: Int
  10. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  11. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.Throwable])
  12. final def getClass(): Class[_ <: AnyRef]
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  13. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  14. val matchSelectionMode: Int
  15. val maxDateYearDifferenceForHit: Int
  16. val maxNumberOfCandidatesFromSearch: Int
  17. val nameElementSimilarityForHit: Double
  18. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  19. val normOrgCountryWeight: Double
  20. val normOrgLegalformWeight: Double
  21. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  22. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  23. val numberOfHitsForSearchHit: Int
  24. val oneLetterAbbreviationWeight: Double
  25. def productElementNames: Iterator[String]
    Definition Classes
    Product
  26. val searchEntityGroupMode: Int
  27. val similarityValueForSearchHit: Double
  28. final def synchronized[T0](arg0: => T0): T0
    Definition Classes
    AnyRef
  29. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException])
  30. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException])
  31. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException]) @native()

Inherited from Serializable

Inherited from Product

Inherited from Equals

Inherited from AnyRef

Inherited from Any

Ungrouped