package configuration
Type Members
- case class SimilarityConfig(normOrgLegalformWeight: Double = 0.25, normOrgCountryWeight: Double = 0.5, nameElementSimilarityForHit: Double = 0.9, matchSelectionMode: Int = 0, checkDateForSearchHit: Boolean = true, dateComparisonMethod: Int = 0, maxDateYearDifferenceForHit: Int = 2, checkCountryForSearchHit: Boolean = true, similarityValueForSearchHit: Double = 0.9, numberOfHitsForSearchHit: Int = 2, maxNumberOfCandidatesFromSearch: Int = 10000, searchEntityGroupMode: Int = 0, allowOneLetterAbbreviation: Boolean = false, oneLetterAbbreviationWeight: Double = 0.5) extends Product with Serializable
Configuration class for normalization and similarity computation. Important: use the same configuration for indexing and for searching/comparing. Otherwise there may be unwanted side effects.
- normOrgLegalformWeight
Weight (reduction) of a legal form match (recommended: < 1, default is 0.25).
- normOrgCountryWeight
Weight (reduction) of a country match (recommended: < 1, default is 0.5).
- nameElementSimilarityForHit
Minimum similarity to mark as hit. Default is 0.9.
- matchSelectionMode
How a match is determined: 0 = based on similarity, 1 = based on nofHits (number of hits). Default is 0.
- checkDateForSearchHit
Defines whether the date should be taken into account or not. Default is true.
- maxDateYearDifferenceForHit
Defines the tolerance of the year comparison, in number of years (+/-). Default is 2.
- checkCountryForSearchHit
Defines whether the country should be considered or not. Default is true.
- similarityValueForSearchHit
Value of the similarity from which the comparison is classified as a hit. Default is 0.9.
- numberOfHitsForSearchHit
Value of the nofHits (number of hits) from which the comparison is classified as a hit. Default is 2.
- maxNumberOfCandidatesFromSearch
Defines the maximum number of candidates to be considered by the IR search, from which hits are then determined. Default is 10000.
- searchEntityGroupMode
Defines the field by which the hits are grouped. Choose whichever value is unique: 0 = externalId, 1 = Id. Default is 0.
- allowOneLetterAbbreviation
Defines whether one-letter abbreviations are taken into account. If true, for example, "Benjamin" is a hit for "B". Default is false.
- oneLetterAbbreviationWeight
If abbreviations are taken into account, this value defines the weight (reduction) of such a hit. Default is 0.5.
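The parameters above can be illustrated with a minimal sketch. The case class below is a local stand-in that mirrors the documented signature so the snippet compiles on its own; in real use it would be imported from this package. The helpers `isSearchHit` and `yearsWithinTolerance` are hypothetical and not part of the library: they only show one plausible reading of matchSelectionMode and maxDateYearDifferenceForHit, assuming thresholds are inclusive.

```scala
// Local stand-in mirroring the documented signature (assumption: in real use,
// import SimilarityConfig from the configuration package instead).
case class SimilarityConfig(
  normOrgLegalformWeight: Double = 0.25,
  normOrgCountryWeight: Double = 0.5,
  nameElementSimilarityForHit: Double = 0.9,
  matchSelectionMode: Int = 0,
  checkDateForSearchHit: Boolean = true,
  dateComparisonMethod: Int = 0,
  maxDateYearDifferenceForHit: Int = 2,
  checkCountryForSearchHit: Boolean = true,
  similarityValueForSearchHit: Double = 0.9,
  numberOfHitsForSearchHit: Int = 2,
  maxNumberOfCandidatesFromSearch: Int = 10000,
  searchEntityGroupMode: Int = 0,
  allowOneLetterAbbreviation: Boolean = false,
  oneLetterAbbreviationWeight: Double = 0.5
)

object SimilarityConfigExample {
  // One shared instance, meant to be passed to both indexing and searching/comparing.
  // Only the overridden fields differ from the documented defaults.
  val sharedConfig: SimilarityConfig = SimilarityConfig(
    matchSelectionMode = 1,          // decide hits by nofHits
    numberOfHitsForSearchHit = 3,
    allowOneLetterAbbreviation = true
  )

  // Hypothetical helper: one possible reading of matchSelectionMode.
  // Assumes "from which" means an inclusive lower bound.
  def isSearchHit(cfg: SimilarityConfig, similarity: Double, nofHits: Int): Boolean =
    cfg.matchSelectionMode match {
      case 1 => nofHits >= cfg.numberOfHitsForSearchHit
      case _ => similarity >= cfg.similarityValueForSearchHit
    }

  // Hypothetical helper: the +/- year tolerance, skipped when the date check is off.
  def yearsWithinTolerance(cfg: SimilarityConfig, yearA: Int, yearB: Int): Boolean =
    !cfg.checkDateForSearchHit ||
      math.abs(yearA - yearB) <= cfg.maxDateYearDifferenceForHit
}
```

Because every parameter has a default, named arguments keep the overrides readable, and holding the configuration in a single value makes it harder to index with one set of parameters and search with another.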
Value Members
- object SimilarityConfigFactory
Convenience object for creating a SimilarityConfig from Java. In Scala, "new SimilarityConfig()" is exactly the same.