Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: v4.0.x
Version: 4.0.x


1. We will need regular dictionaries in order to store our custom words and phrases
        a. Make a POST to /regulardictionarystore/regulardictionaries
        With a json in the body like this:

...

Code Block
languagejs
titleexample of a parser configuration
linenumberstrue
collapsetrue
{
	"extensions" : {
    "dictionaries" : [
    	{
        	"type" : "eddi://ai.labs.parser.dictionaries.integer"
        },
        {
            "type" : "eddi://ai.labs.parser.dictionaries.decimal"
        },
        {
            "type" : "eddi://ai.labs.parser.dictionaries.punctuation"
        },
        {
            "type" : "eddi://ai.labs.parser.dictionaries.email"
        },
        {
            "type" : "eddi://ai.labs.parser.dictionaries.time"
        },
        {
            "type" : "	eddi://ai.labs.parser.dictionaries.ordinalNumber"
        },
        {
            "type" : "eddi://ai.labs.parser.dictionaries.regular",
            "config" : {
        	    "uri" : "eddi://ai.labs.regulardictionary/regulardictionarystore/regulardictionaries/<UNIQUE_ID>?version=<VERSION>"
            }
        }
    ],
    "corrections" : [
 {
        	"type" : "eddi://ai.labs.parser.corrections.stemming",
            "config" : {
            	"language" : "english",
                "lookupIfKnown" : "false"
            }
        },
		{
        	"type" : "eddi://ai.labs.parser.corrections.levenshtein",
            "config" : {
            	"distance" : "2"
            }
        },
        {
             "type" : "eddi://ai.labs.parser.corrections.mergedTerms"
        }
    ]},
	"config" : null
}


TypeDescription
eddi://ai.labs.parser.dictionaries.integer
matches all positive integers
eddi://ai.labs.parser.dictionaries.decimal
matches decimal numbers with '.' as well as ',' as a fractional separator
eddi://ai.labs.parser.dictionaries.punctuation

matches common punctuation:

punctuationexpression
!

exclamation_mark

?question_mark
.dot
,comma
:

colon

;semicolon


eddi://ai.labs.parser.dictionaries.email
Matches an email address with regex (\b[A-Z0-9._%+-]+@[A-Z0-9.-]+\.[A-Z]{2,4}\b)
eddi://ai.labs.parser.dictionaries.time

matches the following time formats:

e.g. 01:20 , 01h20 , 22:40  , 13:43:23

eddi://ai.labs.parser.dictionaries.ordinalNumber
ordinal numbers in English language such as 1st, 2nd, 3rd, 4th, 5th, ...
eddi://ai.labs.parser.dictionaries.regular
uri to a regular dictionary resource: eddi://ai.labs.regulardictionary/regulardictionarystore/regulardictionaries/<UNIQUE_ID>?version=<VERSION>





    3. Last step is to actually use the parser based on the created configurations
        a. POST to /api/v1/parser/<PARSER_ID>?version=<VERSION>
        b. In the body just plain text what you would like to be parsed
        c.The parser will return expressions representing the elements from your plain text

...