Regular Expressions

A regular expression is a character string that defines a pattern for matching text.

You can use a regular expression to do these things:

  1. Extract indexes from the body of a document.
  2. Extract indexes from the filename of a document.
  3. Make sure that node indexes are correct. For example, make sure that an e-mail address or a phone number has the correct structure.

You can use all the regular expressions here.

Additionally, you can use a named capturing group.
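A named capturing group gives a matched part of the text a name, so that the value can be read by that name. The following is a minimal sketch in Python, which writes the syntax as (?P<name>...); many other engines write it as (?<name>...), so check which form applies to your environment. The filename layout, the pattern, and the group names invoice_number and year are assumptions made for illustration.

```python
import re

# Hypothetical pattern: captures an invoice number and a year from a filename
# such as "INV-10023_2021.pdf". Python spells named groups (?P<name>...).
FILENAME_PATTERN = re.compile(
    r"INV-(?P<invoice_number>\d+)_(?P<year>\d{4})\.pdf"
)


def extract_indexes(filename):
    """Return the named groups as a dict, or None if the filename does not match."""
    match = FILENAME_PATTERN.fullmatch(filename)
    return match.groupdict() if match else None


print(extract_indexes("INV-10023_2021.pdf"))
# {'invoice_number': '10023', 'year': '2021'}
print(extract_indexes("report.pdf"))
# None
```

Each group name becomes a key in the result, which maps naturally onto one index per group.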

For information about regular expressions, refer to RegexLib.

Examples of Regular Expressions

Simple email validation


Matches: test@example.c
Does not match: test@exa$

Advanced email validation


Matches: (and the e-mail address must be at the start of a line)
Does not match: test@example.c
Does not match: test@exa$
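
The behavior of the two e-mail examples above can be sketched in Python as follows. Both patterns are hypothetical stand-ins chosen only to reproduce the listed results; they are not the patterns this topic originally showed, and the address test@example.com is an assumed input. The sketch shows the two differences: the stricter pattern requires a suffix of at least two letters, and it is anchored to the start of a line.

```python
import re

# Hypothetical stand-in: local part, "@", domain, ".", suffix of any length.
SIMPLE_EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.\w+")

# Hypothetical stand-in: anchored with ^ (start of a line under re.MULTILINE)
# and requiring a suffix of at least two letters.
ADVANCED_EMAIL = re.compile(
    r"^[\w.+-]+@[\w-]+(\.[\w-]+)*\.[A-Za-z]{2,}", re.MULTILINE
)


def matches(pattern, text):
    """Return True if the pattern is found anywhere in the text."""
    return pattern.search(text) is not None


print(matches(SIMPLE_EMAIL, "test@example.c"))        # True
print(matches(SIMPLE_EMAIL, "test@exa$"))             # False
print(matches(ADVANCED_EMAIL, "test@example.com"))    # True
print(matches(ADVANCED_EMAIL, "test@example.c"))      # False
print(matches(ADVANCED_EMAIL, "To: test@example.com"))  # False, not at line start
```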

Phone number, style 1


Matches: 011-8804411
Does not match: 011-880 4411

Phone number, style 2


Matches: (011)8804411 (and the phone number must be at the start of a line)
Does not match: (011) 880 4411
Does not match: (0118)804411
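
The two phone number styles above can be sketched in Python as follows. The patterns are hypothetical stand-ins that reproduce the listed results, not the patterns this topic originally showed. Style 1 is assumed to be three digits, a hyphen, and seven digits; style 2 is assumed to be a three-digit area code in parentheses followed by seven digits, anchored to the start of a line.

```python
import re

# Hypothetical stand-in for style 1: 3 digits, hyphen, 7 digits, no spaces.
PHONE_STYLE_1 = re.compile(r"\d{3}-\d{7}")

# Hypothetical stand-in for style 2: "(", 3 digits, ")", 7 digits.
# With re.MULTILINE, ^ matches at the start of every line.
PHONE_STYLE_2 = re.compile(r"^\(\d{3}\)\d{7}", re.MULTILINE)


def matches(pattern, text):
    """Return True if the pattern is found anywhere in the text."""
    return pattern.search(text) is not None


print(matches(PHONE_STYLE_1, "011-8804411"))       # True
print(matches(PHONE_STYLE_1, "011-880 4411"))      # False
print(matches(PHONE_STYLE_2, "(011)8804411"))      # True
print(matches(PHONE_STYLE_2, "(011) 880 4411"))    # False
print(matches(PHONE_STYLE_2, "(0118)804411"))      # False
```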

South African ID

(((\d{2}((0[13578]|1[02])(0[1-9]|[12]\d|3[01])|(0[13456789]|1[012])(0[1-9]|[12]\d|30)|02(0[1-9]|1\d|2[0-8])))|([02468][048]|[13579][26])0229))(( |-)(\d{4})( |-)(\d{3})|(\d{7}))

Matches: 8001015009087
Does not match: d001015009087
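
The pattern above can be tried out with a short Python sketch. The pattern string is copied unchanged and only split across lines for readability; the function name is_valid_sa_id and the use of a full-string match are assumptions made for illustration. Note that the pattern checks only the layout and the date in the first six digits (including 29 February in leap years); it does not verify any check digit.

```python
import re

# The South African ID pattern from this topic, split across lines.
# Adjacent raw strings are concatenated, so the pattern itself is unchanged.
SA_ID_PATTERN = re.compile(
    r"(((\d{2}((0[13578]|1[02])(0[1-9]|[12]\d|3[01])"
    r"|(0[13456789]|1[012])(0[1-9]|[12]\d|30)"
    r"|02(0[1-9]|1\d|2[0-8])))"
    r"|([02468][048]|[13579][26])0229))"
    r"(( |-)(\d{4})( |-)(\d{3})|(\d{7}))"
)


def is_valid_sa_id(text):
    """Return True if the whole string matches the ID pattern (assumed usage)."""
    return SA_ID_PATTERN.fullmatch(text) is not None


print(is_valid_sa_id("8001015009087"))    # True
print(is_valid_sa_id("d001015009087"))    # False, starts with a letter
print(is_valid_sa_id("800101 5009 087"))  # True, separators are allowed
print(is_valid_sa_id("8002305009087"))    # False, 30 February is rejected
```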

Refer to National Identification Number.

