Presto 0.127t Documentation

12.10. Regular Expression Functions

12.10. Regular Expression Functions

All of the regular expression functions use the Java pattern syntax, with a few notable exceptions:

regexp_extract_all(string, pattern) → array<varchar>

Returns the substring(s) matched by the regular expression pattern in string.

regexp_extract_all(string, pattern, group) → array<varchar>

Finds all occurrences of the regular expression pattern in string and returns the capturing group number group.

regexp_extract(string, pattern) → varchar

Returns the first substring matched by the regular expression pattern in string.

regexp_extract(string, pattern, group) → varchar

Finds the first occurrence of the regular expression pattern in string and returns the capturing group number group.

regexp_like(string, pattern) → boolean

Evaluates the regular expression pattern and determines if it is contained within string.

This function is similar to the LIKE operator, expect that the pattern only needs to be contained within string, rather than needing to match all of string. In other words, this performs a contains operation rather than a match operation. You can match the entire string by anchoring the pattern using ^ and $.

regexp_replace(string, pattern) → varchar

Removes every instance of the substring matched by the regular expression pattern from string.

regexp_replace(string, pattern, replacement) → varchar

Replaces every instance of the substring matched by the regular expression pattern in string with replacement. Capturing groups can be referenced in replacement using $g for a numbered group or ${name} for a named group. A dollar sign ($) may be included in the replacement by escaping it with a backslash (\$).

regexp_split(string, pattern) → array<varchar>

Splits string using the regular expression pattern and returns an array. Trailing empty strings are preserved.