header
<regex>
Regular Expressions
Regular expressions are a standardized way to express patterns to be matched against sequences of characters.
The standard C++ library provides support for regular expressions in the <regex> header through a series of operations. All these operations make use of some typical regex parameters:
- Target sequence (subject): The sequence of characters searched for the pattern. Generally, this is a range specified by two iterators, but some functions also accept a c-string or a string object instead.
- Regular expression (pattern): The pattern which is searched for in the target sequence. This must be an object of a basic_regex type (such as regex), generally constructed from a string with a special syntax that describes what constitutes a match (see ECMAScript syntax).
- Matches array: Some operations allow to retrieve information about matches. This information is stored in one of the special match_results array types (such as cmatch or smatch).
- Replacement string: Some operations can replace matches. These replacements are specified in strings that allow for a special format (see ECMAScript syntax).
Regex operations
Regex operations are performed using either functions or special iterator adaptors:
Functions:
- regex_match
- Match sequence (function template)
- regex_search
- Search sequence (function template)
- regex_replace
- Replace matched sequence (function template)
Iterator types:
- regex_iterator
- Regex iterator (class template)
- regex_token_iterator
- Regex token iterator (class template)
Classes
The regex functions and iterators make heavy use of a set of supporting types as arguments and return values:
Basic types:
- basic_regex
- Regular expression (class template)
- match_results
- Match results (class template)
- sub_match
- Sub-expression match (class template)
- regex_traits
- Regex traits (class template)
- regex_error
- Regex exception (class)
Some of these types are templates, and have aliases for their most common instantiations:
basic_regex instantiations:
- regex
- Regex (class)
- wregex
- Regex for wchar_t (class)
match_results instantiations:
- cmatch
- match_results for string literals (class)
- wcmatch
- match_results for wide string literals (class)
- smatch
- match_results for string objects (class)
- wsmatch
- match_results for wide string objects (class)
sub_match instantiations:
- csub_match
- sub_match for string literals (class)
- wcsub_match
- sub_match for wide string literals (class)
- ssub_match
- sub_match for strings (class)
- wssub_match
- sub_match for wide strings (class)
Namespaces
This header also defines a namespace, regex_constants, under which all constant values to be used by the library are located:
- regex_constants
- regex constants (namespace)
Grammar
Regular expressions follow very strict grammars. By default, the functions in this library use the ECMAScript grammar:
- ECMAScript syntax
- ECMAScript regular expressions pattern syntax (syntax specifications)
Other functions
- begin
- Iterator to beginning (function template)
- end
- Iterator to end (function template)