std::regex_token_iterator

ヘッダ `<regex>` で定義
`template< class BidirIt, class CharT = typename std::iterator_traits<BidirIt>::value_type, class Traits = std::regex_traits<CharT> > class regex_token_iterator`		(C++11以上)

std::regex_token_iterator は、ベースとなる文字シーケンス内の、正規表現のすべてのマッチの個々の部分マッチにアクセスする、読み込み専用の LegacyForwardIterator です。また (トークナイザのように) シーケンスの指定された正規表現がマッチしなかった部分にアクセスするために使うこともできます。

構築時に std::regex_iterator を構築し、毎回のインクリメント時に要求される部分マッチを現在の match_results から辿り、最後の部分マッチをインクリメントし終わるとベースとなる regex_iterator をインクリメントします。

デフォルト構築された std::regex_token_iterator は終端イテレータです。有効な std::regex_token_iterator が最後のマッチの最後のサブマッチに達した後にインクリメントされると、終端イテレータと等しくなります。終端イテレータを逆参照したりさらにインクリメントすると未定義動作を発生させます。

要求される部分マッチのインデックスのリストにインデックス -1 (マッチしない部分) がある場合、 std::regex_token_iterator は終端イテレータになる直前に接尾辞イテレータになることがあります。そのようなイテレータは、逆参照されると、最後のマッチとシーケンスの終端の間の文字シーケンスに対応する match_results を返します。

std::regex_token_iterator の一般的な実装はベースとなる std::regex_iterator、要求される部分マッチのコンテナ (std::vector<int> など)、部分マッチのインデックスと等しい内部カウンタ、現在のマッチの現在のサブマッチを指す std::sub_match へのポインタ、および (トークナイザモードで使用される) 最後のマッチしなかった文字シーケンスを格納する std::match_results オブジェクトを保持します。

型要件

-

BidirIt は LegacyBidirectionalIterator の要件を満たさなければなりません。

特殊化

一般的な文字シーケンス型のためにいくつかの特殊化が定義されます。

ヘッダ `<regex>` で定義
型	定義
`cregex_token_iterator`	`regex_token_iterator<const char*>`
`wcregex_token_iterator`	`regex_token_iterator<const wchar_t*>`
`sregex_token_iterator`	`regex_token_iterator<std::string::const_iterator>`
`wsregex_token_iterator`	`regex_token_iterator<std::wstring::const_iterator>`

メンバ型

メンバ型	定義
`value_type`	`std::sub_match<BidirIt>`
`difference_type`	std::ptrdiff_t
`pointer`	`const value_type*`
`reference`	`const value_type&`
`iterator_category`	std::forward_iterator_tag
`regex_type`	`basic_regex<CharT, Traits>`

メンバ関数

コンストラクタ	新しい `regex_token_iterator` を構築します (パブリックメンバ関数) [edit]
デストラクタ (暗黙に宣言)	キャッシュされている値を含め、 `regex_token_iterator` を破棄します (パブリックメンバ関数) [edit]
operator=	内容を代入します (パブリックメンバ関数) [edit]
operator==operator!= (C++20で削除)	2つの `regex_token_iterator` を比較します (パブリックメンバ関数) [edit]
operator*operator->	現在の部分マッチにアクセスします (パブリックメンバ関数) [edit]
operator++operator++(int)	イテレータを次の部分マッチに進めます (パブリックメンバ関数) [edit]

ノート

イテレータのコンストラクタに渡された std::basic_regex オブジェクトがイテレータより長生きすることを保証するのはプログラマの責任です。イテレータは regex を指すポインタを格納する std::regex_iterator を格納するため、 regex が破棄された後でイテレータをインクリメントすると未定義動作になります。

例

Run this code

#include <fstream>
#include <iostream>
#include <algorithm>
#include <iterator>
#include <regex>

int main()
{
   std::string text = "Quick brown fox.";
   // tokenization (non-matched fragments)
   // Note that regex is matched only two times: when the third value is obtained
   // the iterator is a suffix iterator.
   std::regex ws_re("\\s+"); // whitespace
   std::copy( std::sregex_token_iterator(text.begin(), text.end(), ws_re, -1),
              std::sregex_token_iterator(),
              std::ostream_iterator<std::string>(std::cout, "\n"));

   // iterating the first submatches
   std::string html = "<p><a href=\"http://google.com\">google</a> "
                      "< a HREF =\"http://cppreference.com\">cppreference</a>\n</p>";
   std::regex url_re("<\\s*A\\s+[^>]*href\\s*=\\s*\"([^\"]*)\"", std::regex::icase);
   std::copy( std::sregex_token_iterator(html.begin(), html.end(), url_re, 1),
              std::sregex_token_iterator(),
              std::ostream_iterator<std::string>(std::cout, "\n"));
}

出力:

Quick
brown
fox.
http://google.com
http://cppreference.com

言語
標準ライブラリヘッダ
フリースタンディング処理系とホスト処理系
名前付き要件
言語サポートライブラリ
コンセプトライブラリ (C++20)
診断ライブラリ
ユーティリティライブラリ
文字列ライブラリ
コンテナライブラリ
イテレータライブラリ
範囲ライブラリ (C++20)
アルゴリズムライブラリ
数値演算ライブラリ
ローカライゼーションライブラリ
入出力ライブラリ
ファイルシステムライブラリ (C++17)
正規表現ライブラリ (C++11)
アトミック操作ライブラリ (C++11)
スレッドサポートライブラリ (C++11)
技術仕様書

クラス
basic_regex (C++11)
sub_match (C++11)
match_results (C++11)
アルゴリズム
regex_match (C++11)
regex_search (C++11)
regex_replace (C++11)
イテレータ
regex_iterator (C++11)
regex_token_iterator (C++11)
例外
regex_error (C++11)
特性
regex_traits (C++11)
定数
syntax_option_type (C++11)
match_flag_type (C++11)
error_type (C++11)
正規表現の文法
Modified ECMAScript-262 (C++11)

メンバ関数
regex_token_iterator::regex_token_iterator
regex_token_iterator::operator=
比較
regex_token_iterator::operator==regex_token_iterator::operator!= (C++20未満)
観察
regex_token_iterator::operator*regex_token_iterator::operator->
変更
regex_token_iterator::operator++regex_token_iterator::operator++(int)

cppreference.com

検索

名前空間

変種

表示

操作

std::regex_token_iterator

目次

型要件

特殊化

メンバ型

メンバ関数

ノート

例

案内

ツール