1. Revision History
1.1. Revision 0 - August, 5th, 2019
-
Initial release.
2. Motivation
Currently | With Proposal |
---|---|
❌ - Compilation error ⚠️ - Compiles, but is
|
✔️ - Compiles and works with no extra template instantiations ✔️ - Compiles and works with no extra templates. is
|
❌ - Compilation error ⚠️ - Compiles, but is and is
|
✔️ - Compiles and works, types match input. ✔️ - Compiles and works, where is and is .
|
Currently in C++, there is no Generic ("with a capital G") way to take a range apart with its iterators and put it back together. That is, the following code is not guaranteed to work:
template < typename Range > auto operate_on_and_return_updated_range ( Range && range ) { using uRange = std :: remove_cvref_t < Range > ; if ( std :: ranges :: empty ( range )) { // ... the below errors return uRange ( std :: forward < Range > ( range )); } /* perform some work with the iterators or similar */ auto first = std :: ranges :: begin ( range ); auto last = std :: ranges :: end ( range ); if ( * first == u '\0xEF ') { // ... std :: advance ( first , 3 ); // ... } // ... algorithm finished, // return the "updated" range! // ... but the below errors return uRange ( std :: move ( first ), std :: move ( last )); } int main () { std :: string_view meow_view = "나는 유리를 먹을 수 있어요. 그래도 아프지 않아요" ; // this line will error std :: string_view sub_view = operate_on_and_return_updated_range ( meow_view ); return 0 ; }
The current fix is to employ
to return a generic subrange:
template < typename Range > auto operate_on_and_return_updated_range ( Range && range ) { using uRange = std :: remove_cvref_t < Range > ; using I = std :: Iterator < uRange > ; using S = std :: Sentinel < uRange > ; using Result = std :: ranges :: subrange < I , S > ; if ( std :: ranges :: empty ( range )) { return uRange ( std :: forward < Range > ( range )); } // perform some work with the // iterators or similar auto first = std :: ranges :: begin ( range ); auto last = std :: ranges :: end ( range ); if ( * first == u '\0xEF ') { // ... std :: advance ( first , 3 ); // ... } // ... algorithm finished, // return the "updated" range! // now it works! return Result ( std :: move ( first ), std :: move ( last )); } int main () { std :: string_view meow_view = "나는 유리를 먹을 수 있어요. 그래도 아프지 않아요" ; auto sub_view = operate_on_and_return_updated_range ( meow_view ); // decltype(sub_view) == // std::ranges::subrange<std::string_view::iterator,std::string_view::iterator> // which is nowhere close to ideal. return 0 ; }
This makes it work with any two pair of iterators, but quickly becomes undesirable from an interface point of view. If a user passes in a
or a
that interface and information is entirely lost to the user of the above function.
does not -- and cannot/should not -- mimic the interface of the view it was created from other than what information comes from its iterators: it is the barebones idea of a pair-of-iterators/iterator-sentinel style of range. This is useful in the generic sense that if a library developer must work with iterators, they can always rely on creation of a
of the iterator and sentinel.
Unfortunately, this decreases usability for end users. Users who have, for example a
, would prefer to have the same type after calling an algorithm. There is little reason why the original type needs to be discarded if it supports being put back together from its iterators. This break-it-apart-and-then-generic-subrange approach also discards any range-specific storage optimizations and layout considerations, leaving us with the most bland kind of range similar to the "pair of iterators" model. Compilation time goes up as well: users must spawn a fresh
for every different set of iterator/sentinel/kind triplet, or handle deeply nested templates in templates as the input types. This makes it impossible to compile interfaces as dynamic libraries without having to explicitly materialize or manually cajole a
into something more palatable for the regular world.
There is also a problem where there are a wide variety of ranges that could conceivably meet this criterion, but do not. The author of the very first draft of this paper was not the only one to see utility in such operations. [p1739r0] does much the same that this paper does, without the introduction of a concept to formalize the behavior it presents. In particular, it selects views which can realistically have their return types changed to match the input range and operations being performed (or a similarly powerful alternative) by asking whether they are constructible from a subrange of the iterators with the expressions acted upon. This paper does not depend on any other papers, but note that the changes from [p1739r0], [p1391r2] and [p1394r2] all follow down to the logical conclusion laid out here:
-
Ranges should be reconstructible from their iterators (or subrange of their iterators) where applicable;
-
and, reconstructible ranges serve a useful purpose in generic algorithms, including not losing information and returning it in a much more cromulent and desirable form.
3. Design
The design is simple and is given in 2 exposition-only concepts added to the standard:
template < typename R > concept pair - reconstructible - range = Range < R > && forwarding - range < std :: remove_reference_t < R >> && std :: Constructible < R , iterator_t < R > , sentinel_t < R >> ; template < typename R > concept reconstructible - range = Range < R > && forwarding - range < std :: remove_reference_t < R >> && std :: Constructible < R , std :: ranges :: subrange < iterator_t < R > , sentinel_t < R >>> ;
It is the formalization that a range can be constructed from its begin iterator and end iterator/sentinel. It also provides an exposition-only concept for allowing a range to be constructed from a
of its iterator/sentinel pair. This allows a developer to propagate the input type’s properties after modifying its iterators for some underlying work, algorithm or other effect. This concept is also the basis of the idea behind [p1739r0].
Both concepts require that the type with any references removed model the exposition-only concept
. This ensures that the validity of the iterators is in fact independent of the lifetime of the range they originate from and that a "reconstructed" range does not depend on the original. We remove reference before performing this check, because all reference types that model
also model
and the intent of the proposed changes is narrower: (re)construction is assumed to be in constant time (this typically implies that
also models
, but it is sufficient to check
). Note that this explicitly excludes types like
from being reconstructible.
3.1. Should this apply to all Ranges?
Not all ranges can meet this requirement. Some ranges contain state which cannot be trivially propagated into the iterators, or state that cannot be reconstructed from the iterator/sentinel pair itself. However, most of the common ranges representing unbounded views, empty views, iterations viewing some section of non-owned storage, or similar can all be constructed from their iterator/iterator or iterator/sentinel pair.
For example
contains a exposition *semiregular-box* template type (ranges.semi.wrap) which holds a value to iterate over. It would not be possible to reconstruct the exact same range (e.g., iterators pointing to the exact same object) with the semi-regular wrapper.
3.2. Applicability
There are many ranges to which this is applicable, but only a handful in the standard library need or satisfy this. If [p1391r2] and [p1394r2] are accepted, then the two most important view types --
and
-- will model both concepts.
already fits this as well. By formalizing concepts in the standard, we can dependably and reliably assert that these properties continue to hold for these ranges. The ranges to which this would be helpfully applicable to in the current standard and proposals space are:
-
(already reconstructible)std :: ranges :: subrange -
(currently under consideration, [p1394r2]);std :: span -
(currently under consideration, [p1391r2]);std :: basic_string_view -
(proposed here);std :: ranges :: empty_view -
and,
(proposed here).std :: ranges :: iota_view
The following range adaptor closure objects will make use of the concepts in determine the type of the returned range:
-
(currently under consideration, [p1035r6] and [p1739r0], re-proposed here);view :: drop -
(currently under consideration, [p1739r0], re-proposed here).view :: take
There are also upcoming ranges from [range-v3] and elsewhere that could model this concept:
-
[p1255r4]'s
;std :: ranges :: ref_maybe_view -
[p0009r9]'s
;std :: mdspan -
and, soon to be proposed by this author for the purposes of output range algorithms, [range-v3]'s
.ranges :: unbounded_view
And there are further range adaptor closure objects that could make use of this concept:
-
,view :: slice
,view :: take_exactly
andview :: drop_exactly
from [range-v3]view :: take_last
Note that these changes will greatly aid other algorithm writers who want to preserve the same input ranges. In the future, it may be beneficial to provide more than just an exposition-only concept to check, but rather a function in the standard in
of the form
, whose goal is to check if it is possible to reconstruct the
or otherwise return a
.
This paper does not propose this at this time because concepts -- and the things that rely on them -- must remain stable from now and into infinity. It is better as an exposition-only named set of helpers whose semantics we are a little more free to improve or be adapted, rather than hard-and-fast functions and named concepts which are impossible to fix or improve in the future due to the code which may rely on it.
3.3. Two Concepts
By giving these ranges
, and/or
constructors, we can enable a greater degree of interface fidelity without having to resort to
for all generic algorithms. There should be a preference for
constructors, because one-argument constructors have extremely overloaded meanings in many containers and some views and may result in having to fight with other constructor calls in a complicated overload set. It also produces less compiler boilerplate to achieve the same result of reconstructing the range when one does not have to go through
. However, it is important to attempt to move away from the iterator, sentinel model being deployed all the time:
offers a single type that can accurately represent the intent and can be fairly easy to constrain overload sets on (most of the time).
This paper includes two concepts that cover both reconstructible methods.
4. Impact
Originally, the impact of this feature was perceived to be small and likely not necessary to work into C++20. Indeed: this paper originally targeted C++23 with the intent of slowly working through existing ranges and range implementations and putting the concept and the manifestation of concepts in range libraries, particularly range-v3, over time.
This changed in the face of [p1739r0]. Hauswedell’s paper here makes it clear there are usability and API wins that are solved by this concept for APIs that are already in the working draft today, and that not having the concept has resulted in interface inconsistency and ad-hoc, one-off fixes to fit limited problem domains without any respite to routines which have a desire to preserve the input types into their algorithms. Since this paper’s concept is likely to change interfaces API return values in a beneficial but ultimately breaking manner, this paper’s consideration was brought up to be presented as a late C++20 paper for the purpose of fixing the interface as soon as possible.
Note that this is a separate concept. It is not to be added to the base
concept, or added to any other concept. It is to be applied separately to the types which can reasonably support it for the benefit of algorithms and code which can enhance the quality of their implementation.
5. Proposed Changes
The following wording is relative to the latest draft paper, [n4820], and to several papers whose utilities have not been completely placed into the C++ Working draft such as [p1739r0] and [p1035r6].
5.1. Feature Test Macro
This paper results in an exposition-only concept to help guide the further development of standard ranges and simplify their usages in generic contexts. There is one proposed feature test macro,
, which is to be input into the standard and then explicitly updated every time a constructor from a pre-existing type is changed to reflect the new wording. We hope that by putting this in the standard early, most incoming ranges will be checked for compatibility with
and
. This paper also notes that both [p1394r2] and [p1391r2] do not add wording for feature test macros, making this paper’s feature test macro the ideal addition to encapsulate all reconstructible range changes.
5.2. Intent
The intent of this wording is to provide greater generic coding guarantees and optimizations by allowing for a class of ranges and views that model the new exposition-only definitions of a reconstructible range:
-
add a new feature test macro for reconstructible ranges to cover constructor changes;
-
add two new exposition-only requirements to [range.req];
-
add expression checks to
to reconstruct the range, similar to [p1739r0];view :: take -
add expression checks to
to reconstruct the range, similar to [p1739r0] and from [p1035r6];view :: drop -
add constructors for reconstructing the range to
;view :: empty_view -
add constructors for reconstructing the range to
;view :: iota_view -
and, make
model forwarding-range by addingview :: iota_view
overloads offriend
andbegin ()
.end ()
For ease of reading, the necessary portions of other proposal’s wording is duplicated here, with the changes necessary for the application of reconstructible range concepts. Such sections are clearly marked.
5.3. Proposed Library Wording
Append to §17.3.1 General [support.limits.general]'s Table 35 one additional entry:
Macro name Value Header(s) __cpp_lib_reconstructible_range 201907L ,
< string_view > ,
< ranges >
< span >
Insert into §24.4.2 Ranges [range.range]'s after clause 7, one additional clause:
8 The exposition-only pair-reconstructible-range and reconstructible-range concepts denote ranges whose iterator and sentinel pair can be used to efficiently construct an object of the type from which they originated.template < typename R > concept pair - reconstructible - range = Range < R > && forwarding - range < std :: remove_reference_t < R >> && std :: Constructible < R , iterator_t < R > , sentinel_t < R >> ; template < typename R > concept reconstructible - range = Range < R > && forwarding - range < std :: remove_reference_t < R >> && std :: Constructible < R , std :: ranges :: subrange < iterator_t < R > , sentinel_t < R >>> ;
Add to §24.6.1.2 Class template
[range.empty.view], a constructor in the synopsis and a new clause for constructor definitions:
namespace std :: ranges { template < class T > requires is_object_v < T > class empty_view : public view_interface < empty_view < T >> { public : constexpr empty_view () noexcept = default ; constexpr empty_view ( T * first , T * last ) noexcept ; static constexpr T * begin () noexcept { return nullptr ; } static constexpr T * end () noexcept { return nullptr ; } static constexpr T * data () noexcept { return nullptr ; } static constexpr ptrdiff_t size () noexcept { return 0 ; } static constexpr bool empty () noexcept { return true; } friend constexpr T * begin ( empty_view ) noexcept { return nullptr ; } friend constexpr T * end ( empty_view ) noexcept { return nullptr ; } }; } 24.6.1.3 Constructors [range.empty.cons]
empty_view ( T * first , T * last ) noexcept ; 1 Expects:
and
first == nullptr .
last == nullptr 2 Effects: none.
Add to §24.6.3 Class template
[range.iota.view]'s clause 2, a constructor to the synopsis:
namespace std :: ranges { template < class I > concept Decrementable = // exposition only see below ; template < class I > concept Advanceable = // exposition only see below ; template < WeaklyIncrementable W , Semiregular Bound = unreachable_sentinel_t > requires weakly - equality - comparable - with < W , Bound > class iota_view : public view_interface < iota_view < W , Bound >> { private : // [range.iota.iterator], class iota_view::iterator struct iterator ; // exposition only // [range.iota.sentinel], class iota_view::sentinel struct sentinel ; // exposition only W value_ = W (); // exposition only Bound bound_ = Bound (); // exposition only public : iota_view () = default ; constexpr explicit iota_view ( W value ); constexpr iota_view ( type_identity_t < W > value , type_identity_t < Bound > bound ); constexpr iota_view ( iterator first , sentinel last ); constexpr iterator begin () const ; constexpr sentinel end () const ; constexpr iterator end () const requires Same < W , Bound > ; constexpr friend iterator begin ( iota_view v ); constexpr friend auto end ( iota_view v ); constexpr auto size () const requires ( Same < W , Bound > && Advanceable < W > ) || ( Integral < W > && Integral < Bound > ) || SizedSentinel < Bound , W > { return bound_ - value_ ; } }; template < class W , class Bound > requires ( ! Integral < W > || ! Integral < Bound > || is_signed_v < W > == is_signed_v < Bound > ) iota_view ( W , Bound ) -> iota_view < W , Bound > ; }
Add to §24.6.3 Class template
[range.iota.view], after clause 8, a constructor:
constexpr iota_view ( iterator first , sentinel last ); 9 Effects: Equivalent to:
iota_view ( * first , last . bound_ )
Add to §24.6.3 Class template
[range.iota.view], after clause 11 (old) / 12 (new):
constexpr friend iterator begin ( iota_view v ); 13 Effects: Equivalent to:
return v . begin (); constexpr friend auto end ( iota_view v ); 14 Effects: Equivalent to:
return v . end ();
5.4. Proposed Library + P1739 wording
Modify §24.7.6.4
[range.take.adaptor] as follows:
1 The name
denotes a range adaptor object.
view :: take For some subexpressionsand
E , the expression
F is expression-equivalent to
view :: take ( E , F ) .
take_ view { E , F } 2 Let
and
E be expressions, and let T be
F . Then the expression
remove_cvref_t < decltype (( E )) > is expression-equivalent to:
view :: take ( E , F )
- —
if
T { ranges :: begin ( E ), ranges :: begin ( E ) + min < iter_ difference_ t < iterator_t < decltype (( E )) >>> ( ranges :: size ( E ), F )} models
T ,
ranges :: RandomAccessRange and pair-reconstructible-range;
ranges :: SizedRange - —
if
T { ranges :: subrange { ranges :: begin ( E ), ranges :: begin ( E ) + min < iter_ difference_ t < iterator_t < decltype (( E )) >>> ( ranges :: size ( E ), F )}} models
T ,
ranges :: RandomAccessRange and reconstructible-range;
ranges :: SizedRange - —
if that is well-formed;
ranges :: take_ view { E , F } - — otherwise,
is ill-formed.
view :: take ( E , F )
5.5. Proposed P1035 + P1739 wording changes
Modify [p1035r6]'s §23.7.8
as follows:
1 The name
denotes a range adaptor object.
view :: drop For some subexpressions E and F, the expression view::drop(E, F) is expression-equivalent to drop_view{E, F}.2 Let
and
E be expressions, and let
F be
T . Then, the expression
remove_cvref_t < decltype (( E )) > is expression-equivalent to:
view :: drop ( E , F )
- —
if
T { ranges :: begin ( E ) + min < iter_ difference_ t < iterator_t < decltype (( E )) >>> ( ranges :: size ( E ), F ), ranges :: end ( E )} models ranges::RandomAccessRange, ranges::SizedRange and pair-reconstructible-range;
T - —
if
T { ranges :: subrange { ranges :: begin ( E ) + min < iter_ difference_ t < iterator_t < decltype (( E )) >>> ( ranges :: size ( E ), F ), ranges :: end ( E )}} models ranges::RandomAccessRange, ranges::SizedRange and reconstructible-range;
T - —
if the expression is well-formed;
ranges :: drop_ view { E , F } - — otherwise,
is ill-formed.
view :: drop ( E , F )
6. Acknowledgements
Thanks to Corentin Jabot, Christopher DiBella, and Hannes Hauswedell for pointing me to p1035 and p1739 to review both papers and combine some of their ideas in here. Thanks to Eric Niebler for prompting me to think of the generic, scalable solution to this problem rather than working on one-off fixes for individuals views.