views::cache_latest
Document #: | P3138R5 |
Date: | 2024-11-18 |
Project: | Programming Language C++ |
Audience: |
LWG |
Reply-to: |
Tim Song <t.canens.cpp@gmail.com> |
This paper proposes the views::cache_latest
adaptor that caches the result of the last dereference of the underlying iterator.
operator-
for sized_sentinel_for
cases.cache_latest
per SG9 feedback.The motivation for this view is given in [P2760R1] (under the name cache_last
) and quoted below for convenience:
One of the adaptors that we considered for C++23 but ended up not pursuing was what range-v3 calls
cache1
and what we’d instead like to call something likecache_last
. This is an adaptor which, as the name suggests, caches the last element. The reason for this is efficiency - specifically avoiding extra work that has to be done by iterator dereferencing.The canonical example of this is
transform(f) | filter(g)
, where if you then iterate over the subsequent range,f
will be invoked twice for every element that satisfiesg
:int main() { std::vector<int> v = {1, 2, 3, 4, 5}; auto even_squares = v | std::views::transform([](int i){ std::print("transform: {}\n", i); return i * i; }) | std::views::filter([](int i){ std::print("filter: {}\n", i); return i % 2 == 0; }); for (int i : even_squares) { std::print("Got: {}\n", i); } }
prints the following (note that there are 7 invocations of
transform
):The solution here is to add a layer of caching:
Which will ensure that
square
will only be called once per element.
operator*
This is the only reasonable place for this caching to happen. Caching in operator++
would require unnecessary overhead on every traversal if the user does not need to dereference every iterator (e.g., a striding view).
Because operator*
is required to be a const
member function (the cost of relaxing that requirement would be prohibitive as it affects every iterator and range adaptor), yet we want to have it perform a potentially-modifying operation, our options are basically:
This paper proposes the former. Input-only iterators, in general, are poor candidates for sharing across threads because, even when copyable (and they are not required to be in C++20), they are subject to “spooky action at a distance” where incrementing one iterator invalidates all copies thereof. This is why parallel algorithms require at least forward iterators. Sharing references to the same input iterator across threads seems like a fairly contrived scenario.
Moreover, std::istreambuf_iterator
already doesn’t meet this requirement in every major standard library implementation yet it does not seem to have appeared on any standard library maintainer’s radar in the decade since the publication of C++11. This further suggests that this guarantee is not relied upon in practice for input-only iterators.
It is true that we could require synchronization on operator*() const
. This probably isn’t terrible expensive in this context (we only need to protect against operator*
calls racing with each other, not them racing with operator++
), but adding synchronization to an adaptor whose primary purpose is to improve performance seems particularly dissatisfying when that synchronization will almost never be actually necessary.
During the Tokyo SG1 meeting, the room favored a limited carve-out to 16.4.6.10 [res.on.data.races] for this adaptor only. As it turns out, p1 of that subclause already has “unless otherwise specified”, so we don’t need to make any additional modification there. However, the wording is unclear how any of the requirements apply to templated functions in the standard library; this will be addressed separately as an issue.
range-v3 uses range_value_t<V>&&
, but this somewhat defeats the purpose of caching if you can so easily invalidate it. Moreover, there doesn’t seem to be a reason to force an independent copy of the value_type
. So long as the underlying iterator is not modified, the reference obtained from operator*
should remain valid.
This paper therefore proposes range_reference_t<V>&
. Note that even if the reference type is a real reference, it can be an expensive-to-compute reference, so caching could still make sense.
cache_latest
is never borrowed, input-only, never common, and not const-iterable.
iter_move
and iter_swap
indirectly_readable
and indirectly_swappable
requires iter_move
and iter_swap
to respectively not modify i
(in the 18.2 [concepts.equality] sense). Moreover, indirectly_readable
requires *i
to be equality-preserving. So the cache should not be invalidated by either operation. (The underlying element might be modified, but the reference itself, obtained from dereferencing the iterator, cannot.)
range-v3 calls this cache1
, which is not a particularly informative name (what does the “1” stand for)? The ranges plan papers oscillated between cache_last
and cache_latest
for no clear reason. SG9’s naming poll was a tie between cache<1>
and cache_latest
. The author does not consider the first viable (it is unclear what cache<2>
would mean when the caching implies an input range which can only ever represent one position). However, last
might be confused for the last element in the range (i.e., back()
) or the past-the-end position(as in [first, last)
) while latest
does not have this issue.
R3 of this paper accordingly renamed the proposed view to cache_latest
.
This wording is relative to [N4971].
<ranges>
Add the following to 25.2 [ranges.syn], header <ranges>
synopsis:
// [...]
namespace std::ranges {
// [...]
// [range.cache.latest], cache latest view
template<input_range V>
requires view<V>
class cache_latest_view;
namespace views {
inline constexpr unspecified cache_latest = unspecified;
}
}
cache_latest
Add the following subclause to 25.7 [range.adaptors]:
1 cache_latest_view
caches the last-accessed element of its underlying sequence so that the element does not have to be recomputed on repeated access. [ Note 1: This is useful if computation of the element to produce is expensive. — end note ]
2 The name views::cache_latest
denotes a range adaptor object (25.7.2 [range.adaptor.object]). Let E
be an expression. The expression views::cache_latest(E)
is expression-equivalent to cache_latest_view(E)
. [ Drafting note: Intentional CTAD to avoid double wrapping if E
is already a cache_latest_view
. ]
cache_latest_view
[range.cache.latest.view]template<input_range V>
requires view<V>
class cache_latest_view : public view_interface<cache_latest_view<V>>{
V base_ = V(); // exposition only
using cache_t = conditional_t<is_reference_v<range_reference_t<V>>, // exposition only
add_pointer_t<range_reference_t<V>>,
range_reference_t<V>>;
non-propagating-cache<cache_t> cache_; // exposition only
class iterator; // exposition only
class sentinel; // exposition only
public:
cache_latest_view() requires default_initializable<V> = default;
constexpr explicit cache_latest_view(V base);
constexpr V base() const & requires copy_constructible<V> { return base_; }
constexpr V base() && { return std::move(base_); }
constexpr auto begin();
constexpr auto end();
constexpr auto size() requires sized_range<V>;
constexpr auto size() const requires sized_range<const V>;
};
template<class R>
cache_latest_view(R&&) -> cache_latest_view<views::all_t<R>>;
1 Effects: Initializes
base_
withstd::move(base)
.
2 Effects: Equivalent to:
3 Effects: Equivalent to:
constexpr auto size() requires sized_range<V>;
constexpr auto size() const requires sized_range<const V>;
4 Effects: Equivalent to:
cache_latest_view::iterator
[range.cache.latest.iterator]namespace std::ranges {
template<input_range V>
requires view<V>
class cache_latest_view<V>::iterator {
cache_latest_view* parent_; // exposition only
iterator_t<V> current_; // exposition only
constexpr explicit iterator(cache_latest_view& parent); // exposition only
public:
using difference_type = range_difference_t<V>;
using value_type = range_value_t<V>;
using iterator_concept = input_iterator_tag;
iterator(iterator&&) = default;
iterator& operator=(iterator&&) = default;
constexpr iterator_t<V> base() &&;
constexpr const iterator_t<V>& base() const & noexcept;
constexpr range_reference_t<V>& operator*() const;
constexpr iterator& operator++();
constexpr void operator++(int);
friend constexpr range_rvalue_reference_t<V> iter_move(const iterator& i)
noexcept(noexcept(ranges::iter_move(i.current_)));
friend constexpr void iter_swap(const iterator& x, const iterator& y)
noexcept(noexcept(ranges::iter_swap(x.current_, y.current_)))
requires indirectly_swappable<iterator_t<V>>;
};
}
1 Effects: Initializes
current_
withranges::begin(parent.base_)
andparent_
withaddressof(parent)
.
2 Returns:
std::move(current_)
.
3 Returns:
current_
.
4 Effects: Equivalent to:
5 Effects: Equivalent to:
++*this
.
6 Effects: Equivalent to:
7 [ Note 1: Evaluations of
operator*
on the same iterator object may conflict (6.9.2.2 [intro.races]). — end note ]
friend constexpr range_rvalue_reference_t<V> iter_move(const iterator& i)
noexcept(noexcept(ranges::iter_move(i.current_)));
8 Effects: Equivalent to:
return ranges::iter_move(i.current_);
friend constexpr void iter_swap(const iterator& x, const iterator& y)
noexcept(noexcept(ranges::iter_swap(x.current_, y.current_)))
requires indirectly_swappable<iterator_t<V>>;
9 Effects: Equivalent to:
ranges::iter_swap(x.current_, y.current_)
.
cache_latest_view::sentinel
[range.cache.latest.sentinel]namespace std::ranges {
template<input_range V>
requires view<V>
class cache_latest_view<V>::sentinel {
sentinel_t<V> end_ = sentinel_t<V>(); // exposition only
constexpr explicit sentinel(cache_latest_view& parent); // exposition only
public:
sentinel() = default;
constexpr sentinel_t<V> base() const;
friend constexpr bool operator==(const iterator& x, const sentinel& y);
friend constexpr range_difference_t<V> operator-(const iterator& x, const sentinel& y)
requires sized_sentinel_for<sentinel_t<V>, iterator_t<V>>;
friend constexpr range_difference_t<V> operator-(const sentinel& x, const iterator& y)
requires sized_sentinel_for<sentinel_t<V>, iterator_t<V>>;
};
}
1 Effects: Initializes
end_
withranges::end(parent.base_)
.
2 Returns:
end_
.
3 Returns:
x.current_ == y.end_
.
friend constexpr range_difference_t<V> operator-(const iterator& x, const sentinel& y)
requires sized_sentinel_for<sentinel_t<V>, iterator_t<V>>;
4 Returns:
x.current_ - y.end_
.
friend constexpr range_difference_t<V> operator-(const sentinel& x, const iterator& y)
requires sized_sentinel_for<sentinel_t<V>, iterator_t<V>>;
5 Returns:
x.end_ - y.current_
.
Add the following macro definition to 17.3.2 [version.syn], header <version>
synopsis, with the value selected by the editor to reflect the date of adoption of this paper:
[N4971] Thomas Köppe. 2023-12-18. Working Draft, Programming Languages — C++.
https://wg21.link/n4971
[P2760R1] Barry Revzin. 2023-12-15. A Plan for C++26 Ranges.
https://wg21.link/p2760r1