Tracking Issue for RFC 2930 (read-buf) #78485

nikomatsakis · 2020-10-28T13:12:46Z

This is a tracking issue for the RFC "2930" (rust-lang/rfcs#2930).
The feature gate for the issue is #![feature(read_buf)].

This is now called BorrowedBuf rather than ReadBuf.

About tracking issues

Tracking issues are used to record the overall progress of implementation.
They are also used as hubs connecting to other relevant issues, e.g., bugs or open design questions.
A tracking issue is however not meant for large scale discussion, questions, or bug reports about a feature.
Instead, open a dedicated issue for the specific matter and add the relevant feature gate label.

Steps

Implement the RFC
- Initial implementation in Implement most of RFC 2930, providing the ReadBuf abstraction #81156
- Vectorized APIs (read_buf_vectored, ReadBufs).
Adjust documentation (see instructions on rustc-dev-guide)
Stabilization PR (see instructions on rustc-dev-guide)
- Note that this is a use of rustc_must_implement_one_of, which is a language-observable thing, and thus needs some amount of oversight/documentation about that as a prerequisite of stabilization.

Unresolved Questions

Should read_buf return the number of bytes read like read does or should the ReadBuf track it instead? Some operations, like checking for EOF, are a bit simpler if read_buf returns the value, but the confusion around what is and is not trustworthy is worrysome for unsafe code working with Read implementations.
What should assume_init be named?
Should the API use a wrapper around &mut ReadBuf to prevent unexpected swapping of the caller-provided ReadBuf?
Resolve soundness issue: Unsound BufWriter copy_to specialization with the unstable read_buf feature #93305

Implementation history

Implement most of RFC 2930, providing the ReadBuf abstraction #81156

The text was updated successfully, but these errors were encountered:

nikomatsakis · 2020-10-28T13:16:03Z

( cc @rust-lang/libs )

beepster4096 · 2020-10-28T21:40:43Z

I'm interested in working on this.

@rustbot claim

beepster4096 · 2020-11-08T04:29:48Z

I am slightly confused by the API of ReadBufs. What return type should methods like initialized have: &[u8] or &[IoSlice]? If its the former, how do you know which slices are initialized/filled? If it's the latter, what happens if a slice is partially initialized/filled?

edit: probably I should ask this on zulip instead of github

sfackler · 2020-11-08T13:45:53Z

It would return &[IoSliceMut], and only include the slices that are fully initialized.

programmerjake · 2020-11-08T20:53:59Z

It would return &[IoSliceMut], and only include the slices that are fully initialized.

I would have expected it to return (&[IoSliceMut], &[u8]) where it is some number of fully initialized buffers and the initialized portion of the partially initialized buffer.

sfackler · 2020-11-08T21:08:46Z

Sure, that makes sense.

sfackler · 2020-12-05T00:11:41Z

@drmeepster are you still working on this?

beepster4096 · 2020-12-05T00:34:31Z

Yeah, I am. Although I'm currently working on #79607 because I needed it for this.

beepster4096 · 2020-12-17T05:25:11Z

Okay, I can continue working on this now that we have MaybeUninit::write_slice

sunshowers · 2021-01-19T20:58:06Z

I have a PR to add an inner_mut method to Tokio's implementation of ReadBuf: tokio-rs/tokio#3443. As far as I can tell it's a valid use case that there's no other way to do short of pointer arithmetic, so it may make sense to have this be in upstream Rust's ReadBuf as well.

erickt · 2021-04-28T15:43:40Z

Have we considered extending ReadBuf to be generic on the type, rather than be constrained to u8? I'm guessing much of ReadBuf is generic over the type.

This came up because I'm looking into fixing some UB in rayon, which is passing around uninitialized &mut [T] in its collect iterator. I think the main thing making rayon use this over a &mut Vec<T> is that it wants to split the output buffer across threads, but there's no safe way to do this without initializing the slice. I'd like to replace this with a safe abstraction that's probably quite similar to the design proposed for ReadBuf (plus a split_at method), so I thought maybe there are other people potentially interested in this functionality.

Going further, it would also be interesting to see if we could rewrite Vec to sit upon ReadBuf.

djc · 2021-04-28T15:53:07Z

I made very similar points here:

https://internals.rust-lang.org/t/readbuf-as-part-of-rust-edition-2021/14256/9?u=djc

Amanieu · 2021-04-28T22:05:25Z

Note that we now have a spare_capacity_mut method on Vec which gives you a &mut [MaybeUninit<T>] for the uninitialized part of a vector.

jmillikin · 2023-11-02T05:25:09Z

I'm interested in BorrowedBuf and BorrowedCursor for use in no_std environments. Would it be possible to move them into core and stabilize them separately from the new std::io::{Read,Write} functionality?

The current implementation of those types has no dependency on std, and I'd be happy to send out the PRs if the Rust folks are willing to review them.

…nic, r=dtolnay Don't panic in `<BorrowedCursor as io::Write>::write` Instead of panicking if the BorrowedCursor does not have enough capacity for the whole buffer, just return a short write, [like `<&mut [u8] as io::Write>::write` does](https://doc.rust-lang.org/src/std/io/impls.rs.html#349). (cc `@ChayimFriedman2` rust-lang#78485 (comment)) (I'm not sure if this needs an ACP? since it's not changing the "API", just what the function does)

…c, r=dtolnay Don't panic in `<BorrowedCursor as io::Write>::write` Instead of panicking if the BorrowedCursor does not have enough capacity for the whole buffer, just return a short write, [like `<&mut [u8] as io::Write>::write` does](https://doc.rust-lang.org/src/std/io/impls.rs.html#349). (cc `@ChayimFriedman2` rust-lang#78485 (comment)) (I'm not sure if this needs an ACP? since it's not changing the "API", just what the function does)

Don't panic in `<BorrowedCursor as io::Write>::write` Instead of panicking if the BorrowedCursor does not have enough capacity for the whole buffer, just return a short write, [like `<&mut [u8] as io::Write>::write` does](https://doc.rust-lang.org/src/std/io/impls.rs.html#349). (cc `@ChayimFriedman2` rust-lang/rust#78485 (comment)) (I'm not sure if this needs an ACP? since it's not changing the "API", just what the function does)

the8472 · 2023-11-23T23:17:45Z

The BorrowedCursor documentation could use some polish. It says some slightly contradictory or misleading things. The docs start with

A writeable view of the unfilled portion of a BorrowedBuf.

but the very next paragraph:

Provides access to the initialized and uninitialized parts of the underlying BorrowedBuf.

Looking at the actual implementations makes it obvious that they mostly pass-through and grab slices from the underlying BorrowedBuf and totally ignore the write position (start). The write position is only relevant for written().

tgross35 · 2024-02-26T20:59:10Z

Does core_io_borrowed_buf really need to be a separate feature gate, or could it be rolled into read_buf? Bit confusing that the BorrowedBuf/BorrowedCursor docs all point to #117693 rather than this issue, assuming the same types are designed to work in core.

Also related to @the8472's comment, docs need examples.

Don't panic in `<BorrowedCursor as io::Write>::write` Instead of panicking if the BorrowedCursor does not have enough capacity for the whole buffer, just return a short write, [like `<&mut [u8] as io::Write>::write` does](https://doc.rust-lang.org/src/std/io/impls.rs.html#349). (cc `@ChayimFriedman2` rust-lang/rust#78485 (comment)) (I'm not sure if this needs an ACP? since it's not changing the "API", just what the function does)

a1phyr · 2024-04-09T08:14:58Z

There is something "fun" with read_buf as it is defined today, but I am not sure it was done on purpose: unlike read, it is possible to return both data and an error.
This is possible because the cursor keeps tracks of the read bytes itself, so returning an error is not incompatible with reading bytes.

I don't know if this is expected and it is probably useful, but it might be surprising if some implementations start doing that.

In fact, read_buf_exact documents that it has such a behavior:

If this function returns an error, all bytes read will be appended to cursor.

Something more problematic is that it is impossible to write a correct read implementation on top of read_buf (either the written bytes or the error would have to be discarded).

ChrisDenton · 2024-04-09T13:28:23Z

Tbh, I think read is in the wrong here. For example, I/O reads very much depends on the behaviour of the underlying OS which we have no control over. Currently we are indeed forced to discard any OS errors if any bytes are read.

Don't panic in `<BorrowedCursor as io::Write>::write` Instead of panicking if the BorrowedCursor does not have enough capacity for the whole buffer, just return a short write, [like `<&mut [u8] as io::Write>::write` does](https://doc.rust-lang.org/src/std/io/impls.rs.html#349). (cc `@ChayimFriedman2` rust-lang/rust#78485 (comment)) (I'm not sure if this needs an ACP? since it's not changing the "API", just what the function does)

a1phyr · 2024-05-21T09:18:37Z

It would be great to have a final word from libs-team on this:

Say that this undesirable and change the API to prevent this
Say that this undesirable and document about such behavior (and that such read_buf implementations are buggy)
Say that this fine (or even great), document that this may happen because it is not obvious and change the few uses of read_buf in std (that don't take this possibility into account).
Some other choice that I didn't think about ?

Currently we are indeed forced to discard any OS errors if any bytes are read.

I don't about Windows at all but on Unix at least I don't think this is possible. Can you link such an implementation ?

ChrisDenton · 2024-05-21T09:46:39Z

See the post above for why this is nominated.

Currently we are indeed forced to discard any OS errors if any bytes are read.

I don't about Windows at all but on Unix at least I don't think this is possible. Can you link such an implementation ?

Neither the Windows API nor the documentation guarantees that no bytes will be read in an error case. One documented case is pipes in message mode:

If a named pipe is being read in message mode and the next message is longer than the nNumberOfBytesToRead parameter specifies, ReadFile returns FALSE and GetLastError returns ERROR_MORE_DATA. The remainder of the message can be read by a subsequent call to the ReadFile or PeekNamedPipe function.

To be clear, I'd have to check if we actually do handle this (I suspect not) but if we're going strictly by the Read trait documentation then we should test if any bytes are read and return success if so. The other option, of course, would be to weaken the std documentation.

de-vri-es · 2024-05-21T11:59:19Z

See the post above for why this is nominated.

Currently we are indeed forced to discard any OS errors if any bytes are read.

I don't about Windows at all but on Unix at least I don't think this is possible. Can you link such an implementation ?

Neither the Windows API nor the documentation guarantees that no bytes will be read in an error case. One documented case is pipes in message mode:

If a named pipe is being read in message mode and the next message is longer than the nNumberOfBytesToRead parameter specifies, ReadFile returns FALSE and GetLastError returns ERROR_MORE_DATA. The remainder of the message can be read by a subsequent call to the ReadFile or PeekNamedPipe function.

To be clear, I'd have to check if we actually do handle this (I suspect not) but if we're going strictly by the Read trait documentation then we should test if any bytes are read and return success if so. The other option, of course, would be to weaken the std documentation.

Does this case really matter? The Read trait is for byte streams, not message streams. For example, UdpSocket doesn't implement the Read trait (I assume for this reason).

The example you give is very similar to the MSG_TRUNC flag in a recvmsg() call. But for a bytestream this can never happen. For Read, reading a partial message, swallowing the error and performing another read is the correct thing to do.

ChrisDenton · 2024-05-21T14:03:11Z

The Read trait is implemented for File which means it needs to work for anything you can open with File::open, The read method also has zero context for the read. All this means it's completely oblivious to what it is reading and has to work (for some definition of "work") in all situations, especially when we provide a "guarantee".

Maybe swallowing any error is fine. But I think in an ideal world we'd have some way for the user to access this information.

Amanieu · 2024-05-21T16:21:10Z

We discussed this in the libs-api meeting today. We agree that this can be surprising behavior, but don't see a good way to change the API to avoid it. Our conclusion was to update the documentation to say that returning an error after reading some bytes is allowed, but strongly discouraged.

Pr0methean · 2024-05-21T16:37:41Z

Maybe the return type should be Result<usize, (usize, io::Error)> so that the length of the partial read is available?

fintelia · 2024-05-21T17:42:27Z

I disagree that the signature of read is to blame here. That method is intended to be called repeatedly in a loop to get all the data you need. Thus if an implementation wants to return both data and an error it'll return the data the first time read is called and cache the error to return on the second call.

The problem comes in because a default implementation of read in terms of read_buf cannot implement caching behavior because it doesn't have access to fields within the reader. Thus, another option would be for the standard library to document that the default implementation of read swallows errors if any bytes are produced, and recommend that if read_buf is capable of producing both data and an error that the implementer should also provide a read implementation that caches errors.

farnz · 2024-07-11T08:56:03Z

On internals.r-l.o, keepsimple1 commented that they didn't realise that ReadBuf has been renamed to BorrowedBuf rather than removed, because it's not obvious from the summary.

Could you update the summary to include something like:

This is now called BorrowedBuf rather than ReadBuf.

Thanks!

Amanieu · 2024-07-12T16:40:41Z

I've added a note in the summary, hopefully that should help.

nikomatsakis added B-RFC-approved Blocker: Approved by a merged RFC but not yet implemented. T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. C-tracking-issue Category: A tracking issue for an RFC or an unstable feature. labels Oct 28, 2020

nikomatsakis mentioned this issue Oct 28, 2020

RFC: Reading into uninitialized buffers rust-lang/rfcs#2930

Merged

rustbot assigned beepster4096 Oct 28, 2020

taiki-e mentioned this issue Oct 30, 2020

Add AsyncRead::poll_read_buf based on RFC 2930 rust-lang/futures-rs#2209

Open

KodrAus added the Libs-Tracked Libs issues that are tracked on the team's project board. label Nov 6, 2020

KodrAus added the A-io Area: std::io, std::fs, std::net and std::path label Nov 28, 2020

tesaguri mentioned this issue Nov 29, 2020

io: propose new AsyncRead / AsyncWrite traits tokio-rs/tokio#2716

Open

notgull mentioned this issue Dec 1, 2020

Implement reading buffers in term of ReadBuf bread-graphics/breadx#3

Closed

tomhoule mentioned this issue Dec 1, 2020

Do not swallow errors in simple query. Fixing transaction descriptors. Read envchanges correctly. prisma/tiberius#105

Merged

newpavlov mentioned this issue Dec 21, 2020

Implement Fill for [MaybeUninit<T>] rust-random/rand#1080

Closed

Kestrer mentioned this issue Jan 10, 2021

Tracking issue for Read::initializer #42788

Closed

tmandry mentioned this issue Jan 12, 2021

AsyncRead, AsyncWrite traits rust-lang/wg-async#23

Closed

beepster4096 mentioned this issue Jan 18, 2021

Implement most of RFC 2930, providing the ReadBuf abstraction #81156

Merged

jmillikin mentioned this issue Nov 2, 2023

ACP: Move std::io::Borrowed{Buf,Cursor} into core::io rust-lang/libs-team#290

Closed

VictorKoenders mentioned this issue Nov 6, 2023

get_byte_buffer equivalent in bincode 2? bincode-org/bincode#679

Closed

Thomasdezeeuw mentioned this issue Dec 8, 2023

Adding methods for accepting &mut [MaybeUninit<u8>] tokio-rs/mio#1574

Closed

taralx mentioned this issue Mar 16, 2024

Tracking Issue for core_io_borrowed_buf #117693

Open

4 tasks

VictorKoenders mentioned this issue Mar 17, 2024

Support retrieving the bytes written from encode bincode-org/bincode#703

Closed

a1phyr mentioned this issue Mar 22, 2024

Unix: Add read_buf_at and read_buf_exact_at to FileExt #122887

Closed

a1phyr mentioned this issue Apr 8, 2024

Enable accessing written data in a BorrowedCursor rust-lang/libs-team#367

Closed

ChrisDenton added the I-libs-api-nominated Nominated for discussion during a libs-api team meeting. label May 21, 2024

Amanieu removed the I-libs-api-nominated Nominated for discussion during a libs-api team meeting. label May 21, 2024

a1phyr mentioned this issue May 22, 2024

Fix read_buf uses in std #125404

Open

Evian-Zhang mentioned this issue Jun 14, 2024

ptrace::getregs & ptrace::getregset may lead to UB of uninitialized data read nix-rust/nix#2447

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tracking Issue for RFC 2930 (read-buf) #78485

Tracking Issue for RFC 2930 (read-buf) #78485

nikomatsakis commented Oct 28, 2020 •

edited by Amanieu

Loading

nikomatsakis commented Oct 28, 2020

beepster4096 commented Oct 28, 2020

beepster4096 commented Nov 8, 2020 •

edited

Loading

sfackler commented Nov 8, 2020

programmerjake commented Nov 8, 2020

sfackler commented Nov 8, 2020

sfackler commented Dec 5, 2020

beepster4096 commented Dec 5, 2020

beepster4096 commented Dec 17, 2020

sunshowers commented Jan 19, 2021

erickt commented Apr 28, 2021

djc commented Apr 28, 2021

Amanieu commented Apr 28, 2021

jmillikin commented Nov 2, 2023

the8472 commented Nov 23, 2023 •

edited

Loading

tgross35 commented Feb 26, 2024

a1phyr commented Apr 9, 2024

ChrisDenton commented Apr 9, 2024

a1phyr commented May 21, 2024

ChrisDenton commented May 21, 2024

de-vri-es commented May 21, 2024 •

edited

Loading

ChrisDenton commented May 21, 2024

Amanieu commented May 21, 2024

Pr0methean commented May 21, 2024

fintelia commented May 21, 2024

farnz commented Jul 11, 2024

Amanieu commented Jul 12, 2024

Tracking Issue for RFC 2930 (read-buf) #78485

Tracking Issue for RFC 2930 (read-buf) #78485

Comments

nikomatsakis commented Oct 28, 2020 • edited by Amanieu Loading

About tracking issues

Steps

Unresolved Questions

Implementation history

nikomatsakis commented Oct 28, 2020

beepster4096 commented Oct 28, 2020

beepster4096 commented Nov 8, 2020 • edited Loading

sfackler commented Nov 8, 2020

programmerjake commented Nov 8, 2020

sfackler commented Nov 8, 2020

sfackler commented Dec 5, 2020

beepster4096 commented Dec 5, 2020

beepster4096 commented Dec 17, 2020

sunshowers commented Jan 19, 2021

erickt commented Apr 28, 2021

djc commented Apr 28, 2021

Amanieu commented Apr 28, 2021

jmillikin commented Nov 2, 2023

the8472 commented Nov 23, 2023 • edited Loading

tgross35 commented Feb 26, 2024

a1phyr commented Apr 9, 2024

ChrisDenton commented Apr 9, 2024

a1phyr commented May 21, 2024

ChrisDenton commented May 21, 2024

de-vri-es commented May 21, 2024 • edited Loading

ChrisDenton commented May 21, 2024

Amanieu commented May 21, 2024

Pr0methean commented May 21, 2024

fintelia commented May 21, 2024

farnz commented Jul 11, 2024

Amanieu commented Jul 12, 2024

nikomatsakis commented Oct 28, 2020 •

edited by Amanieu

Loading

beepster4096 commented Nov 8, 2020 •

edited

Loading

the8472 commented Nov 23, 2023 •

edited

Loading

de-vri-es commented May 21, 2024 •

edited

Loading