rockorager's comments

If you work with LLM agents, you will immediately be able to tell this issue is written by one. The time cost of this sort is almost certainly not real, as others have pointed out.

I’ve had agents find similar “performance bottlenecks” that are indeed BS.


As one of the commenters on the issue wrote, arguing over whether the text is AI-generated or not is essentially useless.

The important question is: is this an actual performance bug?


The question is whether credence, and therefore time, should be given to the claim that the performance bug exists.


Oh no - where at?


Technically, the first, third, and fifth occurrences of "it's" should be "its". The dog chased its tail.

I didn't notice when I read the article, though. The original commenter is being pedantic.


And since you're asking, beware of "let's" too: the third one is wrong!


You would have to write your I/O layer to have a fallback. The Ghostty project uses `io_uring`, but on kernels where it isn't available it falls back to an `epoll` model. That's all handled at the library level by libxev.
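If you're curious what that probe looks like, here's a minimal C sketch with liburing (libxev is Zig, so this is just the general pattern, not its actual code):

    /* Probe for io_uring at startup; fall back to epoll on kernels
       that lack it or have it disabled. */
    #include <errno.h>
    #include <liburing.h>
    #include <stdio.h>
    #include <sys/epoll.h>
    #include <unistd.h>

    int main(void) {
        struct io_uring ring;
        int ret = io_uring_queue_init(8, &ring, 0);
        if (ret == 0) {
            puts("backend: io_uring");
            io_uring_queue_exit(&ring);
        } else if (ret == -ENOSYS || ret == -EPERM) {
            /* Old kernel, or io_uring disabled by policy: use epoll. */
            int epfd = epoll_create1(0);
            if (epfd < 0) return 1;
            puts("backend: epoll");
            close(epfd);
        } else {
            fprintf(stderr, "io_uring_queue_init: %d\n", ret);
            return 1;
        }
        return 0;
    }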


Yeah, I wrote this as a fun little experiment to learn more about io_uring. The practical savings of using this are tiny, maybe 5 seconds over your entire life. That wasn't the point haha


I'd be curious to know if this helps on supercomputers, which are notorious for frequently hanging for a few seconds on an `ls -l`.


It could, but it's important to keep in mind that the filesystem architecture there is also very different: a parallel filesystem with disaggregated data and metadata.

When you run `ls -l` you could potentially be enumerating a directory with one file per rank, or worse, one file per particle or something. You could try making the read fast, but I also think it makes no sense to have that many files: there are things you can do to reduce the number of files on disk. Also, many are trying to push for distributed object stores instead of parallel filesystems... fun space.


It's a very cool experiment. I just wanted to steer the conversation towards those things rather than whether or not this is a good ls replacement, because, like you say, that feels like missing the point.


Author of the project here! I have a little write-up on this here: https://rockorager.dev/log/lsr-ls-but-with-io-uring


Nice write-up. I suspect you're measuring the cost of abstraction. Specifically, routines that can handle lots of things (like locale-based strings and UTF-8 characters) have more to do before they can produce results. This was something I ran into head-on at Sun when we did the I18N[1] project.

In my experience there was a direct correlation between the number of different environments where a program would "just work" and its speed. The original UNIX ls(1), which had fixed maximum filename lengths, no pesky characters allowed, everything representable in 7-bit ASCII, and only the 12 bits of metadata that God intended[2], was really quite fast. Add a VFS that maps the source file system into the parameters of the "expected" file system, and that adds delay. You're mapping different character sets? Adds delay. Colors for the display? Adds delay. Small costs that add up.

1: The first time I saw a long word like 'internationalization' reduced to its first and last letters plus the count of letters in between :-).

2: Those being Read, Write, and eXecute for user, group, and other, plus setuid, setgid, and 'sticky' :-)


> the 12 bits of meta data that God intended

Naw, that's Dennis Ritchie. You're thinking of the other white-bearded guy that hangs out in heaven.


Other people attribute i18n to DEC circa 1985.

http://www.i18nguy.com/origini18n.html


I did not mean to imply I invented it; that was just the first time I had seen that form of shorthand. Presumably whoever at Sun called it that was aware of other efforts; I was but a lowly entry-level programmer in the systems group. That was long before I had acquired the power to name things at Sun :-)


How much of the speedup over GNU ls is due to lacking localization features? Your results table is pretty much consistent with my local observations: in a dir with 13k files, `ls -al` takes 33 ms, but 25% of that time is spent by libc in `strcoll`. Under `LC_ALL=C` it takes just 27 ms, which is much closer to the time of your program.


I didn't include `busybox` in my initial table, so it isn't in the blog post, but the repo has the data. I am 99% sure busybox does not have locale support, so I think GNU ls without locale support would be closer to busybox.

Locales also bring in much more complicated sorting, so that could be a factor too.
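To make the locale cost concrete, here's a tiny C illustration (not GNU ls's actual code) of the two comparators in play. strcoll honors LC_COLLATE, while strcmp is the plain byte order you get under LC_ALL=C:

    #include <locale.h>
    #include <stdio.h>
    #include <string.h>

    int main(void) {
        /* Assumes the en_US.UTF-8 locale is installed. */
        setlocale(LC_COLLATE, "en_US.UTF-8");
        const char *a = "apple", *b = "Banana";
        printf("strcoll: %d\n", strcoll(a, b)); /* dictionary order: apple < Banana */
        printf("strcmp:  %d\n", strcmp(a, b));  /* byte order: 'a' (97) > 'B' (66) */
        return 0;
    }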


My bfs project also uses io_uring: https://github.com/tavianator/bfs/blob/main/src/ioq.c

I'm curious how lsr compares to `bfs -ls`, for example. bfs only uses io_uring when multiple threads are enabled, but maybe it's worth using it even for `bfs -j1`.


Oh, that's cool. `find` is another tool I thought could benefit from io_uring like `ls`. I think it's definitely worth enabling io_uring even for single-threaded applications, for the batching benefit: the kernel will still spin up a thread pool to get the work done concurrently, but you don't have to manage that in your codebase.
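The batching idea, sketched in C with liburing (lsr itself is Zig, so this is the shape of the technique rather than its code): queue one statx per entry, submit the whole batch with a single syscall, then reap the completions.

    #include <fcntl.h>       /* AT_FDCWD */
    #include <liburing.h>
    #include <linux/stat.h>  /* struct statx, STATX_BASIC_STATS */
    #include <stdio.h>

    /* Assumes n is small enough to fit in the ring. */
    int batch_statx(const char **names, struct statx *out, unsigned n) {
        struct io_uring ring;
        if (io_uring_queue_init(n, &ring, 0) < 0) return -1;

        for (unsigned i = 0; i < n; i++) {
            struct io_uring_sqe *sqe = io_uring_get_sqe(&ring);
            io_uring_prep_statx(sqe, AT_FDCWD, names[i], 0,
                                STATX_BASIC_STATS, &out[i]);
            io_uring_sqe_set_data64(sqe, i);
        }
        io_uring_submit(&ring);            /* one syscall for n stats */

        for (unsigned i = 0; i < n; i++) { /* reap n completions */
            struct io_uring_cqe *cqe;
            io_uring_wait_cqe(&ring, &cqe);
            if (cqe->res < 0)
                fprintf(stderr, "statx %llu: %d\n",
                        (unsigned long long)cqe->user_data, cqe->res);
            io_uring_cqe_seen(&ring, cqe);
        }
        io_uring_queue_exit(&ring);
        return 0;
    }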


I did try it a while ago and it wasn't profitable, but that was before I added stat() support. Batching those is probably good.


And grep/ripgrep. Or did ripgrep migrate to io_uring already?


No, ripgrep doesn't use io_uring. I don't know if it ever will.


Curious: why? Is it not a good fit for what ripgrep does? Isn't the sort of "streaming", "line at a time" I/O that ripgrep does a good fit for async I/O?


For many workloads, ripgrep spends the vast majority of its time searching through files.

But more practically, it would be a terror to implement. ripgrep is built on top of platform-specific standard file system APIs. io_uring would mean a whole heap of code to work with a different syscall pattern, in addition to the existing code path for non-Linux targets.

So to even figure out whether it would be worth doing that, you would need to do a whole bunch of work just to test it. And because of my first point above, there is a hard limit on how much of an impact it could even theoretically have.

Where I would expect this to help is in batching syscalls during directory tree traversal. But I have no idea how much it would help, if at all.


I believe that io_uring does not support getdents (despite multiple patch series being proposed). So you'd get async stat(), if you need them, but nothing else.
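For reference, directory enumeration itself still has to go through the synchronous path: roughly, raw getdents64, the syscall behind readdir() (C sketch):

    #define _GNU_SOURCE
    #include <dirent.h>
    #include <fcntl.h>
    #include <stdio.h>
    #include <sys/syscall.h>
    #include <unistd.h>

    int main(void) {
        int fd = open(".", O_RDONLY | O_DIRECTORY);
        if (fd < 0) return 1;
        char buf[16384];
        long n;
        /* Each call blocks; there is no io_uring opcode to batch these. */
        while ((n = syscall(SYS_getdents64, fd, buf, sizeof buf)) > 0) {
            for (long off = 0; off < n; ) {
                struct dirent64 *d = (struct dirent64 *)(buf + off);
                puts(d->d_name);
                off += d->d_reclen;
            }
        }
        close(fd);
        return 0;
    }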


Oh. Well, that's definitely a deal breaker then. ripgrep already avoids stat calls on most files.


At those time scales, you would be better off using `tim` (https://github.com/c-blake/bu/blob/main/doc/tim.md) than hyperfine (and not just because that is your name! That's just a happy coincidence from clipping one letter off the word "time" :-) ), even though its being in Nim might make it more of a challenge.


(Thanks - we'll make that the main link (since it has more background info) and include the repo thread at the top as well.)


This is fantastic stuff. I'm working on a C++ project right now with an eye to eventual migration, in whole or in part, to Zig. My little `libevring` thing is pretty young and I'd be very open to replacing it with `ourio`.

What's your feeling on having C/C++ bindings in the project as a Zig migration path for such projects?


I think exposing a C lib would be very nice. Feel free to open a discussion or issue on GitHub.


I've discussed this a bit with Mitchell in the past, and there are a few ways to opt in. First, let's imagine there is some escape sequence that tells the terminal to ignore some set of escape sequences.

1. The shell could have a keybind (say, ctrl+enter) which runs the command with (for example) only styling enabled (CSI m sequences).

2. You could write a wrapper around any program to do the same (rough sketch after this list). The shell doesn't need to know anything about the system; instead you have a wrapper that disables and then re-enables said sequences.

3. A program itself can opt in by turning on the feature at launch. This allows CLI / TUI developers to "safeguard" their programs from malicious attacks.
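A toy version of the wrapper in option 2 might look like this in C. To be clear, the ?9999 guard sequences are placeholders for the imagined feature; no terminal implements them today:

    #include <stdio.h>
    #include <sys/wait.h>
    #include <unistd.h>

    int main(int argc, char **argv) {
        if (argc < 2) return 2;
        /* Hypothetical "restrict to styling-only sequences". */
        fputs("\x1b[?9999h", stdout);
        fflush(stdout);

        pid_t pid = fork();
        if (pid == 0) {
            execvp(argv[1], &argv[1]);
            _exit(127);
        }
        int status;
        waitpid(pid, &status, 0);

        /* Hypothetical "restore the full sequence set". */
        fputs("\x1b[?9999l", stdout);
        fflush(stdout);
        return WIFEXITED(status) ? WEXITSTATUS(status) : 1;
    }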


I understand the fear of living near such a heinous crime, but you have to see the difference between the two situations. Random crime is scary, but it is random. All signs point to this being an assassination. Brian Thompson wasn't the victim of a random criminal, nor was he murdered by some person he had a connection with. He was assassinated. So the right question is: why do we investigate assassinations differently than murders? And to me, the answer is clear: an assassination has far larger societal implications than a murder.


Yes, the purpose of an assassination is to have societal implications. In this case, the implications primarily threaten the people in power. Thus, those people direct their forces at preventing and prosecuting such acts.

This is all totally consistent with OP’s charge that there is a two-tiered justice system. Personally, I would much rather see the man who murdered OP’s neighbor in jail than Luigi Mangione.


The average citizen is more at risk of being the victim of random crime than of a targeted assassination.


It could definitely work for that. You have to think of Zig in this context as a build system that can download and compile C/C++ codebases.


Exactly - these should be reported along with any other input event as an escape sequence. There are other comments on this post that link to how this is done.


The single downside: SIGWINCH is pushed to you as a signal, while the (legacy) xterm escape sequence requires polling.

I wrote a proposal[0] to have terminals send these updates in-band, instead of requiring polling a la xterm. So far foot, Ghostty, iTerm2, and kitty have implemented this feature on the terminal side. Neovim, notably, also supports it on the client side.

[0]: https://gist.github.com/rockorager/e695fb2924d36b2bcf1fff4a3...
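For contrast, here's the signal-based path the proposal replaces: a standard POSIX sketch in C where SIGWINCH pushes the notification and TIOCGWINSZ fetches the new size. The in-band report delivers the same information as terminal input instead, so it works even on platforms without SIGWINCH.

    #include <signal.h>
    #include <stdio.h>
    #include <sys/ioctl.h>
    #include <unistd.h>

    static volatile sig_atomic_t resized = 0;
    static void on_winch(int sig) { (void)sig; resized = 1; }

    int main(void) {
        struct sigaction sa = {0};
        sa.sa_handler = on_winch;
        sigemptyset(&sa.sa_mask);
        sigaction(SIGWINCH, &sa, NULL);

        for (;;) {
            pause();                 /* sleep until a signal arrives */
            if (resized) {
                resized = 0;
                struct winsize ws;
                if (ioctl(STDOUT_FILENO, TIOCGWINSZ, &ws) == 0)
                    printf("now %u cols x %u rows\n", ws.ws_col, ws.ws_row);
            }
        }
    }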


This stuff sucks because if the application forgets to turn it back off, the terminal pukes random control characters into your shell every time the window is resized.


That's what `reset` is so useful for.

