I find that even though this isn't standard, these CLI tools will scan the repo for .md files and, for the most part, execute the skills accordingly. That said, I would much prefer standards, not just for this but for plugins as well.
Standards for plugins make sense, because you're establishing a protocol that both sides need to follow in order to work together.
But I don't see why you need a strict standard for "an informal description of how to do a particular task". I say "informal" because it's necessarily written in prose -- if it were formal, it'd be a shell script.
I mean, it'd be good if these tools followed the XDG Base Directory spec and put their config in `~/.config/claude` etc. instead of `~/.claude`.
It's one of my biggest pet peeves with a lot of these tools (admittedly many of them have a config env var to override the location, but it'd be nice if they just did the right thing automatically).
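i.e. something like (the file name is just illustrative):

    ~/.config/claude/settings.json   # XDG-compliant
    ~/.claude/settings.json          # what it actually does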
Eventually, you can standardize what you don't understand
The problem I see now is that everyone wants to be the winner of a hype cycle and be the standards-bringer. How many "standards" have we seen put out by now? No one talks about MCP much anymore, I haven't seen LangChain mentioned in more than a year; will we be talking about Skills in another year?
Why do I want to throw away my dependency management system and shared libraries folder for putting scripts in skills?
What tools do they have access to, and can I define that dynamically? Do skills even have a concept of sub-tools or sub-agents? Why do I want to put references in a folder instead of a search engine? Does frontmatter even make sense? Why not something closer to a package.json file sitting next to the skill?
Does it even make sense to have skills in the repo? How do I use them across projects? How do we build an ecosystem and dependency-management system for skills (which are themselves versioned)?
> They are more than that, for example the frontmatter and code files around them.
You are right. I have edited my post slightly.
> Why do I want to throw away my dependency management system and shared libraries folder for putting scripts in skills?
You don't have to put scripts in skills. The script can be anywhere the agent can access. The skill just needs to tell the LLM how to run it.
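For example (the paths, skill name, and script are made up; the name/description frontmatter fields are the ones Claude Code's docs describe):

    ---
    name: release-notes
    description: Generate release notes from merged PRs. Use when asked for a changelog or release summary.
    ---

    Run the existing script from the repo root:
    `node tools/release-notes.js --since <last-tag>`
    The script stays in tools/, versioned and dependency-managed like any other code.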
> Does it even make sense to have skills in the repo? How do I use them across projects?
You don't have to put them in the repo. E.g. with Claude Code you can put project-specific skills in `.claude/skills` in the repo and system-wide skills in `~/.claude/skills`.
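Concretely (the skill names here are invented):

    your-repo/.claude/skills/deploy/SKILL.md    # project-specific, travels with the repo
    ~/.claude/skills/changelog/SKILL.md         # system-wide, follows you across projects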
2. The spec / docs show people how to put code in a subdir. While you can reference external scripts, there is a blessed pattern that seems like an anti-pattern to me
3. Generalize: how do I store, maintain, and distribute skills shared by employees who work across multiple repos? Sounds like standard dependency management to me. It does to some of the people building collections/registries, too. Not sure if any of them account for versioning; I haven't seen anything tied to lock files (though I'd avoid that by using MVS, i.e. minimal version selection, for dependency selection).
Agreed. I think being overly formal about what can be in the frontmatter would be a mistake, but the beauty of doing this with an LLM is that you can pretty much emulate skills in any agent by telling it to start by reading the frontmatter of each skills file and use that to decide when to read the rest, so given that as a fallback, it's hardly imposing some massive burden to standardise it a bit.
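A rough sketch of that fallback in Node (the directory layout and the naive frontmatter regex are assumptions, not any tool's actual implementation):

    import { readFileSync, readdirSync } from "node:fs";
    import { join } from "node:path";

    // Collect the frontmatter of every skill file so an agent can decide
    // which full skill bodies are worth reading for the task at hand.
    function indexSkills(dir) {
      const index = [];
      for (const entry of readdirSync(dir, { withFileTypes: true })) {
        const path = join(dir, entry.name);
        if (entry.isDirectory()) {
          index.push(...indexSkills(path)); // skills often live one dir deep
        } else if (entry.name.endsWith(".md")) {
          const text = readFileSync(path, "utf8");
          const m = text.match(/^---\n([\s\S]*?)\n---/); // naive YAML frontmatter
          if (m) index.push({ path, frontmatter: m[1] });
        }
      }
      return index;
    }

    // Hand this to any agent with: "read the rest of a file only when its
    // frontmatter looks relevant to the current task."
    console.log(indexSkills(".claude/skills"));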
I see it similar to browser user-agents all claiming to be an ancient version of Mozilla or KHTML. We pick whatever works and then move on. It might not be "correct," but as long as our tools know what to do, who cares?
My repos are littered with agent-specific files containing “treat this other file as if it were this one.” We’re moving so fast on so many fronts, and it seems odd that this is the persistent problem. It doesn’t even help lock folks into one agent, so I’m not clear why the industry hasn’t standardized on one project-specific file name yet.
That doesn't work very well if your developers are on Windows (and most are). Uneven Git support for symbolic links across platforms is going to end up causing more problems than it solves.
It's why I wrapped my tiny skills repo with a script that softlinks them into whichever skills folder you use, defaulting to Claude's, but it could be any other.
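Roughly like this, in case it's useful to anyone (a sketch; the non-Claude target and the flat skills/ layout are assumptions):

    import { existsSync, mkdirSync, readdirSync, symlinkSync } from "node:fs";
    import { homedir } from "node:os";
    import { join, resolve } from "node:path";

    // Where each agent expects skills to live; Claude is the default.
    const TARGETS = {
      claude: join(homedir(), ".claude", "skills"),
      other: join(homedir(), ".other-agent", "skills"), // placeholder
    };

    const target = TARGETS[process.argv[2] ?? "claude"];
    mkdirSync(target, { recursive: true });

    // Softlink every skill from this repo into the agent's folder.
    for (const name of readdirSync("skills")) {
      const link = join(target, name);
      if (!existsSync(link)) symlinkSync(resolve("skills", name), link, "dir");
    }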
I treat my skills the same way I would write tiny bash scripts and fish functions in days gone by to simplify my life by typing 2 words instead of 2 sentences. A tiny improvement that only makes sense to a programmer at heart.
> It's in Java, but the lessons can be applied in every language.
I can only discourage anyone from applying Java patterns all over the place. One example in JavaScript: there was some functionality that required parameters with default values. The plain solution would have been:
    function doStuff({ x = 9, y = 10 } = {}) { ... }
Instead, they created a class with private properties and used the builder pattern to set them. Totally unnecessary.
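For contrast, the builder version looked roughly like this (reconstructed; the names are made up):

    class StuffParams {
      #x = 9;
      #y = 10;
      withX(x) { this.#x = x; return this; }
      withY(y) { this.#y = y; return this; }
      build() { return { x: this.#x, y: this.#y }; }
    }

    // A class, two setters, and a build() call to express one default override:
    doStuff(new StuffParams().withX(3).build());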
- Everything locally stored in the repo: PRs, comments, issues, discussions, boards, ...
- CLI first
- Offline first (+ syncing)
- A website for hosting/presentation
Noted :) In another comment I linked to beads, which is a cool project to keep your issue tracker in your repo, but that's just a personal thing, no comment on what the company plans to do (or not) in this area.
I use command-line tooling much more than IDEs (e.g. VS Code), so the `gh` command-line tool (https://cli.github.com) for doing most of the usual hub-oriented workflow (PR authoring, viewing issues, status updates, etc) really helps a lot - I don't have to constantly <cmd>+<tab> to my browser, and point-click-point-click through web pages so much. It would be fantastic if ersc or any other jj-centered code-sharing hub had similar tooling early on.
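For anyone who hasn't tried it, the day-to-day loop with `gh` is roughly (all real subcommands; the PR number is just an example):

    gh pr create --fill        # open a PR from the current branch
    gh pr status               # where your PRs stand
    gh pr checkout 123         # pull down a colleague's PR locally
    gh issue list --assignee @me
    gh pr view --web           # pop a browser only when you really need one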
When I tried Fossil it had things weirdly separated.
I was expecting that when I made a commit, I would have the facility to specify which issues it addressed, and it would close them for me automatically. There seemed to be so much opportunity to "close the loop" when the issue tracker etc. is integrated into your VCS, but it wasn't taken.
That's my favourite thing about fossil though. History is what it is, not simplified to look "clean" (i.e. hide what actually happened and when) and you get a lot fewer footguns to ruin everything by accidentally rebasing things to the wrong place without noticing.
I have huge respect for Mitchell, it's impressive what he achieved.
I agree with all the points of this article and would like to add one: Have a quick feedback loop. For me, it's really motivating to be able to make a change and quickly see the results. Many problems just vanish or become tangible to solve when you playfully modify your source code and observe the effect.
This perfectly aligns with my experience.
Every large project I have worked on showed a clear correlation between how easy it was to set up and run and the number of problems on the project, like bugs and missed deadlines.
Totally agree. I work in LLM training software and I believe progress in the field is actually much slower than it should be because of the excruciatingly long feedback loops involved in development. The software stacks are deep and abstract and much of the testing involves full integration tests that take a long time to spin up.
Interesting. What aspects of the development workflow/cycle have the most room for improvement (i.e. is there a ranking of the "height" of the "hanging fruit" throughout the process)? What sort of software tooling would help?
YES that is one of the all-time most inspiring talks I've ever seen. DX is so important. I got a taste for this kind of thing when I first encountered LiveReload (circa 2012?) and radically upgraded my and my team's webdev workflows.
E2E tests in a high ratio to other tests will cause problems. They’re slow and brittle and become a job all on their own. It’s possible that they might help at the start of debugging, but try to isolate the bugs to smaller units of code (or interactions between small pieces of code).
Hermetic e2e tests (i.e. ones that can run offline against fake APIs/databases) don't have that problem so much.
They also have the advantage that you can A) refactor pretty much everything underneath them without breaking the test, B) test realistically (an underrated quality) and C) write tests which more closely match requirements rather than implementation.
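For instance, with Playwright you can fake the network layer so the suite runs offline (the endpoint and payload here are invented):

    import { test, expect } from "@playwright/test";

    test("shows the user's orders", async ({ page }) => {
      // Fake the API so the test is hermetic: offline, deterministic, fast.
      await page.route("**/api/orders", (route) =>
        route.fulfill({ json: [{ id: 1, status: "shipped" }] })
      );
      await page.goto("/orders");
      await expect(page.getByText("shipped")).toBeVisible();
    });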
> i.e. ones that can run offline against fake APIs/databases
I can see a place for this, but these are no longer e2e tests. I guess that’s what “hermetic” means? If so it’s almost sinister to still call these e2e tests. They’re just frontend tests.
> A) refactor pretty much everything underneath them without breaking the test
This should always be true of any type of tests unless it’s behavior you want to keep from breaking.
> B) test realistically (an underrated quality)
Removing major integration points from a test is anything but realistic. You can do this, but don’t pretend you’re getting the same quality as a colloquial e2e tests.
> C) write tests which more closely match requirements rather than implementation
If you’re ever testing implementation you’re doing it wrong. Tests should let you know when a requirement of your app breaks. This is why unit tests are often kinda harmful. They test contracts that might not exist.
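A toy illustration of the difference, using node's built-in test runner (the module under test is invented):

    import test from "node:test";
    import assert from "node:assert/strict";

    // Hypothetical module under test.
    function sortOrders(orders) {
      return [...orders].sort((a, b) => a.createdAt - b.createdAt);
    }

    // Requirement-level: survives any refactor of how the sort is done.
    // An implementation-level test (e.g. spying that some quickSort helper
    // was called) would break on refactor while guarding no requirement.
    test("orders come back oldest first", () => {
      const orders = [{ createdAt: 5 }, { createdAt: 1 }];
      assert.equal(sortOrders(orders)[0].createdAt, 1);
    });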
> try to isolate the bugs to smaller units of code (or interactions between small pieces of code).
This is why unit tests before e2e tests.
It's higher risk to build on components without unit test coverage, even if the paltry smoke/e2e tests say it's fine per the customer's input examples.
Is it better to fuzz low-level components or high-level user-facing interfaces first?
IIUC in relation to Formal Methods, tests and test coverage are not sufficient but are advisable.
Competency Story: The customer and product owner can write BDD tests in order to validate the app against the requirements
Prompt: Write Playwright tests for #token_reference that run a named, factored-out login sequence, and then test, as a human user would, that clicking on Home navigates to / (given browser MCP and, recently, the Gemini 2.5 Computer Operator model).
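The resulting test might look something like this (the selectors and login helper are assumptions; #token_reference is left out):

    import { test, expect } from "@playwright/test";

    // Named, factored-out login sequence, reused across tests.
    async function login(page) {
      await page.goto("/login");
      await page.getByLabel("Email").fill("user@example.com");
      await page.getByLabel("Password").fill("hunter2");
      await page.getByRole("button", { name: "Log in" }).click();
    }

    test("clicking Home navigates to /", async ({ page }) => {
      await login(page);
      // Interact the way a human user would: find the link by its role.
      await page.getByRole("link", { name: "Home" }).click();
      await expect(page).toHaveURL("/"); // assumes baseURL is configured
    });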
And I would add that e2e tests should be more about the business rules: making sure everything is there for a specific flow, and not caring that much about the intricacies of things. As such, it should really be part of Ops, not Dev.
Quick feedback with unit tests can help. It can be a pain to decouple stuff so you can test them better, but it’s worth it IMO.
This might be said in jest. But does everything have to be for world domination? Is the guy not allowed to have actual hobby projects that go just where he fancies, including potentially nowhere at all, really...