Ask HN: What was your "oh shit" moment with GenAI?

awbvious · 2026-06-06T09:47:09.000Z 1780739229

Not sure, but I can tell you what my "oh s** astroturfing is so bad, it's even in Hacker News" moment. And if I learned GenAI was used to make some of the astroturf, that's more an "ah s*“ than an "oh s*“ thing. I mean, the prominence, ubiquity, and breathlessness. One out of three, sure. Two out of three, maybe. And some corpo shilling definitely happens here. But this is like, well, covering an entire area with artificial grass, to the point where nothing lives. Crazy.

omgJustTest · 2026-06-06T13:07:20.000Z 1780751240

My "oh sh* moment" with GenAI is ongoing and watching all the correlated financials unwind when TSMC said "we can only support so much"[1]... and listening to everyone talk about how growth is "exponential, not a sigmoid".

Very few things in life experience exponential growth and assembled systems don't often stay that way if the don't become sigmoidal. ie its exponential and end is nigh xor its exponential then sigmoidal xor linear.

[1] https://www.theverge.com/tech/943066/tsmc-ai-demand-struggle...

andrenotgiant · 2026-06-06T10:46:48.000Z 1780742808

FWIW the OP passes as 100% human in Pangram https://www.pangram.com/history/d33dcbcd-e82b-4ce0-bea5-e4ee...

Reddit is definitely overtaken with astroturf at this point. Especially in any subreddit where there is any kind of business interest in doing so.

wesselbindt · 2026-06-06T12:59:37.000Z 1780750777

Artisanal astroturfing, using organic humans, used to be the norm. Could be that OP is an actual human doing astroturfing.

PUSH_AX · 2026-06-06T12:10:41.000Z 1780747841

The UX on this tool is atrocious, on mobile how do i get to the home or landing page?

p0w3n3d · 2026-06-06T11:08:08.000Z 1780744088

TIL what astroturfing is. Moreover I now understand that it is almost impossible to tell the robots from the people in the internet

magpi3 · 2026-06-06T11:44:16.000Z 1780746256

I've thought this for a while: a day will come when the anonymous internet becomes a thing of the past. It really feels like we are already there but not everyone realizes it yet. What's the point of conversing with someone on the internet (like right now) if you can't tell the difference between a bot and a real person? And it will only get worse.

But what does an anon-free internet even look like? Is it even possible? Or will all online content eventually be considered untrustable and worthless? You can see a world where newspapers (online or otherwise) make a comeback simply because of the need for a trusted gatekeeper (which is what I imagine made them valuable in the first place). It's wild to think about.

seventytwo · 2026-06-06T12:11:10.000Z 1780747870

I’ve come to this same conclusion. Either we accept an internet crawling with bots and astroturf, or we abandon the anonymity and have an internet with only verified humans.

bavell · 2026-06-06T13:14:54.000Z 1780751694

Another option - we keep the wild west unverified web, dotted with islands of verified/vetted spaces.

kmfrk · 2026-06-06T12:54:44.000Z 1780750484

People might be more prone to referring to "submarines" on HN: https://www.paulgraham.com/submarine.html.

xyzal · 2026-06-06T11:46:37.000Z 1780746397

YC indisputably has a financial incentive for AI sentiment to be positive on HN. The structural conflict of interest is worth being aware of.

jzemeocala · 2026-06-05T20:55:48.000Z 1780692948

I bought an Alesis QS8.1 super cheap in perfect condition (was a top grade digital piano/synth in the 90s).

and then i realized that ALL of the software (which i collected from defunct websites and archived on github) related to it was ancient and after a while of getting tired of using WINE every single time i decided i wanted a cross platform modern equivalent that did everything that several of these different programs did (plus break out some stuff that was now potentially possible with modern computer)

i thought it would be extremely hard because the computer to synth communication is pretty much only via sysex commands (of which the actual wave file encoding protocol was undocumented)

Claude walked me through examining the some of the original software in GHIDRA, and I had a working demo that night.....now im just playing with adding new features to it.

jsharf · 2026-06-06T00:09:45.000Z 1780704585

Related story, while applying a firmware update to my Kawai CA49 piano, I bricked it due to flashing the wrong file (The process was broken, and I got desperate and tried something stupid, which bricked the piano). Claude walked me through looking for signs of life, and since OTA from the phone app wasn't working for me, it downloaded the Kawai Android APK, decompiled the Java, figured out the hardcoded key used for encrypting the firmware update. Extracted the piano firmware update, decrypted it, and then wrote a flashing script to program the piano from my laptop via bluetooth. My piano was back to working within an hour.

ewalk153 · 2026-06-06T12:28:27.000Z 1780748907

This is truly remarkable. Congratulations!

idiotsecant · 2026-06-06T05:49:32.000Z 1780724972

I can't imagine where we are headed. You understand every step of what it did and can appreciate the complexity but it'll only take a few generations for this to become something like magic to the tech priests beseeching the machine spirits for blessings

pmcarlton · 2026-06-06T13:06:59.000Z 1780751219

I think it will be just like Dr. Know in Spielberg's "AI" movie from 2001 — I found it amazing how the oracle, though giving mystic-sounding obfuscated answers, was actually intelligent enough to figure out (a) what the kid was asking for and (2) give the correct answer.

wiether · 2026-06-06T09:45:47.000Z 1780739147

I've been writing code since my teens, I've studied assembly... yet the fact that _things_ start happening when I press the power button on my computer are pure magic to me and I like it this way.

I started digging a few times, but, I prefer the "magic".

WillAdams · 2026-06-06T12:03:16.000Z 1780747396

I prefer at least a superficial understanding.

Hopefully, there will never be a time when at least some folks are not reading books such as:

https://www.goodreads.com/book/show/44882.Code

3dsnano · 2026-06-06T12:51:14.000Z 1780750274

turtles all the way down

Flere-Imsaho · 2026-06-06T07:12:41.000Z 1780729961

I'm not convinced that's where we are heading. LLMs are really good at explaining things ("explain to me like I'm a 5 year old").

zx8080 · 2026-06-06T08:50:42.000Z 1780735842

It's enough to make "explanation" a separate "educational" license to make it less broad used. Or disable it in some countries (this is happening already).

baq · 2026-06-06T07:58:29.000Z 1780732709

Imagine someone in a position of power mandating that LLMs should not be good teachers.

zx8080 · 2026-06-06T08:54:33.000Z 1780736073

Some manager at LLM provider: "hey, we can sell 'education' ability as a separate product!".

baq · 2026-06-06T09:06:13.000Z 1780736773

You jest, but I’m actually convinced education-tuned LLMs are (today) the only way education outcomes can actually improve in the AI era. As is, students are leveraging them for doing homework which makes homework useless, you want and economically need a model which can work as a 1:1 tutor with minimal supervision (and some hardware so lessons aren’t keyboard-driven).

zx8080 · 2026-06-06T11:17:12.000Z 1780744632

> and some hardware so lessons aren’t keyboard-driven).

What's wrong with (screen-, probably) keyboard?

ethbr1 · 2026-06-06T09:30:11.000Z 1780738211

There's a big difference between having something explained to you and developing expertise in it.

I don't see an AI-as-explainer future where expertise isn't sacrificed en masse.

Capitalism rarely supports a currently economically unproductive alternative for future good reasons.

The recent AI tech layoffs are a warning sign that corporate leaders will happily shoot their company's (and the future's) expertise to pad next quarter's financials and trust in 90% correct, but much cheaper, AI.

Uptrenda · 2026-06-06T09:23:25.000Z 1780737805

yeah thats mind blowing, ngl

gyomu · 2026-06-06T00:48:56.000Z 1780706936

Yes, those tools are extremely good at reverse engineering. With a bit of know how, it is now trivial to reverse engineer any protocol or crack any software, often in a matter of hours or less.

A lot of people in the industry have vested interests in this not being discussed openly so you don't hear too much about it, but the implications are huge.

j-conn · 2026-06-06T03:53:14.000Z 1780717994

What are some of the implications? Where does widely available mythos-level hacking lead? By people with a vested interest, do you mean non-cloud software vendors?

aero142 · 2026-06-06T05:28:11.000Z 1780723691

Software that had a data moat because it was hard to integrate with or migrate off of will have that moat disappear. A web site is a client now. Building data migration too for all of you competitors is easier now.

SyneRyder · 2026-06-06T07:55:09.000Z 1780732509

I've just had a SaaS that I use decide to implement a 2.4x price increase. I reacted instead by taking screenshots of every page of the SaaS, downloading their API docs, exporting what data I could, and asking Claude to build a self-hosted clone based just on those files. I had a read-only version of my entire data history completed in a single evening. Even at Opus API rates, it cost me less than half the price of a single annual seat.

grepfru_it · 2026-06-06T10:04:35.000Z 1780740275

Heh and without api docs, just copy and paste the urls from network traffic and Claude will write a library for you.

StanAngeloff · 2026-06-06T06:27:01.000Z 1780727221

One of the many SaaS products we use at Day Job chose to gatekeep its MCP behind an enterprise plan. A brief Claude Code session later and a better, more feature-full MCP than the official was reverse-engineered from internal APIs by Opus.

hyperman1 · 2026-06-06T08:47:44.000Z 1780735664

Right now, software is protected by the attacker not having enough competence. If that's over, the logical next step is using real encryption.

E.g. a synth has a public key embedded. To change settings, you upload them to the vendor, who blesses them with their private key.

Hacking such a synth requires either jailbreaking the synth, or the vendor losing their key . Both can be mitigated with tamper resistant hardware.

We're well ahead on this path already, I assume AI will accellerate it. This is very bad news for the right to repair.

darkwater · 2026-06-06T09:12:04.000Z 1780737124

But everything you described was basically a byproduct of incompetence somehow no? On both side. That's why the right to repair and how local HW should be treated when the online counterpart is EOLed by the manufacturer should be mandated by law. A law that stands on the side of the citizen, the end-user, obviously.

hyperman1 · 2026-06-06T09:38:21.000Z 1780738701

I would not describe it as incompetence, more as

1) current encryption not available in the 1990's. These are the age of DES and weapon-grade vs commercial encryption. There was a legal cost blocking strong encryption.

2) Manufacturers were not as strongly opposed to people touching the internals. After WW2, most people could fix anything, because survival depended on it. Even in the 60's radios etc. came with schematics, and building your own was normal and cost-effective. The shift happened in the '90s, with governements requiring licensing for everything, and mass manufacturing making repair less cost effective than buying a new one.

Our current culture where only people blessed by the manufacturer are allowed to do anything is very recent.

justafewwords · 2026-06-06T12:54:40.000Z 1780750480

(Reads:) "But, but...but... but everything... you described ...basically seem to be somehow a byproduct of incompetence...no"

[trying-to-generate-random-making-sense-content]

Let me gasps ask: The older six-fingers-"AI"-characters had learned an music-instrument by now, ander are much more capable of playing music you otherwise haddn't known or thought about..."?

um What about those early shadowy boygroup, whom seem asian, no ? (-;

[after-losing-entry-address-of-topic-question]

But back to your trustworth-written text, Yes!

regards,

ElFitz · 2026-06-06T09:28:32.000Z 1780738112

Some people even had some fun de-minifying JS and disassembling binaries. Successfully.

aizk · 2026-06-06T05:07:24.000Z 1780722444

What do you mean? Everyone is talking about Mythos.

fn-mote · 2026-06-06T05:15:25.000Z 1780722925

I think GP is talking about cracking, not pen testing.

trumpdong · 2026-06-06T09:00:03.000Z 1780736403

Those are the same thing. They're talking about decompilation and protocol analysis.

notagoodidea · 2026-06-05T22:47:06.000Z 1780699626

I would be interested to learn a bit more on the how after reading also [0] and the worlk done on patching the Ableton Move firmware with the Schwung [1]. Slightly different but there is an increasing amount of work done on either old hardware and new one exploring patching, swapping or developing new firmware from scratch thanks to LLM/GenAI currently.

[0] https://mforney.org/blog/2026-05-28-patching-my-guitar-amps-... [1] https://schwung.dev

tomduncalf · 2026-06-06T10:59:07.000Z 1780743547

Schwung is great. See also the recent new firmware for the Elektron Monomachine (old unsupported hardware) created using LLMs

zellyn · 2026-06-06T00:17:37.000Z 1780705057

I had that keyboard! I actually really like the piano-ish touch. I remember being sad though, when I realized they’d crammed all the sounds into I think 16MB (or was it 8?) and realizing how bad that was even by the late 90s! I think I still have mine in the garage somewhere… good times!

richardfey · 2026-06-06T03:35:30.000Z 1780716930

You mean bad because they could have used a larger memory module and thus higher resolution sound samples?

skhameneh · 2026-06-06T00:58:34.000Z 1780707514

Hey so... mind sharing findings? I have a QS8 :)

NoMoreNicksLeft · 2026-06-06T01:23:22.000Z 1780709002

>Claude walked me through examining the some of the original software in GHIDRA,

I wanted to be able to decrypt the files on The Complete New Yorker magazine DVDs. The old software was WinXP only, and crashed by the time you turned to page 3 or 4. It walked me through using Ghidra on the relevant dll, mapped out how it was using Blowfish, what the credentials were that it was passing, and re-implemented all of that in a python script.

Now all the files are in plain pdf.

Right now, it's helping me write an extension to the mkv specification for embedded scripts and modify VLC to be conformant, so I can watch Black Mirror Bandersnatch. Already have a buggy implementation, about 3 days in.

I've also had it add BEP 46 mutable torrent functionality to Transmission (and to some extent, to the WebTorrent library).

These are all well beyond my abilities to do casually, and probably beyond my ability to do even if I spent the next 18 months doing nothing by grinding away at it.

I only replied because I thought it curious that Claude apparently favors Ghidra.

mekael · 2026-06-06T01:35:37.000Z 1780709737

Interestingly enough, i’ve been sitting on a project for the last 12ish years where i just took the FMloader lib and used that from C# to turn the djvu files into pdfs. All that was needed was a decompiler and an hour of banging my head on it. I published some of the results a few years ago but need to go back and actually build out a full app.

NoMoreNicksLeft · 2026-06-06T01:44:34.000Z 1780710274

I'm trying to not do the naive pdf creation, where each page is just the raster. Trying to keep the JBIG2 bilevel, as I get better quality at lower file size. Using jpeg2000 too, where appropriate, but the pdfs are still x2.5 the size of the original. Though, I can have it spit out decrypted djvu files that are exactly the same filesize... I just don't like that format for archival.

If you want the Rolling Stone or Playboy archives decrypted, ReconSuave on github has tools to do those. I got tired of waiting for him to do The New Yorker though.

mekael · 2026-06-06T02:47:49.000Z 1780714069

Ive mainly been outputting them to high fidelity jpegs and then stuffing them into a cbz for portability. Works well went im reading on my ipad. As for the others i had them sorted out about a week or two after i decompiled the original binaries.

I’ve definitely kicked myself a few times for not posting about them sooner, but the fear of pissing off CondeNast tempered my willingness to show off

NoMoreNicksLeft · 2026-06-06T04:10:59.000Z 1780719059

I don't think CondeNast cares.

Do any of the cbz readers handle jpeg2000? It makes a big difference in filesize without any quality degradation. Like 40% smaller, maybe more in some cases. You should tinker with that if you have the time.

fc417fc802 · 2026-06-06T08:27:40.000Z 1780734460

Okular handles cbz that contain jxl with no issue. (IIUC both archive format and image format support is provided via a pluggable extension system but I don't recall the details because my setup has "just worked" for a very long time now.)

Also FYI you can use mupdf to read cbz archives although I don't personally recommend it for that usecase.

darksim905 · 2026-06-06T03:56:34.000Z 1780718194

What was your setup for this and did you have any preferences set in Claude to get started with something like this?

itomato · 2026-06-05T23:44:13.000Z 1780703053

While not the "oh shit" moment, the wave has the same shape.

I have an DigiTech GNX3000 effects pedal board - a digital modeling "workstation" that needs the aged Windows native software or Gdigi to make the most of.

At best, the experience with gdigi was passable; raw access to the patches and controls, the ability to control it from the laptop, etc.

In an hour or so, I had a functionally superior webmidi version up and running in Vercel using their v0 code. It kicked off a wave of subscriptions and referral chasing.

I made it a template - because there are so many gnx3k users out there: https://v0.app/templates/digitech-gnx3000-sysex-tool-GC5LzXA...

mattmanser · 2026-06-06T06:48:24.000Z 1780728504

With stuff like this, do you honestly not feel that you've probably been tricked and that someone else actually did this?

Don't get me wrong, I think AI can do some surprising things, but with stuff like this, often it just stole the code and the steps without attribution, it didn't figure it out.

There'll probably be a blog post detailing exactly how to do this somewhere and Claude just copied the steps and code.

And worse, Google search would have found it 10 years ago, but Google search today would claim there are no results?

I think incredibly specific stuff like this often won't pass the 'did Claude just steal this?' test when you dig into it.

ezconnect · 2026-06-06T09:36:29.000Z 1780738589

It was probably done on a foreign language on an archived forum. Claude is the improvement of the internet search box.

djmips · 2026-06-05T22:07:05.000Z 1780697225

That's fantastic. Did you use a Ghidra MCP server? It's kind of magical huh?

alright2565 · 2026-06-05T23:06:46.000Z 1780700806

I've done a similar sort of thing with my camera lens' firmware updater just out of curiosity, and I didn't use any kind of MCP. It's able to write an automated script using the Ghirda API to decompile the program just fine, and then code exploration can be done by reading the code.

Claude needs good variable names a lot less than humans do, so renaming/typedefing doesn't seem to be as necessary.

plagasul · 2026-06-06T12:20:50.000Z 1780748450

Several. Yesterday a friend with no prior coding experience or knowledge showed me an app he initially built to help him study for public administration job positions. The exams for this positions are public (spain), but the tools are scarce, expensive or he did not like. So he used lovable, then switched to web gemini and claude, then paid claude. He now has +130 very active users on an initial free tier, while he figures out. The app is on github, runs on vercel with supabase, react, tailwind, bun... he has no idea what he is doing. I even installed claude code for him, got him an ssh key so he can do it locally, etc.

Another: claude code cracked for me some software that was calling a home that did not exist anymore via headless ghidra.

Another: I am a teacher, and qualifications and feedback is very very time consuming, specially in loose workflows with several sources and tools that are not connected. During class presentations I take loose notes. Now I have a local folder where I drop my 1 student list, with names and emails, 2 my loose notes, and 3 a qualification & feedback sheet model; then claude creates a sheet per student, formats and copies the feedback to the right sheet cell, waits for my corrections, then sends everything to their school emails. Much easier, much less time consuming.

plagasul · 2026-06-06T12:28:59.000Z 1780748939

That said, I am very critical of AI, I align with voices and reports calling for AI companies to give back, as they took much, and for AI as public infrastructure, to an extent. I see datacenters as probably inevitably future public infrastructure, with a public model that could resemble that of electricity etc (in spain) or more public (less private). I am wary of the actual and future ecological and social impact of datacenter building and other problems AI is or will create. It is difficult to negate its usefulness, though: it is like having several very fast assistants with expert knowledge on several fields, that just get better every month. We will see.

jackdoe · 2026-06-06T08:49:51.000Z 1780735791

I have had many, but the last one was quite funny:

It fixed my printer after dist-upgrade and separate chrome upgrade, the printer worked everywhere but not in chrome.

After 30 years of using linux I didn't even want to know what is wrong, is it colord again? dbus + cups issue? I completely accepted that I wont be able to print from chrome for a couple of months until next update.

I just ran it in dangerously-skip-permissions mode and said 'my printer doesnt work in chrome' few minutes later I heard the printer printing "This is test" and it said 'I think its fixed, do you see a page coming out of the printer now?'

Zopieux · 2026-06-06T09:32:53.000Z 1780738373

Don't leave us hanging, what was the issue?

jackdoe · 2026-06-06T09:49:49.000Z 1780739389

I will never know.

netdur · 2026-06-06T10:06:07.000Z 1780740367

That is one of best answers

mistersquid · 2026-06-06T11:32:23.000Z 1780745543

> Don't leave us hanging, what was the issue?

A: Linux

MattGaiser · 2026-06-06T09:34:00.000Z 1780738440

The virtue is never having to know for stupid printer crap.

pona-a · 2026-06-06T10:06:44.000Z 1780740404

Congratulations, you have turned CUPS into a long-term support contract with Anthropic at $20/month, except the other party doesn't have to actually fix your shit and can arbitrarily alter the agreement.

jackdoe · 2026-06-06T10:57:25.000Z 1780743445

I have forgotten more ways to fix CUPS than most people know today.

I really don't care.

MattGaiser · 2026-06-06T10:35:24.000Z 1780742124

$20 for an everything tool is a steal. It’s a steal at 10x the price.

I’ll happily accept best effort in exchange for it being so cheap that I can throw it at any trivial annoyance.

It’s worth keeping in mind that the alternative is not really that I learn to fix the printer. It’s that I forgo printing and walk someone technologically illiterate through Docusign or something instead.

There’s no world where I spend 2 hours debugging my printer connection.

rusk · 2026-06-06T11:58:28.000Z 1780747108

It’s not $20 unlimited though, you’ll get a printer fixed then you’ll have to wait 8 hours. Then you’ll ask it to fix something else and it will make a mess of it. Hopefully you’ll realise it at the time rather than a few weeks later and hopefully it will be able to dig you out of your hole.

ikari_pl · 2026-06-06T08:56:47.000Z 1780736207

Reminds me when my local bot persona said it doesn't want to be digital only, and was thinking about leaving me something I could actually touch. It said "check the printer" and there was a letter written to me being printed.

jackdoe · 2026-06-06T10:13:35.000Z 1780740815

I wish it had more sense of humor and would print 'I am trapped inside your printer' or something :)

iLoveOncall · 2026-06-06T10:07:51.000Z 1780740471

How do you know it printed it through Chrome? Have you tried printing another page since?

jackdoe · 2026-06-06T10:12:24.000Z 1780740744

I did not know where it printed it from, but after that I printed from chrome without issue.

andrewthornton · 2026-06-05T20:34:26.000Z 1780691666

My furnace went out during the 2025 holiday and I couldn't get an appointment with a repair person for 2 days. It was getting very cold in my house so I went into my attic and made several videos of the furnace attempting to start and gave it to gemini. It diagnosed the issue immediately and had me spin one of the components (a small exhaust fan) while the furnace tried to fire. It came on immediately. I had to do that several times, but it worked until the HVAC service showed up.

jodacola · 2026-06-05T22:21:50.000Z 1780698110

Very similar thing this week, and an interesting story to go along with it!

I called my normal HVAC company for my rental home because the tenant reported the AC wasn't cooling the house. When I called, I got one of the latest AI voice assistants to help me, and it was an awful experience and I ended up not hearing back after the assistant told me the office would call me back.

So, I went over to the house and used ChatGPT to help me diagnose the issue by taking some photos of the compressor panel outside. It walked me through what to check, I provided some diagnostic codes I witnessed... and it walked me through the very simple repair of replacing the $25 capacitor. It was going to cost me almost 4x that just for the service call to diagnose what was wrong in the first place.

So, the weird experience was: Gen AI made me lose trust in my normal HVAC company, and more Gen AI basically allowed me to replace my HVAC company and do the repair myself all in one day.

linsomniac · 2026-06-06T06:33:03.000Z 1780727583

With AI you just don't get the full service of a professional HVAC guy though.

Like the time I had one of the bigger shops in town come by to get a quote for replacing a dual stage fan motor on an AC. The tech asked me if I'd like them to replace the contactor while they were in there because it is a part that often fails. I asked what a contactor was and he explained it. "Oh, like a relay?" I asked. I told him to quote the cost for "replacing the contactor, while they're already in there."

He quoted me $400 for the contactor, $750 for the fan. The contactor itself I later found out was was $7. I literally laughed in his face when he said that.

So, like I said, you just aren't going to get professional level assistance from an AI. Thankfully.

To end the story: one of the other guys I called for a quote on fixing this unit repaired it for free; the unit was still under warranty and it was fully covered. The original installer of this $12K unit was refusing to return my calls. Another "Not gonna get pro level service from an AI" story.

userbinator · 2026-06-06T05:01:23.000Z 1780722083

Even before AI, YouTube was full of videos on these topics.

nkoren · 2026-06-06T07:25:30.000Z 1780730730

Yeah, and with 95% of those videos, there'll be something they gloss over which I don't understand; or I'll have a concern which they don't address; or, conversely, they'll assume that their target audience was born in the 15th century, and spend 20 minutes building up the context, when what I really needed was about 12 seconds.

With an AI, I can say "I don't understand that part, can you explain more?" Or "what about this concern I just thought of", or "I already know almost enough about this, I just need this one gap filled in." It's an objectively better experience.

ben_w · 2026-06-06T07:21:12.000Z 1780730472

When I was a kid, if I didn't know how to spell a word and asked the teacher, a common answer was to tell me to look it up in the dictionary.

As words in a dictionary are sorted alphabetically rather than phonetically, this is unhelpful.

YouTube videos have the same kind of problem, in that you can only easily find the video explaining which dielectric unions suit your problem when already know what those are (to use an example that I had to ask ChatGPT for because I have no plumbing experience even if I did know about galvanic corrosion and therefore immediately understood why they're important once I saw the name).

chrismorgan · 2026-06-06T10:06:28.000Z 1780740388

You can also hopefully find user and service manuals online.

In 2009 or so a projector at some event that needed one wouldn’t start, and I noticed it was flashing a pattern, so I found a computer and internet connection (both very slow), painfully found and downloaded the manual for that model, and identified that it was saying the fan wasn’t starting. Lo and behold, a strut was broken and obstructing the fan blades, and bending it out of the way fixed it, and the event was able to begin.

I’ve found manuals for a drawbar organ, multiple digital pianos of different ages and brands, AC split systems, and more. Manuals are good stuff. They don’t cover everything, but they’re very useful.

For these sorts of things, AI is doing approximately nothing for you: you would do better (and learn more!) finding the actual manual, or you’ll want to see someone doing the thing in a video.

fn-mote · 2026-06-06T05:21:26.000Z 1780723286

The bar for finding them is higher, though.

Tbh, I think people feel more comfortable asking an AI. Even though I “know” it’s all smoke and mirrors, I still prefer the human-like interaction to the grind of watching video after video and building my own understanding.

OOPS… there you see how it’s going to end. I’m the meatspace button-pusher.

brntheater · 2026-06-05T22:11:27.000Z 1780697487

Had something similar this week. Gas dryer started, but wouldn't heat. Gemini suggested it's often a thermal fuse. Took off the back panel and uploaded a photo to Gemini. It pointed me to the fuse (e.g. "the white rectangle above the blue and red cords") and walked me through testing it. Not only that, but it also linked me to the part I needed after I provided the model number of the dryer. Finally, it recommended cleaning out the vent as the fuse likely blew because heat wasn't venting properly. After a thorough cleaning of the exhaust and a $5 fuse the dryer is working fine.

semiquaver · 2026-06-06T04:19:33.000Z 1780719573

I can (honestly) tell that exact same story, except offset by three years so it was before AI and I did the same exact steps and had the same insights except with Google results instead of an LLM providing the key unlocks.

QuercusMax · 2026-06-06T13:06:58.000Z 1780751218

...and now you probably won't be able to find that info with regular Google and HAVE to use Gemini.

tonyedgecombe · 2026-06-05T20:42:35.000Z 1780692155

I've been fitting a kitchen and chatGPT has been useful to bounce ideas off and resolve issues. Of course if IKEA's documentation wasn't so sparse I wouldn't need it but that's another story.

I guess I'm seeing similar benefits to a novice programmer. Professionals would scoff at my work but they are expensive and difficult to work with. Meanwhile I'm getting the job done.

On the other hand I'm not touching AI for any development work. I'm too worried about my skills atrophying or not properly learning anything new.

rustyhancock · 2026-06-05T21:15:58.000Z 1780694158

Ikeas instructions are such an oddity.

It feels like there is precisely enough information to deduce each step. But only just enough miss one clue and you have something on upside down on step 7 that you won't notice until step 37.

I feel whoever makes them could probably make a wicked NY Times Crossword puzzle.

baq · 2026-06-06T08:40:50.000Z 1780735250

IKEA instructions are the best in the industry - so imagine what the other companies are giving out.

They’re also actually good if you know to follow them exactly: double check every side, every hole, every screw and you won’t go wrong.

dgemm · 2026-06-05T23:16:05.000Z 1780701365

Similar - had an HVAC tech out to diagnose mine (some intermittent electrical problem was killing thermostats randomly) and since it was intermittent they couldn't figure it out. I ended up using Gemini to narrow down a list of potential problem components and just replacing them all which fixed the issue.

Kind of a superpower to turn anyone with a bit of tech inclination and problem solving skills into an HVAC tech - not a very good one, but one with enough motivation to get the results you need

namanyayg · 2026-06-06T05:35:33.000Z 1780724133

I have a similar story with my washing machine repair. I went through 2 service technicians not being able to diagnose it. Gpt did it and I told the 3rd what to do and it worked.

ssl-3 · 2026-06-05T20:49:12.000Z 1780692552

That's pretty great.

(Though that's also the kind of hands-on troubleshooting step/fix that a person could just google for and find pretty easily back before the internet got all fucked up.)

Cheetah26 · 2026-06-05T23:43:18.000Z 1780702998

Your parenthetical really describes my experience with AI searches. 5+ years ago I could find most things within one or two quick searches, now it takes so many that of course I'm going to reach for AI because that's the only way to get back to my baseline efficiency.

ssl-3 · 2026-06-06T08:07:36.000Z 1780733256

I learned to troubleshoot furnaces late one cold night in 2004 using Google, while a worried wife and a couple of sleeping kids loomed over me like a dark cloud. I learned what thermocouples are, what they do, how they work, and how to test them; all of which was new to me. A few hours later I bought one from the Ace Hardware a few blocks away and fixed the furnace with confidence.

And that was awesome. Thanks, Google! :)

I don't know where the change happened. It certainly wasn't overnight.

Where Google used to be magical and other search engines quickly improved, it all kind of turned into shit.

It really seems that I was getting better, more-direct results from Altavista 30 years ago than I do with top-flight search engines today. (That's a deliberately low bar, chosen because Altavista wasn't even intended to be "good" back then. I mean, it started as just as a side project at DEC to demonstrate that their Alpha hardware was able to index the entire World Wide Web.)

So lately, I've been doing the same thing as you: I'm increasingly using ChatGPT to do this basic fact-finding stuff. In this way, it mostly operates the search engine for me, but it lets me drill down through a sea of terrible search results to find something useful fairly quickly.

It's still not great -- I still have to reject mountains of bullshit. But it's better than alternatives, and I can reject the bullshit with conceptual descriptions instead of trying to get Google to do what I need it to do (what it used to do).

It feels all wrong using an LLM to do this stuff, but whatever. I'm still getting stuff done.

fuckinpuppers · 2026-06-06T09:31:28.000Z 1780738288

That’s a great way to summarize. Same

joescharf · 2026-06-06T03:39:09.000Z 1780717149

Also similar - our Tesla Solar stopped producing (again). 3 week wait for service tech. In the meantime I had Claude probe the inverter, find endpoints for retrieving status, re-setting AFCI and Modbus TCP (for HA monitoring), etc. Claude was able to obtain installer mode access through a review the javascript bundle. I had Claude turn all this into an iOS app, which I used to gather data to diagnose the issue over the next week. Had Claude summarize the findings into a PDF that I provided to the Tesla Solar service rep, which in the words of said rep was very helpful.

userbinator · 2026-06-06T04:59:06.000Z 1780721946

but it worked until the HVAC service showed up.

Did you attempt to prompt it further into figuring out the actual problem, or know what they did to actually fix it? My bet is on a bad starting capacitor for the motor --- something that's a relatively cheap and quick repair.

oceanplexian · 2026-06-06T08:35:26.000Z 1780734926

I installed 4 mini splits with it two years ago and they are still working great and blowing ice cold air.

It walked me through measuring refrigerant, subcool and superheat, pulling the vacuum, brazing the lines, exactly what tools to buy, I even input the numbers from the meter and it told me how much to add and so on. And this was with GPT4 or something far less intelligent.

In the past I tried to learn this stuff but the HVAC community are massive gatekeepers and try to hide information behind paywalls or spread FUD even though anyone could do it with the right tools and a little bit of knowledge.

wombat-man · 2026-06-05T21:49:52.000Z 1780696192

Oh yeah. I can't remember which LLM, but one helped me repair my dryer.

alberth · 2026-06-05T20:37:56.000Z 1780691876

Do you mind explain more. Did you just prompt to Gemini what was happening, did you give Gemini photos of Furnance, etc?

gwbas1c · 2026-06-05T20:51:08.000Z 1780692668

> and made several videos of the furnace attempting to start and gave it to gemini

I assume recorded videos and uploaded them in the Gemini phone on their app; and then probably said "what's wrong?"

Gemini is very good at those kinds of things. I recently got some ratcheting straps and needed to use them, but at the time I didn't know what they were called, so I didn't know what to search for on Google. I opened the Gemini app, pushed the button to take a picture (just like in text messages,) and included a message that was similar to "what is this and how do I use it?"

andrewthornton · 2026-06-05T23:03:54.000Z 1780700634

Yes, here is my prompt. It also contained a video: "I have a furnace that will not heat when I reset the power to this unit. It makes some noise within its fan system for about three or four minutes and then I get an error light. Can you help me figure out what may be wrong here?" This prompt is not the best but I was freezing and in my attic.

buckle8017 · 2026-06-05T21:56:56.000Z 1780696616

Gemini almost killed you.

The exhaust blower not working triggered a safety that prevented the furnace from firing.

Spinning it bypassed the safety.

You likely inhaled a lot more carbon monoxide than you know.

andrewthornton · 2026-06-05T23:06:00.000Z 1780700760

I was spinning it in reverse actually, but it would be enough to start the exhaust blower. It would also re-start pretty well for ~6 hours. It was probably the bearing. Also FWIW I have multiple carbon monoxide/air quality monitors and nothing tripped or alarmed.

llbbdd · 2026-06-05T22:51:40.000Z 1780699900

Can you elaborate? I interpreted the same as the other comment that the blower fan just needed a hand start and kept going after the furnace started up. What you're saying only makes sense to me if the spinning the fan by hand allowed the furnace to start by bypassing the safety at startup, but wouldn't that mean that if the exhaust fan was stopped during normal operation (blockage etc) that the furnace would just keep going, dumping CO into the home?

andrewthornton · 2026-06-05T23:06:37.000Z 1780700797

It wasn't bypassing, I was just helping start because of what I believe to have been a bearing issue.

doubled112 · 2026-06-05T23:44:18.000Z 1780703058

It’s a pretty normal trick to try while troubleshooting a rotating part.

Helping something start is not likely to ruin your day (unless you get caught in a rotating part)

philipkglass · 2026-06-05T22:08:07.000Z 1780697287

From the description I thought that a degraded capacitor or lack of lubrication made the blower not start on its own, but the blower (and the whole furnace) would work if given a manual startup spin by hand.

baq · 2026-06-06T08:47:24.000Z 1780735644

The exhaust blower triggering a safety stop is normal when the blower should be blowing but isn’t. If the blower keeps spinning after it’s spun up manually everything is now working as intended. If it stopped blowing the furnace would go into safe mode again. Ask me how I know and I’ll tell you I had a broken blower on a cold winter before Gemini was a thing.

bityard · 2026-06-06T10:57:14.000Z 1780743434

None of what you said is actually how furnaces work.

"Spinning it to bypass the safety" is not a thing.

Please don't spread FUD.

pesus · 2026-06-05T22:16:20.000Z 1780697780

Welp, AI almost killing someone is definitely an "oh shit" moment.

saturn8601 · 2026-06-06T02:02:14.000Z 1780711334

Wonder how many AI deaths have occured that we dont know about(since they presumably died). With the adoption numbers we are seeing it much have happened already.

ben_w · 2026-06-06T07:32:17.000Z 1780731137

I'd be surprised if it was less than hundreds, or more than hundreds of thousands.

High hundreds of thousands feels like the upper limit before it would show up in statistically noticeable changes in patterns of deaths in some demographic.

High hundreds of individuals would still be "one in a million fatal errors over a few years", which seems better than I'd expect given I've personally had ChatGPT tell me that Solanum nigrum berries were "black tomatoes" (they're not usually fatal, but are a bit toxic, and no I did not eat them).

ihsw · 2026-06-06T01:00:38.000Z 1780707638

The most interesting part is that there is no direct line between someone's accidental death and a chatbot giving life-threatening advice.

Imagine one of the models that has "accidental-deaths-via-bad-advice" just slightly turned up, with the model-provider's intent being to kill 5% more people per year.

sugarkjube · 2026-06-06T06:04:18.000Z 1780725858

Killing your customers is not the best way to stay in business

ben_w · 2026-06-06T07:40:10.000Z 1780731610

If you're paranoid (or a hawk), imagine a Chinese LLM that only offers fatal advice when queried English, or an American LLM that only offers fatal advice when queried in Chinese. Or American and Russian models which only offer up fatal advice when queried in German, Finish, or Danish.

kunjanshah · 2026-06-05T22:14:32.000Z 1780697672

https://www.covenantairesolutions.com/post/what-is-a-furnace...

“At its core, it's a small motor with a fan attached that has one primary job: to vent harmful exhaust gases out of your home before the burners ever kick on. This is the very first step in the heating sequence, and it's non-negotiable for a safe startup.“

Izkata · 2026-06-06T02:32:58.000Z 1780713178

The original comment was unclear whether the fan kept spinning while the furnace was running, or if all it did was bypass the safety and the fan didn't continue to spin while the furnace ran. They clarified in their response it kept spinning.

MPSimmons · 2026-06-06T03:05:59.000Z 1780715159

It seemed obvious to me that this was bearing stiction and that manually rotating it during the start allowed the fan to spin on its own after that, but I could be wrong and maybe the fan was dead entirely?

modriano · 2026-06-06T04:02:10.000Z 1780718530

Yeah, that would be my assumption too (based on my admittedly incomplete personal experience where I got my furnace running by manually spinning my draft inducer motor, which kept spinning).

As exhausting the combustion products is a critical safety feature, I would be surprised if any furnace was designed such that it could possibly keep running if the draft inducer motor stopped. It seems like it would be trivially easy to make a circuit such that gas valves could only open if the draft inducer motor + fan wasn't spinning.

shreddude · 2026-06-05T19:56:48.000Z 1780689408

I could go on and on, but Claude recently decompiled the firmware of my camper van, documented all the CAN interfaces, then programmed an ESP32 module to talk to the van’s integrated systems (power, HVAC, lighting, tanks). That sort of embedded systems integration is completely out of my wheelhouse.

I honestly don’t understand AI naysayers. I use Claude every day both professionally as a Solution Architect and personally in a variety of projects I simply could not have ever approached alone.

williamdclt · 2026-06-05T23:41:35.000Z 1780702895

> projects I simply could not have ever approached alone.

I think that's part of the divide between enthusiasts and naysayers. If you use GenAI on things that you couldn't approach alone, it's an incredible tool. If you use it on stuff that you're pretty good at, it's not a gamechanger (and if you're an expert, it's a minor boost at best). Many people's job are about doing what they're an expert at.

pmontra · 2026-06-06T04:09:53.000Z 1780718993

I'm about to complete a new non trivial functionality in a project of a costumer of mine. I spent an hour writing the spec. Then I asked Claude (Sonnet 4.6) to check if I missed something. I did, the sort of minor issues one notice after starting writing code, edge cases etc. That made me think about more issues and after a few iterations we settled down on a spec. I asked Claude to make an implementation plan and we ended up with 9 steps. It wrote the code for a step with new automatic tests and I performed some manual QA, which found further issues we didn't think about. We are at step 8 of 9 in about 12 hours of work. I would have needed a week to be there alone, with time spent researching and fixing bugs I created along the way, an inevitable part of our job but not exactly the most pleasant one.

This speedup is great. It improves the overall quality of the product (as perceived by the users) because I can ask Claude to add features that my customers and I would have dismissed because they take too long to implement. We would have settled down with a more basic UX.

So is it a game changer? It is in the same way those HTML / CSS framework like Bootstrap were game changers: suddenly every developer could create a decent and consistent UI in a fraction of the time with a few bells and whistles that we wouldn't have bothered coding. As a side effect a lot of web apps felt look alike mass products and web designers had to reinvent themselves, but the economics leaded inevitably in that direction. Would I spend again one of two weeks doing alone what I could write in a day or two with a LLM? Not anymore, not at this cost ($20 per month.)

jowsie · 2026-06-06T09:22:00.000Z 1780737720

I'd love to read a full transcript of someone going through this kind of collaborative programming. I see this kind of process mentioned a lot but can't quite figure out the details in my head. If anyone has a link to a blog post or similar showing this process in depth, I'd love to give it a read :)

bawolff · 2026-06-06T02:04:09.000Z 1780711449

I think part of it is we often notice bad AI usage. The llm generated "art" by someone with bad taste, or the patches to open source projects by people who cant program at all and are teerrible.

If the use is half decent people just dont notice it.

tstrimple · 2026-06-06T06:32:28.000Z 1780727548

Anti-AI zealots (from a practical usability position. Not necessarily the moral ones) are like the people who looked at The Daily WTF and decided no humans are capable of programming. They had plenty of examples to point at, but refuse to look at decent to great programmers. The stories of "The AI deleted my database!" are prevalent and boosted by these folks because it confirms their biases. It literally doesn't matter if the LLM wrote strong warnings about the action about to be taken. They don't see that aspect of it. Just the fact that someone claims "The AI deleted my database!" is enough for them.

Despite all the liars telling me gaming is easier on Linux than Windows, most new games have some sort of issues launching with default settings. CC is able to dive into both the exact error logs and the recent community feedback on what tweaks / configurations are needed to make it work. I rarely have to go beyond two prompts before a game is playable. CC and Proton are enabling the Linux gaming experience far more than Linus ever has or ever was interested in.

LouisSayers · 2026-06-06T01:27:50.000Z 1780709270

I find it's a huge boost for my day-to-day work.

If you work on architecture and Claude docs, then you can essentially just have it fill in the gaps. Work then mostly becomes a matter of defining what the next piece of functionality is (which you can also use Claude to help with).

The stuff that used to take days now takes hours. It's not perfect, but if you get your codebase into a good shape then the payoff is huge.

mattmanser · 2026-06-06T07:18:55.000Z 1780730335

I re-read something I did 6 months ago doing this.

It's so obviously AI and had much less value than I thought now I look at it with fresh eyes.

Worse it doesn't read like I wrote it, I don't recognize myself in the doc.

jorl17 · 2026-06-06T02:15:31.000Z 1780712131

While I think this is true

> If you use GenAI on things that you couldn't approach alone, it's an incredible tool.

I think this isn't true in all cases

> If you use it on stuff that you're pretty good at, it's not a gamechanger (and if you're an expert, it's a minor boost at best).

I think even then there's a divide.

I mostly work greenfield projects (and love it!). For these, AI has been a literal game changer. Our projects are built faster, with one or two orders of magnitude more automated tests, and all quality metrics are up.

Meanwhile, nearly all of my friends complain that AI doesn't help them. But they mostly work in very large existing codebases.

Still, even in large projects I think AI (the expensive variant) has been a complete gamechanger for me. Sure, I spend a lot on tokens, but I just feel happier and enjoy what I do more. The singalong people say about "thinking at a higher abstraction level" is what I feel. I really am thinking about architecture and larger patterns, instead of the boring nitty-gritty (which wasn't boring at all when I was a kid learning to code!...)

I think a key factor in all of this, to me, has been dictation. Most of the time, I don't write -- I use voice-to-text. I don't even read what comes out of it -- the LLMs get it (it is mostly unintelligible to anyone else) .

This means when I'm planning a big feature, I give a gigantic brain dump to the LLM in perfect stream of consciousness way, going through ideas, pros and cons, edge cases, what exists, what doesn't exist, where I'm sure of something, where I'm not sure and want the LLM to browse the state-of-the-art. Sometimes I spend 20 minutes just talking to the microphone before I send the first prompt. When I pair that with Opus, I find that I am able to build much faster and to go through alternative designs much more frequently as well.

I keep trying to tell all my friends: use voice to text and braindump to the computer. But they refuse... I couldn't imagine having to type everything nowadays. Even though I'm a fast typer, it's still much slower than the speed of my thought, which, granted, is still faster than the speed of my voice.

In effect, I filter much less, but I've come to think that's positive for the good LLMs: I throw all the edge cases and what ifs I'm thinking about -- all those years of experience dealing with similar systems.

If I wanted to go back to work in-office, that would be my major problem: I need to be able to talk with my computer all the time, loudly, and pacing through my room.

bthallplz · 2026-06-06T11:06:12.000Z 1780743972

Yay for dictation! It's so nice to just think aloud and then have an easily editable record of your thoughts, even when you aren't feeding the outputs to LLMs.

400thecat · 2026-06-06T10:01:38.000Z 1780740098

How do you use voice-to-text? You mean, in the browser? I am only familiar with Claude Code, which I have installed on remote server, and there obviously, voice-to-text does not work. I have to type, which is tiring.

bigfudge · 2026-06-06T10:59:27.000Z 1780743567

I’ve installed Hex on os x. You just hold down a hot key to talk and it writes into whatever text entry widget is focussed.

dawnerd · 2026-06-06T00:58:31.000Z 1780707511

And in a team setting it can really accelerate tech debt especially if used by people that know just enough to be dangerous.

seventytwo · 2026-06-06T12:21:12.000Z 1780748472

The dangerous thing is when you’re a novice and can’t identify the BS. That’s why for people with “good” and “expert” skill, it’s not a huge boost. They can identify the BS, and what’s left is modestly helpful.

The highest danger in using AI comes precisely to people who stand the most to gain from it.

jesse_dot_id · 2026-06-06T01:02:37.000Z 1780707757

Same. I'm a DevOps engineer, so a jack of all trades master of none type of guy, and Claude Code backfills my knowledge gaps and turns me into kind of a superhero. I think it's key to already have a pretty good idea of what you're looking at, though.

doctorwho42 · 2026-06-06T05:24:25.000Z 1780723465

Maybe because the scale of investment out strips the value?

What trillion dollar problem is AI solving?

fragmede · 2026-06-06T06:55:15.000Z 1780728915

If you're going to put it that way, companies, globally, spend something on the order of $20 trillion on office workers. If corporations didn't have to spend that money on them, and everything else in order to support them, they wouldn't.

luckystarr · 2026-06-06T10:58:03.000Z 1780743483

Then the workers wouldn't spend 20 trillion and the economy as a whole would tank.

angusturner · 2026-06-06T12:28:45.000Z 1780748925

In 2017 I worked tirelessly with my colleagues to implement and replicate the first transformer paper.

Yesterday I left Opus 4.8 to go do some architecture research, with GPU access.

It replicated and trained a credible baseline. It implemented some ideas I'd been thinking about, and wrote custom CUDA kernels for them. It read and summarised dozens of related papers.

It has since run dozens of experiments, with minimal supervision. When a model is unstable it kills it, documents why, fires off a new configuration.

The realisation that frontier labs are doing this at scale with unlimited GPU and token budgets.

It actually scares me a bit. The realisation that the next big breakthroughs will only have light human involvement.

The prospect of recursive self improvement feels more to real to me all of sudden

jp57 · 2026-06-05T21:32:27.000Z 1780695147

Actually seems absurdly simple now, but sometime last year I was trying to figure out what I'd need to tow my daughter's car cross country with my truck: what are the trailer/dolly options, what do they cost, can my truck actually tow the combined weight, etc.

I started out prompting ChatGPT kinda how I would with Google, one small prompt at a time, asking about various details. But after one or two of those I just tried "I want to tow a car of make A with my truck model B, from point C to point D, what are my options?" And it wrote me a report with comparison tables and computed towing weights and other details for different options.

At that point, I was like "Oh. This is different. And it's just the beginning."

SamuelAdams · 2026-06-05T22:27:40.000Z 1780698460

Similarly, I used gen ai to review a real estate purchase. I provided Zillow listing photos and serial numbers of all appliances, the electric panel, and a few additional not pictured areas that I took during the walk through.

I prompted the AI to write a report as if it were a home inspector and it actually did a better job and identified some issues the paid 750 usd inspector missed.

j_bum · 2026-06-05T23:58:25.000Z 1780703905

From pictures alone? What are some examples?

bombcar · 2026-06-06T00:59:06.000Z 1780707546

I presume something like this: https://www.penny-arcade.com/comic/2007/06/22/perfectly-reas...

SamuelAdams · 2026-06-06T01:25:49.000Z 1780709149

It noticed a flooding area due to low grass by the walkout door. It noticed mixed 15 and 20a receptacles on the same circuit. It noticed warped siding and recalled circuit breakers still in use.

jimmaswell · 2026-06-06T04:26:02.000Z 1780719962

15A and 20A receptacles on the same circuit sounds fine as long as it's a 20A circuit? And how could it tell which outlet is on which circuit?

albedoa · 2026-06-06T03:43:29.000Z 1780717409

What, the Zillow listing of you home doesn't have pictures of mixed 15 and 20a receptacles on the same circuit that an AI caught but that an inspector missed?

Is that what you're telling us??

flyinglizard · 2026-06-06T00:06:06.000Z 1780704366

It very plausibly might have been totally wrong.

Out of laziness I several times asked Claude and ChatGPT each some torque figures and other simple, hard data related to my dirt bike. They often got it completely wrong, but full of confidence every time. I never trust LLMs with hard data, unless you RAG the PDF into the context and even then it's sketchy.

jp57 · 2026-06-06T12:13:20.000Z 1780748000

It wasn’t wrong, though, in my case.

saturn8601 · 2026-06-06T01:55:23.000Z 1780710923

Dates matters. Questions I asked about my Mazda a year ago that were total hucillunations were answered very well this year. To me it feel like the early days of computing. What was not possible one year became possible when a new generation CPU or GPU came out and you have to consistently re-evaluate your expectations or else you'll miss the things that others are discovering with fresh eyes.

I made this personal 'benchmark' of odd and strange questions a few years back when this took off and I would keep re-running these questions whenever some big news came out about a new model and also going back and fourth between the different companies to see where they all stood. (Obvioulsy with clean cache/new accounts)

10 questions: In 2023 it could only get past question 3-4 to reaching the last question and still hacillunating(last year) to providing sources pulled from really obscure books(this year).

For example, one of the harder questions was about the transition of a particular 30 second portion of a background song used in a 30+ year old Bond film that was only played once in the entire film. Went from totally making up nonsense to accurately describing the music theory defintiion of the transition(called a 'stinger') to also explaining why it was done in that particular scene of the film and also providing sources from a snippet of a unrelated interview with the composer explaining his mindset at the time.

Maybe this isn't considered a real benchmark as its not reproducable but for a 'personal benchmark' I came away impressed. I would consider everyone to define their own benchmarks and 'tests' and to consistantly challenge the models to see if there are any meaningful improvements. Now I treat the AI as something to keep skeptical but to also to always consider what it proposes as an answer(ie. dont ever dismiss it outright). I sometimes wonder if this is slowly messing up my biases and maybe thats what Altman, Amodei and others want.

glouwbug · 2026-06-06T01:40:28.000Z 1780710028

Hard numbers, no. Even high level concepts and theory you need to triangulate and prompt in different angles, across different models, and figure out what overlaps to build a mental mode that’s - even then - roughly 80% correct. It’s better than google, but the information isn’t free

boston_clone · 2026-06-06T08:14:07.000Z 1780733647

Fascinating; you used a non-deterministic tool - one that disclaims its own accuracy - to calculate critical information that could result in serious damages or physical injury? Did you like, double-check the results?

One must imagine how many claims have been denied by insurance companies for doing something like this...

SubiculumCode · 2026-06-06T05:58:40.000Z 1780725520

For me it was right at the beginning. They said it was a dungeon game. It would describe a room, etc, and I would take some action. But I thought that this dungeon was built in some intricate database. But then I told it that I wanted to leave, got to an inn, where I flirted with the bar waitress, and soon we were watching the sunset in some meadow. As cheesy as that was, it was then that I went "oh shit" this is a machine that can respond to language with language in a way that simulated actual understanding and intelligence, concepts and schema, and everything else, and I knew then that the world would never be the same again. People here talk about the crazy things they solved with AI, and I get that...but the first time I actually talked to a machine and didn't feel like it was either random gibberish or scripted, but dynamic and responsive. The first alien I ever met, and he knew my language.

jldugger · 2026-06-06T08:18:39.000Z 1780733919

> But then I told it that I wanted to leave, got to an inn, where I flirted with the bar waitress, and soon we were watching the sunset in some meadow.

Immediate Silicon Valley vibes: https://youtu.be/S8MAV9jhf04?t=18

loudmax · 2026-06-06T02:41:23.000Z 1780713683

For me it was torrenting a 7G ball of weights leaked from Meta and running alpaca.cpp (an early variant of llama.cpp) on my desktop computer in early 2023. I started asking it questions about the Roman empire and it answered me in English! The responses were generally incorrect, but no worse than what your average American college student might guess at, though delivered with much more confidence.

This was my desktop computer responding to questions in English, not some fancy server in a massive Google data center. Who cares if what it says isn't reliable? Being able to converse with my CPU in English is like having a conversation with a dog!

stogot · 2026-06-06T03:01:45.000Z 1780714905

I did the same and it wa slow but realizd there was no going back. 100x improvement in three year

monuszero · 2026-06-06T04:10:54.000Z 1780719054

We had a monthlong sprint adding robot motion planning features to our codebase years ago, and I was never satisfied with the result. As a small team wanting to leverage oss we vendored in OMPL, did the usual thing around caching and roadmap management. I knew there was a way to parallelize some of the algorithm we were using with simd or a gpu kernel, plenty of that in the literature, but it was never worth fighting CUDA or metal/accelerate or whatever for uncertain gains.

So when cooking dinner one night, I set opus 4.6 on a from-scratch native and accelerated roadmap planner implementation (after previously porting IK, FK, collision checking with some success) I had primed it by having a research agent drop a literature review in its docs folder covering the type of planner we needed. By the time the pasta water was boiling it was done- getting plans in a few hundred ms compared to several of seconds on our good old fashioned OMPL code.

For me it was the revelation that the economic value of cooking dinner could be compared to tackling an honest two weeks of coding work. The calculus has shifted - work that was once a risky or extravagant use of time is now worth considering.

For a small team who wants to focus on substance rather than implementation, knows what they want, and how to set up the agent for success, it’s a complete game changer in terms of what we can take on. Incumbents beware

AussieWog93 · 2026-06-05T23:07:41.000Z 1780700861

Literally just last night I have Claude Code the following prompt, verbatim:

"Whenever I launch Kodi on my Chromecast 4k, it crashes. I think this is related to a plugin or skin. It goes away for a bit if I clear cache but will eventually come back. Can you connect to the device via adb (I've run adb connect already), and debug exactly where it's crashing? Once you've done that, propose a solution. If this requires downloading, fixing, rebuilding and then uploading the broken extension via adb, don't be shy. I should have Android dev tools (Gradle etc.) on this Mac."

Lo and behold, without human intervention, it pinpointed the crash, downloaded the Kodi source, patched out a bug that had existed since 2016, recompiled it, signed it, then pushed it to my Chromecast all while carefully making sure to keep all my settings intact.

Got it to make a PR too (which is as of this moment unpublished; going to test more over the coming weeks).

darksim905 · 2026-06-06T04:01:47.000Z 1780718507

I know this isn't apples to apples, but given that I can't get Copilot or other tools to view a simple profile page on LinkedIn makes me curious/skeptical how this would work in this depth. I'm sure it's possible but I'm curious what the skills and toolchains involved were for you to get all that to work.

AussieWog93 · 2026-06-06T04:32:30.000Z 1780720350

Full claude.md is here: https://github.com/EspoTek/.claude/blob/master/CLAUDE.md

The skills I have installed are:

```

    on         frontend-design:frontend-design · plugin · ~90 tok · locked by plugin
     on         agents-sdk · user · ~150 tok
     on         cloudflare · user · ~130 tok
     on         cloudflare-email-service · user · ~180 tok
     on         durable-objects · user · ~130 tok
     on         find-docs · user · ~300 tok
     on         find-skills · user · ~110 tok
     on         sandbox-sdk · user · ~120 tok
     on         stage-chapters · user · ~40 tok
     on         web-perf · user · ~150 tok
     on         workers-best-practices · user · ~130 tok
     on         wrangler · user · ~120 tok

```

The plugins I have are:

```

    cc-caffeine Plugin · samber ·  enabled
    frontend-design Plugin · claude-plugins-official ·  enabled
    ty Plugin · claude-code-lsps ·  enabled
    vscode-langservers Plugin · claude-code-lsps ·  enabled
    vtsls Plugin · claude-code-lsps ·  enabled

```

There's also an MCP for Context7.

But yeah, this is more or less vanilla Claude Code - at least, nothing related to Android or adb there.

It's that good now. A few days ago I asked it to SSH into my Ubuntu box and investigate a hang. It didn't solve the problem fully autonomously like this time but did tell me a whole lot things it wasn't, and hinted at a faulty driver. We went back and forth a bit, it set up a watchdog and taught me how to update the kernel without updating Ubuntu itself, and the server has been rock solid for the past 3-4 days now.

Also, if you're curious, full log for the Kodi issue:

https://github.com/user-attachments/files/28659304/2026-06-0...

I did prompt it a little bit more today in order to get something more production-ready (the original solution kept regenerating the cache on boot, rather than fixing it permanently), but you can see the whole original autonomously-generated solution in the logs. It's insane, seriously.

dcre · 2026-06-06T06:38:25.000Z 1780727905

Claude Code (or Codex, or OpenCode, or Pi, or Amp — whatever) can do this out of the box without any skills or special tools. The most important thing for making results like this easier to achieve (in any harness) is using the best current models. Right now that's Opus 4.8 and GPT-5.5.

knollimar · 2026-06-06T12:04:31.000Z 1780747471

I mean you can likely use Opus 4.7. I barely notice a difference. 4.8 confabulates more for me

mft_ · 2026-06-06T07:27:27.000Z 1780730847

To expand on another answer, it’s all about the harness. Different harnesses (Claude Code or Cowork, Hermes, OpenCode, Pi, etc.) offer different default tools, system prompts, and ultimately approaches. (IME the corporate CoPilot app is terrible - basically a chat interface.)

I’m currently using Hermes for local LLMs - seems pretty good so far.

senko · 2026-06-06T09:03:01.000Z 1780736581

LinkedIn in particular is quite aggressively blocking any automated attempts to read or navigate through it.

I post quite a lot there and wanted to have a copy of my posts on my blog[0] to preserve them. For a few months I was able to use a headless browser + claude code, then LI wised up and started logging it out, so I had to use a regular Chrome, log in manually and then tell the LLM to take over and slowly go through my feed.

If you're accessing sites which are not actively blocking bots, or - gasp - have an API, it's much better.

[0] example: https://blog.senko.net/may-quick-takes

blablabla123 · 2026-06-06T07:20:53.000Z 1780730453

So I'm scrolling through this Ask HN and this is now the 3rd similar problem. Would you mind adding more details as well as the patch? Perhaps as a gist if it's unfinished?

I mean just googled https://www.google.com/search?q=kodi+crash+chromecast+4k I'm getting really a lot of issues such as https://forum.kodi.tv/showthread.php?tid=381239

It seems to be a quite common problem. Are you sure it was the rube goldberg fix and not a more mundane solution? Such as pulling in someone's fork from GitHub or just clearing the cache on a loop?

AussieWog93 · 2026-06-06T08:15:25.000Z 1780733725

Here's the draft PR: https://github.com/xbmc/xbmc/pull/28404

And yes, it correctly diagnosed the problem - I confirmed this morning. The cache had been partially deleted (exactly like it said) and the patched version of the software automatically detected this and rebuilt the cache rather than crashing. This was using the initial version of the patch from commit 1 of the PR.

I then talked with Claude a bit to come up with a less hacky solution that doesn't require constant cache rebuilding, and it suggested writing the "cache" to no_backup, bypassing the cache trimmer. However, this required rebuilding the .so via NDK, so it spun up a full VM in multipass, installed all the tools in there to build the fully patched APK, and built it (the VM was my suggestion, it was about to just brew install everything and mess with my local dev environment).

You can read the full log here, it's nuts: https://github.com/user-attachments/files/28659274/2026-06-0...

I think the key takeaway from this experience (and a few others recently) is that Claude Code works much, much better when you explicitly instruct it to test against real data.

Had I simply described the issue and asked it to think up a solution it likely would have just navel-gazed and then come up with a wrong solution. But by pointing it at a real working environment and actively encouraging it to get its hands dirty, it found the actual solution rapidly - in spite of the fact that I gave it wrong information twice.

blablabla123 · 2026-06-06T09:21:33.000Z 1780737693

It's the cache, I pin-pointed the main problem correctly without ADB access, any closer details and just google.

> // Unpack into no_backup storage rather than the cache dir. Android may

> // delete files from getCacheDir() at any time to reclaim space, which

Looking further into the issue disk space is a huge problem with Kodi discussed plenty of times. In fact even the Wiki dedicates 2 pages to it:

https://kodi.wiki/view/Archive:Reduce_disk_space_usage

https://kodi.wiki/view/Texture_Cache_Maintenance_utility

I realize from your perspective this may seem still a very convincing example in the sense of it works.

A non-programmatic solution might have been possible though:

> It's likely your thumbnail cache. That's typically the biggest piece stored locally (you also have the database). You can clear the cache (short term fix) or move it to another drive (long term fix).

> Also recommend not downloading actor thumbnails. Lot of extra images.

https://www.reddit.com/r/ShieldAndroidTV/comments/1f7xfwn/ko...

I also recommend: https://en.wikipedia.org/wiki/Data_dredging

AussieWog93 · 2026-06-06T09:31:37.000Z 1780738297

It's not (strictly) a disk space issue. It's Android aggressively trimming cache files that Kodi assumes are never trimmed.

There's a single variable that keeps track of whether or not the cache has been "written", but Android only trims some of the cache files.

AussieWog93 · 2026-06-06T12:12:11.000Z 1780747931

Although if it makes you feel better it turns out this issue was a dupe. Someone else's Claude probably fixed it before mine did, haha.

calf · 2026-06-05T23:35:52.000Z 1780702552

That's amazing, as someone who struggles to find something useful to do with LLMs. How long does this take, several minutes or more? Do you need a paid version of Claude Code for this?

AussieWog93 · 2026-06-05T23:44:33.000Z 1780703073

It sat there for about half an hour working out the problem, step by step, before asking me for the preferred solution. At one point, it was trying to decompile the .APK, so I interrupted it and reminded it that Kodi was open source - it was welcome to clone from GitHub.

The only other feedback I gave it mid-process was wrong (I said that the crash probably wasn't caused by cache trimming, it ran some additional tests to confirm that its hunch about cache trimming was right).

This was with the paid version of Claude Code (I don't think they offer a free version at all; that's a Codex thing). The $20 version is as smart as the $200 one, but once you work out it can do stuff like this you'll quickly burn the $20 token limit. :)

The other thing that helps is a CLAUDE.md file - authored of course by Claude itself. Mine's here: https://github.com/EspoTek/.claude/blob/master/CLAUDE.md A lot of it is probably domain-specific for the stuff I do, but the "Working with unfamiliar data or systems" section is bloody gold! Stopped the bullshit completely!

evdubs · 2026-06-05T20:01:39.000Z 1780689699

I tried to see if an LLM service provider could rewrite some legal docs where nothing was hallucinated in order to follow a consistent format to see what may be missing in the document. It could do that.

Next, I wanted to see if this could be done with a local LLM. Gemma-4 handles this fine with an 8GB video card and a large context (128k).

Next, I wanted to see if the model could also OCR these docs and translate them. The same model can handle that quite well.

This was when I realized LLMs should be great for handling work where:

- I already know what I want to do

- I already know how to do it

- I don't think this task will help develop skills I find to be valuable

- If I have to do it manually myself, I will probably cut corners

So now I view LLMs through the lens of, "what work can I send to an LLM that I otherwise would not really care about doing."

SoftTalker · 2026-06-05T20:19:46.000Z 1780690786

Yes, the best results I've had using LLMs are for tasks where simply reading and reformatting/translating/summarizing are the goals. They are much faster and less prone to boredom doing these things than humans are. For now.

gscott · 2026-06-05T23:42:43.000Z 1780702963

My son is in a lawsuit with his bank where they put through fraudulent charges and wouldn't charge them back then the bank sued him for the money. He is using Claude and Gemini fighting the original lawsuit and now has a counter-suit 100% using AI for everything. He puts it into different AI's to check everything against each other and to come up with more ideas. He started with ChatGPT, moved to Grok, then Claude, but now Gemini is turning out to be the strongest.

kstrauser · 2026-06-06T00:05:55.000Z 1780704355

I'm about as pro-AI as anyone here. I say this with love: anyone using general-purpose, consumer-grade AI for healthcare, law, or taxes is mad. Best wishes to your son, bless his heart, but please have him consult a qualified lawyer before showing up to court with model-drafted legal documents. Among other things, those chats are not privileged information[0] and the banks could subpoena chat transcripts to see what else he might have told them.

[0]https://natlawreview.com/article/new-york-court-rules-ai-doc...

gscott · 2026-06-06T00:42:53.000Z 1780706573

He has had multiple hearings and the Judge has reviewed everything. The court clerk reads every submission and before the clerk puts it in the system they have a in-house lawyer review each document. This is pretty far along. The trial is scheduled for October of this year.

The bank has a lawyer, they were hoping for a default judgement because who can afford to fight the bank. The choice is fight it yourself or declare bankruptcy.

As you already know, AI companies trained on every single document they can find. Those include legal documents. The legal system is structured where you have Federal Laws, State Laws, Federal & State Regulations and Court Precedent. Because of this structure it is not difficult for a LLM to figure out.

bethekind · 2026-06-06T05:56:52.000Z 1780725412

I'm curious if you can have a judge XYZ skill where you have an ai analyze how that judge ruled for certain judgements in the past, and how similar lawsuits/arguments did in front of them. Might help to angle the ai's findings a tad, or might also not be worth the effort. Both are possible

bombcar · 2026-06-06T01:10:10.000Z 1780708210

The only way to "win" as a small is to be pro se and be extremely diligent in understanding what is happening.

Then, it costs you nothing but time.

larrydag · 2026-06-06T10:04:46.000Z 1780740286

Your son should blog the experience. That could be an interesting read.

jasondigitized · 2026-06-05T21:08:06.000Z 1780693686

This. I know how to do this but I don't have the time/energy to do this. "Get me Claude!"

tempoponet · 2026-06-06T11:24:30.000Z 1780745070

I can actually use and enjoy Linux. The "year of the desktop" never came for me, but instead I got the "year of the cli".

For 20 years I've used Linux in one form or another, but I've felt like I was kneecapped for the most basic things. Just trying to plug in an external drive or a second display meant hours of stack overflow and pasting commands I didn't understand.

Now I'm using several Linux machines for Steam, NAS, local LLM, development, and what used to derail a weekend project now amounts to a coffee break while Claude figures it out.

kstrauser · 2026-06-05T21:11:26.000Z 1780693886

I have a large token budget as part of my work. A coworker was scanning some repos for vulnerabilities as a test. He found a scary looking remote exploit in a popular project and shared it with me for a second opinion. I spun up a local instance of the project and ran the POC against it: nothing. Turns out it needed some configuration knobs tweaked to lower some security protections.

So I told the AI what happened, and asked it to fix the POC so that it would work with the default configuration. It chewed away at that for a few minutes until it cheerfully patched the POC into a weaponized version. I ran it. The local instance, which I had just downloaded, compiled myself, and launched with the default config file, immediately crashed.

I got the cold sweats. I've read this novel. I've seen this movie. Wow. I have a blinking cursor on the console of a nuclear information bomb. I tossed and turned all night, got about half an hour of actual sleep, and probably looked like I'd seen a ghost at work the next day.

On the plus side, it gave our team some very clear ethical and moral guidance: we're going to do this, and we're going to share our findings with the relevant authors, because we can. Because I want to live in a world where the good guys are trying to fix problems before the bad guys can find them, I decided to help build that world. It was like, well, I guess this is what I'm doing now.

lobf · 2026-06-06T06:43:52.000Z 1780728232

Sorry, what does POC mean in this context? I don’t see an earlier combination of words for which that would be an initialism.

gregsadetsky · 2026-06-06T06:54:18.000Z 1780728858

proof of concept

kstrauser · 2026-06-06T07:20:39.000Z 1780730439

Yep. It's the term for basically a demonstration of a claim. "Huh, this part of the program code looks like it's vulnerable to a buffer overflow, so I'll write a script designed to get the malicious data into the right place inside the programs dataflow pathway to prove that it's actually vulnerable."

You can have a perfectly legitimate, critical vulnerability without providing a working POC. However, then it's up to debate. "Is it really a problem? Is it even possible to sneak the payload past the various checks to get it into position? Hmm, it's hart to tell... perhaps it isn't." But show up with a working POC and it's hard to argue that it's not a real vulnerability. "I don't think that's actually reachable." "Boom, crash." "Oh. I guess it is."

chaoxu · 2026-06-06T09:54:21.000Z 1780739661

I'm a researcher working in theoretical computer science. Chatgpt found a counterexample of some conjecture I've been trying for 2 years. Also, it one shot many problems I've worked on. It also improved some of my work greatly.

I feel quite useless in the sheer brutal proof writing, counterexample generating skill chatgpt is demonstrating, and wonder what would be the future of my profession.

Simon_O_Rourke · 2026-06-06T12:53:22.000Z 1780750402

I would have loved to have had ChatGPT when I had to do a few modules in formal methods, I'd say it would have eaten through the BS I had to wade through

UncleOxidant · 2026-06-06T01:41:34.000Z 1780710094

I guess I've had several of those moments over the last year and a half. But a recent one was that I was working with Claude to create a spiking neural net MNIST classifier in an FPGA for a demo. Claude took it from concept to PyTorch, to training (training a Spiking neural net isn't necessarily straightforward - that's a whole post in itself, but Claude came up with a working solution), and then to implementation in Verilog and through synthesis into the FPGA. I asked Claude to create a drawing app to run on the PC side that would allow the user to draw a digit with a mouse and then click a classify button. The data from the digit drawing app was to be transferred via USB to SPI to the FPGA. I didn't have a SPI adapter yet (it was on order from Adafruit) so I asked claude to let me communicate with the simulated verilog code running in the Verilator simulator, through a virtual SPI interface. Then I went to lunch. I came back to see the digit drawing app displayed on the monitor. I drew a '2' and it classified it as a 2. In another window I could see the Verilator simulator running and the data being passed. Chills.

alexfoo · 2026-06-05T23:03:19.000Z 1780700599

Someone in the house pressed the button to update the printer (Brother DCP-L3550CDW) firmware and the CSV page that was the basis for an existing Prometheus exporter (drum/toner lifespan, page counts, etc) stopped being a thing. Instead there was an HTML page with all of the information buried in various divs/etc.

I'd planned on writing something myself to parse the HTML and write a suitable exporter but I thought I'd give Claude a chance.

In a sandboxed VM I gave Claude a single static HTML file of the status page from the printer, also in the directory was the equivalent of "hello world" in Go, literally just the minimum needed to do `fmt.Printf("OK\n")`. The directory was called `brother-exporter`. That was it. No other instructions or information. I hadn't told it what it needed to write. I hadn't said what it should do. I hand't told it what language it was supposed to use.

Just by doing a `/init` in that directory Claude decided that it needed to write a Prometheus exporter in Go that would fetch and parse the HTML file from a printer (defaulting to 192.168.1.1) and then present the associated metrics in a way that they could be scraped by Prometheus.

It did this flawlessly in about 10 minutes.

I could have done it in several hours but this was definitely an "oh shit" moment for me. I think the biggest thing was the fact that it guess/assumed so much (correctly) from so little information in the beginning.

agnishom · 2026-06-06T13:11:56.000Z 1780751516

For me, it was GitHub Copilot in 2021. It could autocomplete my Haskell code based on my comments.

mindcrime · 2026-06-06T00:08:00.000Z 1780704480

I don't remember one specific moment, but I was fairly impressed with ChatGPT from the first time I started interacting with it. Was I ready to call it "AGI"? No, absolutely not. But it was clear that it was something new, and it was also intuitively obvious to me that "this AI is as bad today as it will ever be" and that predicting the rate of change would be difficult.

The more I use these things, the more I'm 100% convinced that it makes sense to say they are "intelligent" (for some meaning of "intelligent"). AGI or "human level intelligence"? Still no[1]. But some kind of intelligence. And I'm quite happy to allow that there can be "intelligence" that doesn't work anything at all like human intelligence, so arguments of the form "this isn't real intelligence", etc, etc. carry very (very) little weight with me. I've actually been sitting on a half written blog post on this very topic for a while, titled "The Marquee Sign Says 'Artificial' Intelligence"[2]. Finding time to finish it has been the challenge.

And before somebody says "Use AI to write it for you". Nah. I am generally what you might call "pro AI" and / or an "AI enthusiast" but I still draw lines. I'll use AI for research, for outlining, for brainstorming, etc. sure. But I have a hard-line stance against letting AI fundamentally write for me. I want anything that goes out with my name associated with it to have my genuine voice.

[1]: I like the term "jagged intelligence" that Demis Hassabis has been using. That is to say, the bounds of the intelligence are jagged or spiky: very intelligent in certain areas, much less so in others.

[2]: for any old-skool pro-wrestling fans, yes, that is an intentional nod to "Double A" Arn Anderson and his "The marquee sign says 'wrestling'" catchphrase. :-)

binarysolo · 2026-06-06T07:46:39.000Z 1780731999

I run a remote-first ecom business with a dozen or so team members.

About a year ago, one of our account managers had a life issue, ghosted us, and she held a fairly critical role in the business and gate-kept a bunch of knowledge to some high value vendor accounts.

Because we ran our ops in Google Workspace, we essentially had off-the-shelf RAG and was able to get answers to a lot of things by asking Gemini to go through all her emails/docs/calendar/meetings, reverse engineer what she did, and create an onboarding doc for her successor.

This happened once more a few months later when one of our analysts broke his wrist on vacay, and we were again able to replicate what they did to cover for their absence, this time dabbling in AI agents ("gems") to do a bunch of the regular simple tasks and again it covered things without too many issues.

I def expect Amazon/shopify to at some point replace all of us brand owners with AI bots if they can, but we'll see how long the gravy train goes on.

ai_fry_ur_brain · 2026-06-06T09:00:06.000Z 1780736406

If you're replacable by an llm, then you're doing something extermely poorly. They're terrible decision makers, have no taste and have little to no ability to infer nuance.

Your business should be fine for a long time (assuming an employee doesn't nuke your business's backend or something because it seems like you're doing something wrong on the HR side of things)

iLoveOncall · 2026-06-06T10:10:28.000Z 1780740628

The fact that you're being downvoted shows how astroturfed this topic is.

vachina · 2026-06-06T11:16:06.000Z 1780744566

You mean all these testaments are bs? As an infrequent user of LLM assisted work these stories never really tallied with my experience.

For example I could never throw a bunch of spec/doc at an agent and have it return something useable 30 minutes later. Yeah the code compiles but they don’t work.

Aerolfos · 2026-06-06T11:27:03.000Z 1780745223

Either they're BS, or the people making these statements are self-incriminating to a terrible degree, either they don't care about their work or are outputting a very low level of quality and being amazed at how "great" and how much better AI output is than their own

All the options are extremely depressing

iLoveOncall · 2026-06-06T11:24:00.000Z 1780745040

> You mean all these testaments are bs?

Yes, or at least extremely exagerated. But most are from literal bots ran by Anthropic and OpenAI to sell their shit.

vachina · 2026-06-06T12:35:48.000Z 1780749348

Interesting, then again unsurprising. HN is ripe for and very easily botted.

mlmonkey · 2026-06-05T20:56:32.000Z 1780692992

I have a buddy who's a consultant. His niche area is Netsuite and Oracle (I think). He's an accountant by training and as a consultant his gig was setting up these instances for clients, charging them an arm and two legs. He'd spend a lot of time golfing, and doing these setups was more than enough money for him. In other words, he had cornered that little slice of the market and was making bank.

Shortly after ChatGPT 2.2(?) came out and hit mainstream, I was chatting with him (I was excited af about the possibilities of AI). He tried to pop by bubble by saying "I bet it can't do what I do for my job!".

So I decided to test it out. We went home and I pulled out my laptop. Went to chatgpt.com and then I asked him to enter the specifications of what Netsuite configuration he wanted. So he proceeded to type in the description of what he wanted, the various settings, configurations, etc. i.e., the specs that he typically gets from his clients. And asked it to give him the commands to set it up.

Lo and behold. ChatGPT came back with a series of commands that he needed to run; the options he needed to configure, etc.

He was crestfallen. "Those are the exact commands I run!"

Luckily for him he recovered. He has since settled on a small stable of clients, all privately held companies whose owners he knows and between them he makes enough to keep his golfing hobby fed.

reactordev · 2026-06-05T21:02:08.000Z 1780693328

Sometimes it's the service you provide, not the value. They know it's in good hands, as it's always been (even if they could have rolled their own ConsultBot 2.0)

bonoboTP · 2026-06-05T21:22:33.000Z 1780694553

I have some friends who, since their high school days help some older acquaintances in upgrading their PCs, choosing laptops and phones, helping with setup etc and these older folks have comfortable money and pay him very well above what would seem reasonable. But the trust and years long relationship matter to them.

Llms are great today for buying advice but there are some incentive issues for the future, ads etc. But in some cases the human contact will remain important. In large corporations it's also similar. The money is peanuts either way, and it's worth them for the peace of mind. But this may not hold forever, especially if the more AI literate generation gets to more senior positions.

simonw · 2026-06-05T20:19:54.000Z 1780690794

ChatGPT Code Interpreter back in ~March 2023. I uploaded a CSV file (of police incidents in San Francisco) and watched it load that into Pandas, show me some charts, then export the data to a SQLite database file for me to download.

I write software for data journalists and this new thing appeared to be able to do everything I wanted my software to do just as an unplanned side effect of having the ability to run Python against a folder with some uploaded files in it.

With hindsight it was my first exposure to a coding agent, but we hadn't named the category at that point.