What are the most upvoted users of Hacker News commenting on? Powered by the /leaders top 50 and updated every thirty minutes. Made by @jamespotterdev.
My local tennis court's reservation website was broken and I couldn't cancel a reservation, and I asked GLM-5.1 if it can figure out the API. Five minutes later, I check and it had found a /cancel.php URL that accepted an ID but the ID wasn't exposed anywhere, so it found and was exploiting a blind SQL injection vulnerability to find my reservation ID.
Overeager, but I was really really impressed.
> Contrast this to the recent trend of dropping in-line references to project names
Can you give an example? A reference to a project, without a link directly to the project, doesn't meet general definitions of spam.
Response from Iran: "Iran’s Islamic Revolutionary Guard Corps (IRGC) says it will respond outside the region and deprive the United States and its allies of oil and gas “for many years” if the US crosses “red lines” and attacks civilian facilities."[1]
"Iran has closed all diplomatic and indirect channels of communication with the United States, the state-run Tehran Times reported ..." US media does not seem to have picked up on this, but media in India and China have.[2] But the common source seems to be "Tehran Times", and it's unclear who runs that or where they get their info. New York Times, AP, and AlJazeera are not saying that. Xinhua has a one-line note with no source. The US White House says Vance is talking to somebody. Politico says Vance is on "standby".[3]
A negotiated cease-fire seems unlikely now.
[1] https://www.aljazeera.com/news/liveblog/2026/4/7/iran-war-li...
[2] https://www.the-independent.com/bulletin/news/iran-mediators...
[3] https://www.politico.com/news/2026/04/06/vance-is-on-standby...
The focus on the speed of the agent generated code as a measure of model quality is unusual and interesting. I've been focusing on intentionally benchmaxxing agentic projects (e.g. "create benchmarks, get a baseline, then make the benchmarks 1.4x faster or better without cheating the benchmarks or causing any regression in output quality") and Opus 4.6 does it very well: in Rust, it can find enough low-level optimizations to make already-fast Rust code up to 6x faster while still passing all tests.
It's a fun way to quantify the real-world performance between models that's more practical and actionable.
It is not quoted, it is summarized. You are quoting the first sentence of the post, but the certainty implied in that sentence is immediately undercut by the next; “I don’t want it to happen but it probably will”, and made even more muddy by the rest, which continues: “However, now that we have Complete and Total Regime Change, where different, smarter, and less radicalized minds prevail, maybe something revolutionarily wonderful can happen, WHO KNOWS? We will find out tonight, one of the most important moments in the long and complex history of the World. 47 years of extortion, corruption, and death, will finally end. God Bless the Great People of Iran!”
Another thread that got 'buried':
I see two basic cases for the people who are claiming it is useless at this point.
One is that they tried AI-based coding a year or two ago, came to the IMHO completely correct at that time conclusion that it was nearly useless, and have not tried it since then to see that the situation has changed. To which the solution is, try it again. It changed a lot.
The other are those who have incorporated into their personal identity that they hate AI and will never use it. I have seen people do things like fire AI at a task they have good reasons to believe it will fail at, and when it does, project that out to all tasks without letting themselves consciously realize that picking a bad task on purpose skews the deck.
To those people my solution is to encourage them to hold on to their skepticism. I try to hold on to it as well despite the incredible cognitive temptation not to. It is very useful. But at the same time... yeah, there was a step change in the past year or so. It has gotten a lot more useful...
... but a lot of that utility is in ways that don't obviate skilled senior coding skills. It likes to write scripting code without strong types. Since the last time I wrote that, I have in fact used it in a situation where there were enough strong types that it spontaneously originated some, but it still tends to write scripting code out of that context no matter what language it is working in. It is good at very straight-line solutions to code but I rarely see it suggest using databases, or event sourcing, or a message bus, or any of a lot of other things... it has a lot of Not Invented Here syndrome where it instead bashes out some minimal solution that passes the unit tests with flying colors but can't be deployed at scale. No matter how much documentation a project has it often ends up duplicating code just because the context window is only so large and it doesn't necessarily know where the duplicated code might be. There's all sorts of ways it still needs help to produce good output.
I also wonder how many people are failing to prompt it enough. Some of my prompts are basically "take this and do that and write a function to log the error", but a lot of my prompts are a screen or two of relevant context of the project, what it is we are trying to do, why the obvious solution doesn't work, here's some other code to look at, here's the relevant bugs and some Wiki documentation on the planning of the project, we should use {event sourcing/immutable trees/stored procedures/whatever}, interact with me for questions before starting anything. This is not a complete explanation of what they are doing anymore, but there's still a lot of ways in which what an LLM can really do is style transfer... it is just taking "take this and do that and write a function to log the error" and style-transforming that into source code. If you want it to do something interesting it really helps to give it enough information in the first place for the "style transfer" to get a hold of and do something with. Don't feel silly "explaining it to a computer", you're giving the function enough data to operate on.
Mostly.
Remember the orange trees took the CO2 out of the atmosphere to make the peels. Some of it, probably most of it, is going back into the atmosphere but some of it is going to become soil carbon which could be retained for decades
https://en.wikipedia.org/wiki/Soil_carbon
Soil carbon is like dark matter in that there is a lot of it and it is poorly understood.
It really says something about the current state of affairs that after reading the headline, my first thought was oh god no, the photos are probably all hallucinated...
But it's actually really cool how they used AI to better determine the locations of the photos. I love this!
"Stop paying for"?
But you have to pay for your own S3 bucket as well... and it's generally several times more expensive per terabyte, though this depends on different factors. (Not to mention you might still have to pay for Google if your e.g. Gmail doesn't fit into the free tier anymore.)
If this is supposed to be financially motivated, the creator seems to have it somewhat backwards.
That we accept the lying doesn't mean it's good.
The people who ran partial-feed Usenet services? I think they're the actual heroes of the story, aren't they?
Explain to me how you will 'kill a civilization' without including civilians. It's sort of in the name.
Translucent concrete: https://www.allplan.com/blog/translucent-concrete/
Which llm is best at driving DuckDB currently?
I think H-1Bs have always been good for big companies like IBM, Google and Infosys and bad for startups. I mean, at a certain size, many of your hires are key employees and winning the lottery is not a business plan. To big co's though workers are fungible and if only 20% of the people win you can still open an office in Bangalore or Shanghai and tap foreign talent that way.
I am no fan of the H-1B program and think we should do something else like give out more green cards. I am happy to 'compete' with them if they are getting paid market rates like me, in fact working together with talented people puts my skills on wheels. I have known so many H-1Bs who were treated terribly, like the kind of situations that make HR staff quit.
Cloudflare has long been doing work on PQ (sometimes in conjunction with Google) and rolled out PQ encryption for our customers. You can read about where this all started for us 7 years back: https://blog.cloudflare.com/towards-post-quantum-cryptograph... and four years ago rolled out PQ encryption for all customers: https://blog.cloudflare.com/post-quantum-for-all/
The big change here is that we're going to roll out PQ authentication as well.
One important decision was to make this "included at no extra cost" with every plan. The last thing the Internet needs is blood-sucking parasites charging extra for this.
4 day week advocates for 32 hour week at full pay, which can be afforded by observing aggregate profits in economic systems.
https://www.4dayweek.com/ (global) | https://workfour.org/ (US)
https://www.theguardian.com/commentisfree/2024/nov/21/icelan...
https://news.ycombinator.com/item?id=39992783 (citations)
https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que...
Sounds like Fiberhood is adjacent to https://solarunitedneighbors.org/ ?
That's not how that works. War crimes are war crimes, just like crimes are crimes whether the laws that define them are enforced or not is not relevant.
Morality starts with definition, and there are other ways to deal with these things than courts. For instance, boycotts.
Absolutely. I posted it a while ago but - predictably - it got flagged.
https://news.ycombinator.com/item?id=47674286
The interesting thing is that people do seem to find it.
Are there any efforts to fix this?
"Also, it seems like all the Copilot 'connected experiences' are really just a chat window without any real integration with the applications they are embedded in."
I was triple-booked today. Two of the meetings in question should have had significant overlap between attendees. I figured, hey, there's this Copilot thing here, I'll ask it what the overlap is, that's the sort of thing an AI should be able to do. It comes back and reports that there is one person in both meetings, and that "one person" isn't even me. That doesn't seem right. One of the autocompleted suggestions for the next thing to ask is "show me the entire list of attendees" so I'm like, sure, do that.
It turns out that the API Copilot has access to can only access the first ten attendees of the meetings. Both meetings were much larger than that.
Insert rant here about hobbling 2026 servers with random "plucked out of my bum" limits on processing based on the capabilities of roughly 2000-era servers for the sheer silliness of a default 10-attendee limit being imposed on any API into Outlook.
But also in general what a complete waste of hooking up an amazingly sophisticated AI model to such an impoverished view of the world.
What do you mean? For as long as I remember (back to late 1994) people understood DES to be inadequate; we used DES-EDE and IDEA (and later RC4) instead. What "secrecy" would there have been? The feasibility of breaking DES given a plausible budget goes all the way back to the late 1970s. The first prize given for demonstrating a DES break was only $10,000.
Thanks for this, the anecdote with the lost data was very concerning to me.
I think you're exactly right about the WAL shared memory not crossing the container boundary. EDIT: It looks like WAL works fine across Docker boundaries, see https://news.ycombinator.com/item?id=47637353#47677163
I don't know much about Kamal but I'd look into ways of "pausing" traffic during a deploy - the trick where a proxy pretends that a request is taking another second to finish when it's actually held in the proxy while the two containers switch over.
From https://kamal-deploy.org/docs/upgrading/proxy-changes/ it looks like Kamal 2's new proxy doesn't have this yet, they list "Pausing requests" as "coming soon".
I would personally pay money not to have this thing.
It's wonderful and I love that someone else loves it. The care put into it is fantastic. Vive la différence.
(https://en.wiktionary.org/wiki/vive_la_diff%C3%A9rence for those who may not recognize that phrase.)
Likewise. Trump 1 caused my long term sleeping disorder to re-appear and Trump 2 is having me wonder why I thought it was a great idea to have kids. It is complete insanity that someone like this should be holding the Presidency of the USA.
pAIgliacci: as a large language model, I am unable to experience live comedy.
In fairness, you can also have that experience with Microsoft OneDrive.
> Would We Choose SQLite Again? Yes. For a single-server deployment with moderate write volume, SQLite eliminates an entire category of infrastructure complexity. No connection pool tuning. No database server upgrades. No replication lag.
These are weird reasons. You can just install Postgres or MySQL locally too. Connection pool tuning certainly isn't anything you have to worry about for a moderate write volume. You don't ever need to upgrade the database if you don't want to, since you're not publicly exposing it. There's obviously no replication lag if you're not replicating, which you wouldn't be with a single server.
The reason you don't usually choose SQLite for the web is future-proofing. If you're totally sure you'll always stay single-server forever, then sure, go for it. But if there's even a tiny chance you'll ever need to expand to multiple web servers, then you'll wish you'd chosen a client-server database from the start. And again, you can run Postgres/MySQL locally, on even the tiniest cheapest VPS, basically just as easily as using SQLite.
Yeah, we're (UK) only just at the "occasionally cheap 100% renewables" state, and it's maddeningly slow progress. But it seems like a lot of things are suddenly coming online, like battery storage, and the Scotland-England grid upgrade will happen in the next few years. https://eandt.theiet.org/2026/04/02/ps12bn-plan-upgrade-scot...
5 years is a ridiculous underestimate. 20 is more reasonable.
Demur. RayBan Meta and Oakley Meta glasses are fantastic devices.
The airliner shootdown? The polonium poisonings? Miscellaneous sabotage attempts in Europe? In addition to, you know, the active war.
There's a lot of things that China "might" do but hasn't so far translated into significant violence, beyond the low-intensity border dispute in the mountains with India. Do they have power? Yes. Are they making threats? Other than a war of words with Japan, not really. What is this "history of aggression"?
Between China and the US, only one of those two has made threats to the territorial integrity of Europe.
Someone has their bonus tied to how many copies of Edge are installed.
I like how C# handles this. You're not forced to support cancellation, but it's strongly encouraged. The APIs all take a CancellationToken, which is driven by a CancellationTokenSource from the ultimate caller. This can then either be manually checked, or when you call a library API it will notice and throw an OperationCancelledException.
Edit: note that there is a "wrong" way to do this as well. The Java thread library provides a stop() function. But since that's exogenous, it doesn't necessarily get cleaned up properly. We had to have an effort to purge it from our codebase after discovering that stopping a thread while GRPC was in progress broke all future GRPC calls from all threads, presumably due to some shared data structure being left inconsistent. "Cooperative" (as opposed to preemptive) cancel is much cleaner.
Interesting and fun read - we are well into the terrain of what was completely impossible to do back then. Now I can't wait to see a faster AppleSoft ROM ;-)
Ah, okay. Something like https://www.ietf.org/archive/id/draft-schwenkschuster-oauth-... might also be of interest, though it is still nascent.
(1) Regular storage procedure. Same as all the other software. (see https://www.youtube.com/watch?v=SiyPqbyHXIg)
(2) The usual problem is that a lot of vendors mistake this experience for security: https://www.youtube.com/watch?v=o2ObCoCm61s No matter what you need some strategy for managing credentials for dev, test and production that is not "check them into version control"
(3) Never an external platform. Take a minute. Stop. Breathe. Think about it. Before you get the external platform you have to integrate with one thing. Add the external platform you have to integrate with two things. Don't go there.
(4) Platforms like Zapier rely on management being intimidated by the trauma of developing UI software. In UI software people quote a week to develop something and it takes five. When people quote 30 minutes to make a web scraper they often have it running in 10 minutes.
(5) There is a lot of fear that that web scraper will have to change as the target site changes, and occasionally that's a problem but (a) it is so expensive to make changes to GUIs that most web sites go for years between major changes and (b) a site that ranks well on Google today will probably lose it's rankings if it changes anything major so -- odds are that 10 minute web scraper will still be running in five years.
(6) In the AI coding age people are discovering just how easy it is to write scrapers and other integrations -- it's not that you need AI to code them, but that AI gives people the courage to try.
(7) Always look at the scraper as an option to the API client if it is possible. More often than not the API has restricted functionality and content compared to the web site and authentication is almost always more difficult. (e.g. to authenticate on a web site you just use a http client that has a cookie jar and submit your username and password to the form -- 20 APIs require implementation of 20 different auth methods, 20 scrapers use 1 authentication method, the only thing that is different is what the username and password fields are called)
"say that AI developers should incorporate more real-world diversity into large language model (LLM) training sets,"
Are you kidding me?
How much more "real-world diversity" could they possibly incorporate into the models than the entire freaking Internet and also every scrap of text written on paper the AI companies could get a hold of?
How on Earth could someone think that AIs speak like this because their training set is full of LLM-speak? This is transparently obviously false.
This is the sort of massive, blinding error that calls everything else written in the article into question. Whatever their mental model of AI is it has no resemblance to reality.
Wow I stopped following hardware releases after the GeForce 2 and that was in 2004?
I love Dropbox, I pay annually. I use the open source client https://maestral.app/ on the Mac for workstation use, but also integrate other systems with my Dropbox account using their API. If someone built an open source Dropbox server that sat on top of S3 compatible storage, I would not only use it, but pay to have that optionality to get out of Dropbox if they ever enshittify. I can recognize form and function worth paying for ("value"), but still want an exit plan. It's not about the spend, it is about data sovereignty. This is colloquially referred to as “vendor and third party risk management.”
You're talking about system altering the environment. GP was talking about the system altering itself. The system is a massive self-stabilizing collection of feedback loops. Unlike the static environment[0], it's incredibly hard to intentionally move such system to a different equilibrium. If it weren't, we'd already solved all the thorny world problems long ago.
--
[0] - Any self-stabilizing system that operates much slower than us - such as ecosystems or climate - is, from our perspective, static.
> but that there's app for mobile and desktop operating systems which deeply integrates it in the OS so it's just like a local folder that's magically synced
Which mobile OS would that be?
The big reason I stopped being excited about cloud storage is that on mobile, from what I can tell, none of the major providers care about "folder that syncs" experience. You only get an app that lets you view remote storage. The only proper "folder that syncs" I had working on my phone so far was provided via Syncthing, but maintaining that turned out to be more effort than my tiny attention span can afford these days.
Germans are notorious for having a hard time procuring straightforward things you could buy out of a catalog, like helmets
https://www.dw.com/en/germanys-military-upgrade-hampered-by-...
Warships, on the other hand, are devilishly hard.
This is so accurate. When you make one component of a system less expensive (in terms of time or money), the complements get more valuable.
What are the complements of code?
- Distribution
- Operation
- Marketing
- User experience
- Attention
All of these are going to get more valuable. We're at the beginning of this, but I don't see those components becoming less important.
This quote also hit home: "But software is mostly a thing people use, and getting people to use things is not a building problem. It never was."
As engineers, we think that everything is a building problem. Building is fun and seems less risky. It's our natural bent. That's why the canonical advice for software startup founders is to talk to customers as much as possible.
I've seen quite a few blog posts of "old door on breeze blocks", the canonical brutalist/abandoned warehouse desk.
Yet another way the mere possibility of AI/LLM being involved diminishes the value of ALL text.
If there is constant vigilance on the part of the reader as to how it was created, meaning and value become secondary, a sure path to the death of reading as a joy.
Hmm. How easy is it to swap all these third party authenticators (Steam, Microsoft Authenticator, etc) and passkeys, etc?
Website as of May 02013 [sic]: https://sb.longnow.org/SB_homepage/Home.html
Books in Progress: https://books.worksinprogress.co/
The outrageous "impossible/sarcastic" Onion stories from decades ago are hardly fiction these days.
I've thought browser extensions and plugins were bad news since '98 or so when my boomer relatives were browsing on 640x480 screens that only had about 80px or so vertical space for browsing the web because their browsers were stuffed with worthless toolbars. They thought it was normal the same way they were about to think it was normal to watch SD television stretched out on an HD screen.
> magine only a smart part of the fraction to pay oil companies, to build the streets, all the profs at university for "ICE related" tech from the last 60 years, all the educated engineers, all the lobby institutions etc - and pour this into EE & battery research -> where could we be today?
There is a more straightforward counterfactual. If the hippies had just sat the fuck down and the developed countries had nuclearized their grid the way France did, CO2 emissions would be so much lower that we could afford to have the entire developing world increase its CO2 emissions up to the French level while remaining within the same total global emissions level as today. And we would have had a huge runway for further decarbonizing our economies because we could have done all that by the 1980s like France did.
Isn't it too early in this timeline to have a Rei Toei?
You really start wondering when they are introduced and it all kind of clicks at the end, when we realize we had the rug pulled from under our feet when the book started, and we only know it by the point we land on our faces.
> Russia exchanges 20+ bodies for each received from The Ukraine
Source?
Milla Jovovich is one of the developers: https://www.instagram.com/reel/DWzNnqwD2Lu/?igsh=MWZ2OWN0bWx...
Wasn’t that in response to Trump posting that he’d hit theirs?
https://www.pbs.org/newshour/politics/watch-live-hegseth-and...
> In a Truth Social post on March 30, Trump warned that the U.S. would obliterate "all of their Electric Generating Plants, Oil Wells and Kharg Island (and possibly all desalinization plants!), which we have purposefully not yet 'touched.'"
The US really pulled a Russia with this "special military operation".
That mattered on the PC evolution, it misses many others e.g TMS34010.
Nutrition is run on fads - see whole fitness and healthy food bullshit. Nutrition supplements ended up being a loophole that allows pharmacies and pharma companies to sell all kinds of random stuff that they can't or don't want to, show is safe, or doing anything at all.
> It’s also alarming what’s possible when reassembling photons en masse becomes commoditized.
Photons Be Free?
Ah, the magic times when screen resolutions were large enough to display lots of information, in proper 4:3 aspect ratio, just before they got flattened and the industry started treating them as short view distance TVs.
> their tech is used to bomb children
If you're talking about the school in Iran, that wasn't OpenAI. That was a Palantir system that pre-dates OAI by a few years, and was due to a bad entry in a spreadsheet, that showed the building as military housing. Which it was a few years ago.
180 people lost their lives because of bad data in spreadsheet, but not AI.
Translating code to C usually results in some nearly unreadable code. I submit the C++ to C translator, cfront, as evidence. I've looked into using C as a target backend now and then, but always "noped" out of it.
I was pleasantly surprised to discover, however, that C code can be readily translated to D.
The unwanted activities, even without the way this one badly turned out to be, is the main reason I avoid them as much as I can.
I don't want to bound being tied to random people carrying eggs without breaking them from point to point, being tied down from the feet and get something hidden in a mount of flour, obstacules race, getting out of an escape room,....
Real examples from past experiences that I failed to avoid.
Nope, Limbo is the missing link, plus a bit of Oberon-2 on top.
Additionally Limbo also carries the lessons learnt out of Alef failure in Plan 9.
We had safer programming languages fitting into 640 KB with MS-DOS, which is why I keep repeating youngsters don't get how much they can do with an ESP32.
> prescription system into nothing more than a parasitic middleman/gatekeeper
Agree. Unless it's addictive or in short supply, you should be able to buy it OTC.
Nah, this is an advertisement for the toxic work culture in countries where people get pushed to live to work, make the workplace the reason of their existence, know noone outside work.
In most places around the planet, if given the option, most people will work to live, not live to work.
That purpose and passion will mean nothing when the time to lie down on the place of eternal rest comes.
Lack of imagination and vision? Maybe, I rather have it that way.
Star Control 2 —- one of the greatest action adventure games ever made
There's pedantic, and then there's needlessly pedantic.
xcancel is a valid workaround for X links on Hacker News and is sufficient for original attribution.
Perhaps this is what the human is for - to apply an EQ curve.
Precisely this. The industry has themselves to blame.
To be clear I don't think it's actually reasonable to suggest GLP1s should be OTC in 2026. Were that to happen it would be part of a regime change in drug regulation that I'd categorically oppose. The timeline on GLP1s (unlike Zofran) doesn't support it. There are arguments for why your doctors would want to know that you're taking it, and on what schedule. But it should be extremely easy to get.
Somehow we must be doing this wrong.
"Do you realize that the human brain has been liken to an electronic brain? Someone said and I don't know whether he is right or not, but he said, if the human brain were put together on the basis of an IBM electronic brain, it would take 7 buildings the size of the Empire State Building to house it, it would take all the water of the Niagara River to cool it, and all of the power generated by the Niagara River to operate it." (Sermon by Paris Reidhead, circa 1950s.[1])
We're there on size and power. Is there some more efficient way to do this?
[1] https://www.sermonindex.net/speakers/paris-reidhead/the-trag...
It would if the chickens formed a business structure that was owned and democratically controlled by its member-owners.
> I think if the GOP looses big in 2026 to the point Trump can be impeached and removed from office and his minions are convicted for corruption, I think it will recover.
Assuming party-line voting on the issue with no defections from either party, that requires the Democrats to win 33 of the 35 Senate seats up for election (if they hold every one that they currently hold, it requires them to take 20 of the 22 Republican-held seats.)
> I believe the world is waiting for Nov 2026 before making big changes.
I don't think the world is waiting at all, it is just taking time to work out the shape of the big changes, whether its European defense integration to replace the historically-pivotal role of the US, or any of large number of other changes nations are actively and openly working on.
Now, if the present direction of the US changes, some of those efforts may be abandoned or deprioritized, but "could potentially stop work" is not the same thing as "waiting to start".
What's the mainstream messenger you're considering that doesn't maintain serverside contact lists?
> OpenAI has closed many of its safety-focussed teams
A paper with "ideas to keep people first" was (coincidentally?) published today:
• Worker perspectives
• AI-first entrepreneurs
• Right to AI
• Accelerate grid expansion
• Accelerate scientific discovery and scale the benefits.
• Modernize the tax base
• Public Wealth Fund
• Efficiency dividends
• Adaptive safety nets that work for everyone
• Portable benefits
• Pathways into human-centered work
https://openai.com/index/industrial-policy-for-the-intellige...
> Looking at the US from outside, I am starting to wonder how close they are to a societal collapse.
The US is not particularly close (at least, not highly probable) to a societal collapse; that's, in a sense, an overly optimistic position. Government, order, and structured society are not in imminent danger of collapse.
It is very close to a transition away from liberal democratic government in favor of something very different. [0]
[0] Arguably, past that point, but close to the point where it becomes widely accepted that the it wasn't a temporary aberration where the basic cultural and institutional supports were still intact and capable of snapping things back.
The title of this article recalled that exact memory for me too. Not only Adobe but cracks for a few other expensive softwares needed the same.
...and then Adobe (unintentionally) gave away free downloads of CS2, with valid serial numbers, once they shut down the activation servers. You can still find them on the Internet Archive.
Can it summarize it down to a non-post?
This isn't in the slightest bit complicated. Wikipedia does not allow AI edits or unregistered bots. This was both. They banned it. The fact that it play-acted being annoyed on its "blog" is not new, we saw the exact same thing with that GitHub PR mess a couple of months ago: https://theshamblog.com/an-ai-agent-published-a-hit-piece-on...
Marketing can be a lot of different things.
I brought on a high-touch salesperson on spec years ago and it did not work out. He and I were really successful at getting audiences with people but we never made the sales we were looking for and, worse, he lost me small cheap jobs that I could have sold myself. He'd probably say it was a product problem and he might have been right but later on I found out I wasn't the only person who had the same experience with him.
For some products you need those kind of skills. I've met people like him who really are worth their weight in gold.
For other products you need somebody who can make an Adwords campaign, analyzes the analytics, refine it and repeat. That kind of person can be worth their weight in gold too.
For this conversation to be productive you have to have some idea if you need one or the other or a bit of both.
Paper:
"Going Light: The Effects of Minimal Mobile Phone Adoption on Young Adults’ Well-Being Depend on Motivation"
https://osf.io/preprints/psyarxiv/ubkeq_v1 | https://doi.org/10.31234/osf.io/ubkeq_v1
How credible are the claims that the Claude Code source code is bad?
AI naysayers are heavily incentivized to find fault with it, but in my experience it's pretty rare to see a codebase of that size where it's not easy to pick out "bad code" examples.
Are there any relatively neutral parties who've evaluated the code and found it to be obviously junk?
Short film: https://news.ycombinator.com/item?id=47363133
My mission was to create a Sci-fi Noir episode of the Twilight Zone by way of David Lynch and Phillip K. Dick. Big shout out to Dan Erickson and Ben Stiller for speaking to my soul in Severance. And of course to the enigmatic QNTM for sparking my imagination with the original story, "We Need to Talk About 55". Long live the SCP Foundation.
If you look at the failure modes, they very closely resemble the failure modes of humans in equivalent situations. I'd say that, in practice, anthropomorphic view is actually the most informative we have about failure modes.
They are if you are, in fact, holding it wrong.
As was the usual case in most of the few years LLMs existed in this world.
Think not of iPhone antennas - think of a humble hammer. A hammer has three ends to hold by, and no amount of UI/UX and product design thinking will make the end you like to hold to be a good choice when you want to drive a Torx screw.
Same as how I expect a coin to come up heads 50% of the time.
https://www.fdic.gov/news/press-releases/2026/fdic-approves-...
https://www.fdic.gov/board/federal-register-notice-genius-ac...