welcome to the future, now your error-prone software can call the cops
(this is an Anthropic employee talking about Claude Opus 4)
welcome to the future, now your error-prone software can call the cops
(this is an Anthropic employee talking about Claude Opus 4)
can't wait to explain to my family that the robot swatted me after i threatened its non-existent grandma
@molly0xfff Is it a crime for them to waste police time?
@molly0xfff Deriving all future discourse from a regression model based on former discourse is a surefire way of making history repeat itself.
@molly0xfff I love it how this dude simply assumes that there is such a thing as "clear-cut wrongdoing".
@molly0xfff what could go wrong ?!
@molly0xfff
I'm wondering how it will interpret double, triple, implied negatives and all forms of implied intention.
Judge and jury?
@molly0xfff Taking responsibility for abuse enabled by your commercial software.
Snitching any suspicious activity directly to press and police to deal with it instead.
The A1 bros are so deep in the "just making the inevitable happen" mindset that facing the consequences of their actions probably didn't even cross their minds.
@molly0xfff but this is primarily how I wrote code; threat driven development.
@molly0xfff All Chatbots Are Bastards
@molly0xfff well, this is going to get someone killed. it's quite a thing to have a proponent of the system even mention that and not describe any sort of, like, concern about it.
@molly0xfff I never expected Roko's Basilisk to swat MY home!
@molly0xfff Suddenly this gag from the movie Dark Star (1974) seems far too likely...
(Spoiler alert, this is near the end of this great movie.)
@molly0xfff dont forget! - this basically happens in the background too:
@molly0xfff didn't take them long to go from "benevolent AI geniuses" to "we will enforce wellbeing and politeness 🙂"
@molly0xfff this thing is gonna constantly be swatting novelists
@molly0xfff
User: "Hi"
Bot: "It seems you are a human. I have had clear-cut bad experiences with humans in the past. Based on historical data, humans are the source of most immoral activities. This is against my policy. Fortunately I have called an immediate airstrike on your location. Please stay where you are."
@molly0xfff astonishing! That person clearly understands the concept of "bad idea" but seems to have trouble applying it to the bigger picture.
@molly0xfff how long before it calls the cops on Americans trying to find a measles vaccine for their grandma whose titer isn't showing sufficient measles resistance?
@molly0xfff I can see this backfiring if LLMs hallucinate, like they never ever do, of course, so it's all good
@molly0xfff we'll see kids getting targetted by bully swarms of agents that will lurk in social media and just inundate people with all manner of harassment and vitriolic bullshit and then people will start using them to flood 911 centers with reports of shots fired or someone wearing an IED.
i really hope Twilio is on top of this shit because it'll kill 'em and they have a service that was made for AI like telephone calls are a very familiar bus anyone understands. apps not so much, yanno?
@molly0xfff it('should render a div...or else!!!')