

Ah, good plan. I didn’t think of that. I come from a country which doesn’t have zoning. (Planing applications are all taken on a case by case basis here.)
Ah, good plan. I didn’t think of that. I come from a country which doesn’t have zoning. (Planing applications are all taken on a case by case basis here.)
How does farming fit into this picture?
Doesn’t Toronto have the tram lines east west or trolley busses and buses with cheap flat price through ticketing for each journey?
The cables over and underground run from the cheap, green hydroelectric power?
If it’s cheap, regular, reliable with through ticketing, it’s good public transport, not bad.
You keep pretending all over this thread that you would be increasing the income of everyone. You’re ignoring completely that you change the tax system so that wealthy people pay more than at present so it’s not inventing money, it’s redistributing it. You also cancel a bunch of social security programs that it replaces.
Your sums are based on false assumptions and you’re just spreading your FUD which is itself based on ignorance and fear. Stop trying to sound like you know something about it when you keep multiplying the population by the payment as if that was relevant in isolation of everything else that’s part of UBI.
If you want to sound knowledgeable, first so the reading work of learning.
But I think you don’t want to learn you just want to criticise because you don’t like the idea of money going from rich to poor.
Tell me you don’t know how UBI works in design or in practice without telling me you haven’t learned much about it at all.
Having said that, I’m finding some of the arguments the other way here fairly convincing.
But conversely, I remember in 2010 I asked a youth group what they thought the main parties were about. They said that Labour were about making education better and the Conservatives were about making the NHS better. None of them remembered that the Conservatives massively underfunded the NHS and always prioritised tax cuts over public services. I was right. They were wrong.
So much this. 16 year olds rarely know (or care) much about politics, AT ALL.
This will make it much more likely that young people will acquire the habit of not voting.
Make them wait till they want it.
Me, absolutely me. Full on audio, my voice. Not all the time, but a lot of the time.
This is you
I already told you my experience of the crapness of LLMs and even explained why I can’t share the prompt etc. You clearly weren’t listening or are incapable of taking in information.
There’s also all the testing done by the people talked about in the article we’re discussing which you’re also irrationally dismissing.
You have extreme confirmation bias.
Everything you hear that disagrees with your absurd faith in the accuracy of the extreme blagging of LLMs gets dismissed for any excuse you can come up with.
It’s like you didn’t listen to anything I ever said, or you discounted everything I said as fiction, but everything your dear LLM said is gospel truth in your eyes. It’s utterly irrational. You have to be trolling me now.
it’s so good at parsing text and documents, summarizing
No. Not when it matters. It makes stuff up. The less you carefully check every single fucking thing it says, the more likely you are to believe some lies it subtly slipped in as it went along. If truth doesn’t matter, go ahead and use LLMs.
If you just want some ideas that you’re going to sift through, independently verify and check for yourself with extreme skepticism as if Donald Trump were telling you how to achieve world peace, great, you’re using LLMs effectively.
But if you’re trusting it, you’re doing it very, very wrong and you’re going to get humiliated because other people are going to catch you out in repeating an LLM’s bullshit.
You’re better off asking one human to do the same task ten times. Humans get better and faster at things as they go along. Always slower than an LLM, but LLMs get more and more likely to veer off on some flight of fancy, further and further from reality, the more it says to you. The chances of it staying factual in the long term are really low.
It’s a born bullshitter. It knows a little about a lot, but it has no clue what’s real and what’s made up, or it doesn’t care.
If you want some text quickly, that sounds right, but you genuinely don’t care whether it is right at all, go for it, use an LLM. It’ll be great at that.
I would be in breach of contract to tell you the details. How about you just stop trying to blame me for the clear and obvious lies that the LLM churned out and start believing that LLMs ARE are strikingly fallible, because, buddy, you have your head so far in the sand on this issue it’s weird.
The solution to the problem was to realise that an LLM cannot be trusted for accuracy even if the first few results are completely accurate, the bullshit well creep in. Don’t trust the LLM. Check every fucking thing.
In the end I wrote a quick script that broke the input up on tab characters and wrote the sentence. That’s how formulaic it was. I regretted deeply trying to get an LLM to use data.
The frustrating thing is that it is clearly capable of doing the task some of the time, but drifting off into FANTASY is its strong suit, and it doesn’t matter how firmly or how often you ask it to be accurate or use the input carefully. It’s going to lie to you before long. It’s an LLM. Bullshitting is what it does. Get it to do ONE THING only, then check the fuck out of its answer. Don’t trust it to tell you the truth any more than you would trust Donald J Trump to.
Whereas if you ask a human to do the same thing ten times, the probability that they get all ten right is astronomically higher than 0.0000059049.
I agree it was a dumb comparison to start off with.
I wasn’t the one who made it, but the license issue is the logical conclusion if OP insists on the comparison.
Again with dismissing the evidence of my own eyes!
I wasn’t asking it to do calculations, I was asking it to put the data into a super formulaic sentence. It was good at the first couple of rows then it would get stuck in a rut and start lying. It was crap. A seven year old would have done it far better, and if I’d told a seven year old that they had made a couple of mistakes and to check it carefully, they would have done.
Again, I didn’t read it in a fucking article, I read it on my fucking computer screen, so if you’d stop fucking telling me I’m stupid for using it the way it fucking told me I could use it, or that I’m stupid for believing what the media tell me about LLMs, when all I’m doing is telling you my own experience, you’d sound a lot less like a desperate troll or someone who is completely unable to assimilate new information that differs from your dogma.
Wow. 30% accuracy was the high score!
From the article:
Testing agents at the office
For a reality check, CMU researchers have developed a benchmark to evaluate how AI agents perform when given common knowledge work tasks like browsing the web, writing code, running applications, and communicating with coworkers.
They call it TheAgentCompany. It’s a simulation environment designed to mimic a small software firm and its business operations. They did so to help clarify the debate between AI believers who argue that the majority of human labor can be automated and AI skeptics who see such claims as part of a gigantic AI grift.
the CMU boffins put the following models through their paces and evaluated them based on the task success rates. The results were underwhelming.
⚫ Gemini-2.5-Pro (30.3 percent)
⚫ Claude-3.7-Sonnet (26.3 percent)
⚫ Claude-3.5-Sonnet (24 percent)
⚫ Gemini-2.0-Flash (11.4 percent)
⚫ GPT-4o (8.6 percent)
⚫ o3-mini (4.0 percent)
⚫ Gemini-1.5-Pro (3.4 percent)
⚫ Amazon-Nova-Pro-v1 (1.7 percent)
⚫ Llama-3.1-405b (7.4 percent)
⚫ Llama-3.3-70b (6.9 percent),
⚫ Qwen-2.5-72b (5.7 percent),
⚫ Llama-3.1-70b (1.7 percent)
⚫ Qwen-2-72b (1.1 percent).
“We find in experiments that the best-performing model, Gemini 2.5 Pro, was able to autonomously perform 30.3 percent of the provided tests to completion, and achieve a score of 39.3 percent on our metric that provides extra credit for partially completed tasks,” the authors state in their paper
Why are you giving it data
Because there’s a button for that.
It’s output is dependent on the input
This thing that you said… It’s false.
Trump is a weak, weak man who is genuinely upset when people don’t pander to him.