Quote:
|
My beef is with the METR graph which isn't mentioned once in the paper
|
Quote:
|
Modern AI capabilities are defined by the system, not just the raw pre-trained base weight matrix.
|
That's literally what the graph is, knowingly and on purpose comparing raw models vs models with harnesses it's apples and oranges and they graphed it
Quote:
|
In early 2026, tech analysts and AI researchers heavily panned METR’s capability timelines. Critics pointed out that METR's data is plagued by basic errors
|
Maybe ask ai if that's a fair comparison, point 3 is wrong also, you can totally put 5.1 in a harness and it would score higher point 1
Quote:
|
He argues that performance increases are just coming from the "harness"
|
lol wut when did I say the harness is where the performance is coming from, this entire output is garbage I can only imagine what kind of fucked up prompt you put in to get this to be spit out and be this confused, try Claude or Chatgpt not grok or Google overview
Quote:
|
His "gotcha" is literally just him summarizing a section of the paper he thinks he discovered
|
I didn't even read the paper, this is just common ai knowledge in articles and current debates, esp in the AGI/ASI debate people at Google disagree with other people at Google about this.
this is the same company that had a mustard tiger named Blake Lemoine who thought a now obsolete chatbot from years ago had a fucking soul because of his religious views, smart people can be retarded and have views on super ai gonna kill us all or turn the entire universe into paperclips, one dipshit taken seriously until recently was even afraid of the concept of a ASI in the future with time traveling capabilities that would torture him for eternity for not working on ai, these are just thought experiments same as AGI/ASI