> Lying about model capability is right now the lingua franca of the cloud AI business model
Lying about your lab's capabilities != Lying about model capability
Exaggerating the capabilities of a new model that you've actually trained in press bulletins can be called marketing. Merging two models and claiming that you trained a new model is plain lazy.
I got sent a 6-page spec document with a footnote that says "this spec was created with AI, so it may have nonsensical sections. Feel free to fix them."
> Is it the end result that matters, or the process of creating it?
I think this comment misses the point. Let's forget about AI and assume that there are three developers: A, B, and C. Now, A is supposed to make a PR, but instead they describe it to B, and B writes the code. C reviews the PR and gives feedback. A passes the feedback and the responses between B and C.
As you see, this is not easy for either B or C, and A is totally useless in this scenario. When you replace B with an LLM that doesn't get tired or bored, only C complains about the process.
HN is probably astroturfed, but it is also one of those places where a lot of actual subject experts frequent. I don't find it unusual that a new model gets instantly posted here. It is also not unexpected that the most recently released model is the best model.
> I have not accepted payments from LLM vendors, but I am frequently invited to preview new LLM products and features from organizations that include OpenAI, Anthropic, Gemini and Mistral, often under NDA or subject to an embargo. This often also includes free API credits and invitations to events.
Your username does not check out.
reply