Hacker Newsnew | past | comments | ask | show | jobs | submit | more E-Reverance's commentslogin

How was he equally brutal


I haven't tried this myself and this might be absurd, but attending PhD defences might be an interesting way to meet new people


It should be noted that this is NOT the official scores on the private evaluation set


Here it matters much less than in generic LLMs though. There's no chance of test set leakage since the network is not general purpose / not trained on the internet.


I didn't know how to title this. I definitely don't believe his proof claims but I found this whole event to be psychologically interesting


> Opus 4.5 likes it

And to think, people said peer review in academia is dead.


One can care about both


> I do actually believe that zero teenagers should make banking apps or run non-profits.

That sounds like a lot of fun and should be a pretty social experience.

Also I'm going to assume his parents are proud, which should put his family at ease.


Surprised there wasn't any mention of Equilibrium Matching [1] in the future work section

[1] https://raywang4.github.io/equilibrium_matching/


Just for reference, the main author's stance on god : https://youtu.be/k_VBzweMIlM?t=125


The article has a hyperlink on it : "Bushart shared an image[1] of President Donald Trump with the quote"

[1] https://x.com/aaronterr1/status/1970272191884468241


Google search for 'trump "get over it" cartoon shooting' turns up many cartoon images. This is a major meme.


>Figure 18: The Taiji-DDN exhibits a surprising similarity to the ancient Chinese philosophy of Taiji. Records of Taiji can be traced back to the I Ching (Book of Changes) from the late 9th century BC, often described by the quote on the left (a) that explains the universe’s generation and transformation. This description coincidentally also summarizes the generation process and the transformations in the generative space of Taiji-DDN. Moreover, the diagram (b) from the book Tom (2013) bears a closely resemblance to the tree structure of DDN’s latent fig. 1b. Therefore, we have named the DDN with K = 2 as Taiji-DDN.

Very nitpicky comment, but I personally find such things to make for a bad impression. To be more specific, branching structures are a fairly universal idea, so the choice of relating it ancient proverbs instead of something much mundane raises an eyebrow.


Not sure about the LLM community, but it's not uncommon in other computer-science communities to assign common names to models and implementations.

Common names will always be influenced by the authors culture, so it seems unfair to exclude a name based on any individual opinion that it is or isn't a mundane choice.

Unless you would also exclude names based on their relationship to old, but culturally relevant texts in the western tradition to, e.g., the bible.


I don't take issue with the name itself, but more so having a whole figure and paragraph dedicated to it in what is supposed to be a technical paper.


I think it's just a fun justification for a somewhat obscure naming choice, I don't think it's trying to introduce woo


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: