
You can probably find some models tuned for long-form writing on Hugging Face. I've had decent results with QwQ-32B (which I can run on my desktop) and Mistral Large (which I have to run on my server). Generating and refining an outline before writing the whole piece can help, and you can also split the piece across multiple outputs (working a paragraph or two at a time, for instance). So far I've found it to be a tough process, with mixed results.
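
For what it's worth, here is a rough sketch of that outline-then-chunks workflow in Python. It is only an illustration: `write_long_form` and the `generate` callable are made-up names, the prompts are placeholders, and you would plug in whatever client you use to call your local model.

```python
from typing import Callable, List


def write_long_form(
    topic: str,
    generate: Callable[[str], str],  # hypothetical: takes a prompt, returns model text
    refine_rounds: int = 1,
) -> str:
    """Draft an outline, refine it, then write one section per outline line."""
    # Step 1: ask for an outline first instead of the whole piece.
    outline = generate(f"Write a numbered outline for a piece about: {topic}")

    # Step 2: refine the outline a few times before committing to it.
    for _ in range(refine_rounds):
        outline = generate(
            "Improve this outline. Return only the revised outline.\n\n" + outline
        )

    # Treat each non-empty outline line as one section to write.
    headings = [line.strip() for line in outline.splitlines() if line.strip()]

    # Step 3: write a paragraph or two per heading, one model call each.
    sections: List[str] = []
    for heading in headings:
        # Pass only the previous section as context to keep prompts short.
        previous = sections[-1] if sections else "(this is the start of the piece)"
        prompt = (
            f"Full outline:\n{outline}\n\n"
            f"Previous section:\n{previous}\n\n"
            f"Write one or two paragraphs for this outline item: {heading}"
        )
        sections.append(generate(prompt).strip())

    return "\n\n".join(sections)
```

Passing only the previous section as context is an assumption made to keep each prompt small; you could just as well pass a running summary of everything written so far.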


Thank you, will try out your suggestions

Have you used something like a director model to supervise the output? If so, could you comment on how effective it was and share any tips?


Nope, sounds neat though. There's so much to keep up with in this space.



