Submissions from sebastianraschka.com

		Kimi K3 Architecture Overview and Notes (sebastianraschka.com)
		503 points by ModelForge 4 days ago \| past \| 111 comments
		Build a Reasoning Model (From Scratch) (sebastianraschka.com)
		4 points by handfuloflight 11 days ago \| past \| discuss
		Controlling Reasoning Effort in LLMs (sebastianraschka.com)
		84 points by ibobev 12 days ago \| past \| 8 comments
		Controlling Reasoning Effort in LLMs (sebastianraschka.com)
		4 points by vismit2000 13 days ago \| past \| discuss
		Controlling Reasoning Effort in LLMs (sebastianraschka.com)
		1 point by matt_d 13 days ago \| past \| discuss
		Inkling: A New Open-Weight 975B Moe with a Few Surprises (sebastianraschka.com)
		3 points by ModelForge 16 days ago \| past
		Using Local Coding Agents (sebastianraschka.com)
		2 points by ibobev 16 days ago \| past
		Using Local Coding Agents – By Sebastian Raschka, PhD (sebastianraschka.com)
		10 points by rbanffy 28 days ago \| past \| 1 comment
		Using Local Coding Agents (sebastianraschka.com)
		5 points by mariuz 33 days ago \| past \| 1 comment
		Using Local Coding Agents (sebastianraschka.com)
		3 points by matt_d 34 days ago \| past
		Using Local Coding Agents (sebastianraschka.com)
		2 points by Anon84 35 days ago \| past
		LLM Research Papers: The 2026 List (January to May) (sebastianraschka.com)
		5 points by ibobev 54 days ago \| past
		KV Sharing, MHC, and Compressed Attention (sebastianraschka.com)
		35 points by gmays 74 days ago \| past \| 3 comments
		Developments in LLM Architectures: KV Sharing, MHC, and Compressed Attention (sebastianraschka.com)
		4 points by ibobev 75 days ago \| past
		Recent Developments in LLM Architectures: KV Sharing, MHC, Compressed Attention (sebastianraschka.com)
		3 points by pretext 76 days ago \| past
		Recent Developments in LLM Architectures: KV Sharing, MHC, Compressed Attention (sebastianraschka.com)
		2 points by vismit2000 77 days ago \| past
		My Workflow for Understanding LLM Architectures (sebastianraschka.com)
		4 points by ibobev 3 months ago \| past
		Components of a Coding Agent (sebastianraschka.com)
		300 points by MindGods 3 months ago \| past \| 90 comments
		Claude Code's Real Secret Sauce Isn't the Model (sebastianraschka.com)
		6 points by ModelForge 4 months ago \| past
		A Visual Guide to Attention Variants in Modern LLMs (sebastianraschka.com)
		9 points by Brajeshwar 4 months ago \| past
		A Visual Guide to Attention Variants in Modern LLMs (sebastianraschka.com)
		23 points by Anon84 4 months ago \| past \| 1 comment
		LLM Architecture Gallery (sebastianraschka.com)
		586 points by tzury 4 months ago \| past \| 41 comments
		A Round Up and Comparison of 10 Open-Weight LLM Releases in Spring 2026 (sebastianraschka.com)
		4 points by MindGods 5 months ago \| past
		Categories of Inference-Time Scaling for Improved LLM Reasoning (sebastianraschka.com)
		1 point by ibobev 6 months ago \| past
		Understanding and Coding the Self-Attention Mechanism of LLMs from Scratch (sebastianraschka.com)
		1 point by onurkanbkrc 6 months ago \| past \| 1 comment
		The State of LLMs 2025: Progress, Problems, and Predictions (sebastianraschka.com)
		1 point by nsainsbury 6 months ago \| past
		The State of LLMs 2025: Progress, Problems, and Predictions (sebastianraschka.com)
		3 points by ModelForge 7 months ago \| past
		The State of LLMs 2025: Progress, Progress, and Predictions (sebastianraschka.com)
		4 points by ibobev 7 months ago \| past
		The State of LLMs 2025: Progress, Progress, and Predictions (sebastianraschka.com)
		9 points by vismit2000 7 months ago \| past
		New LLM Pre-Training and Post-Training Paradigms (sebastianraschka.com)
		2 points by lr0 7 months ago \| past \| 1 comment
		More