Cover image

LoRA and Order: The Strange Case for One Well-Placed Adapter

Opening — Why this matters now Enterprise AI is entering its less glamorous, more useful phase: not “Can we connect an LLM to everything?” but “Can we adapt it without making the GPU bill look like a small infrastructure project?” Fine-tuning still matters. Retrieval helps with knowledge access, prompt engineering helps with behavior shaping, and agent frameworks help with workflow orchestration. But many businesses eventually hit the same wall: the base model is close, yet not close enough. It needs domain style, task format, compliance habits, tool-use discipline, or workflow-specific judgment. That usually means some form of supervised fine-tuning. ...

May 9, 2026 · 15 min · Zelina
Cover image

Trex Marks the Spot: When AI Starts Training AI

Opening — Why this matters now Everyone wants custom AI. Few want the invoices, GPU queues, brittle data pipelines, and endless hyperparameter arguments required to build it. Fine-tuning large language models remains one of the least glamorous bottlenecks in modern AI deployment. It is expensive, iterative, and strangely dependent on whoever in the room has the strongest opinions. ...

April 16, 2026 · 4 min · Zelina
Cover image

Active Minds, Efficient Machines: The Bayesian Shortcut in RLHF

Why this matters now Reinforcement Learning from Human Feedback (RLHF) has become the de facto standard for aligning large language models with human values. Yet, the process remains painfully inefficient—annotators evaluate thousands of pairs, most of which offer little new information. As AI models scale, so does the human cost. The question is no longer can we align models, but can we afford to keep doing it this way? A recent paper from Politecnico di Milano proposes a pragmatic answer: inject Bayesian intelligence into the feedback loop. Their hybrid framework—Bayesian RLHF—blends the scalability of neural reinforcement learning with the data thriftiness of Bayesian optimization. The result: smarter questions, faster convergence, and fewer wasted clicks. ...

November 8, 2025 · 4 min · Zelina