When Fine-Tuning Bites Back: The Hidden Safety Drift in Vision-Language Agents
Customization sounds harmless. A company takes a capable vision-language model, adds a lightweight adapter, fine-tunes it on a narrow internal dataset, and calls the result “domain-specialized.” The dashboard still has green boxes. The model still answers normal text questions. The update is cheap, fast, and reversible in theory. Everyone goes home with the comfortable feeling that parameter-efficient fine-tuning is basically a productivity tool with a nerdy name. ...