Science ❯ Computer Science ❯ Machine Learning ❯ Language Models
Fresh arXiv papers report measurable gains from agentic designs alongside a black-box attack that forces refusals across models.