Skip to main content

2026


RAIE: Region-Aware Incremental Preference Editing with LoRA for LLM-based Recommendation

We study the problem of user preference drift in LLM-based recommendation and propose RAIE, a region-aware incremental editing framework. Instead of global updates or instance-level edits, RAIE introduces preference regions as structured units for localized adaptation. This design enables efficient continual learning while preserving stable preferences.

Please Refuse to Answer Me! Mitigating Over-Refusal in LLMs via Adaptive Contrastive Decoding

Safety-aligned LLMs frequently generate refusal responses to harmless queries due to superficial lexical similarity with malicious ones — a phenomenon known as over-refusal. Existing approaches either reduce over-refusals or preserve safety, but rarely achieve both simultaneously. We propose AdaCD, a training-free and model-agnostic adaptive contrastive decoding method that dynamically adjusts the refusal token distribution to mitigate over-refusal while maintaining or even enhancing model safety.

2025


🎊 Official Lab Website Launched

After careful design and development, the official website of the Knowledge Intelligence Lab (KILab) at Sun Yat-sen University is now fully upgraded and online! The new website adopts modern design concepts, providing visitors with a clearer and more intuitive browsing experience.

2023