UW-NSL
Pinned Loading
Repositories
- Temporal_Forgetting Public
uw-nsl/Temporal_Forgetting’s past year of commit activity - safechain Public
SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities
uw-nsl/safechain’s past year of commit activity - ChatBug Public
[AAAI25] Official Repo of Paper `ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates`
uw-nsl/ChatBug’s past year of commit activity - kodcode Public Forked from KodCode-AI/kodcode
Generate diverse coding questions and verifiable solutions - all in one framework
uw-nsl/kodcode’s past year of commit activity - CleanGen Public
[EMNLP 24] Official Implementation of CLEANGEN: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models
uw-nsl/CleanGen’s past year of commit activity - ArtPrompt Public
[ACL24] Official Repo of Paper `ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs`
uw-nsl/ArtPrompt’s past year of commit activity - SafeDecoding Public
Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding
uw-nsl/SafeDecoding’s past year of commit activity - edc Public
Source Code for "EDC: Effective and Efficient Dialog Comprehension For Dialog State Tracking" (NAACL 2024)
uw-nsl/edc’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…