Mechanistic understanding and mitigation of language model non-factual hallucinations

Jan 1, 2024 · Lei Yu, Meng Cao, Jackie Chi Kit Cheung, Yue Dong

Type: Journal article
Publication: arXiv preprint arXiv:2403.18167