stecas committed on Aug 24, 2024

Commit

30ec6ae

verified

1 Parent(s): 0a92d7a

Update README.md

Files changed (1)

README.md CHANGED

@@ -25,6 +25,8 @@ See our [GitHub:](https://github.com/aengusl/latent-adversarial-training).
 Read the paper on arXiv: [Targeted Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMs](https://arxiv.org/abs/2407.15549).
+Chat with our robust refusal model ([https://huggingface.co/LLM-LAT/robust-llama3-8b-instruct](https://huggingface.co/LLM-LAT/robust-llama3-8b-instruct)) at [https://www.abhayesian.com/lat-chat](https://www.abhayesian.com/lat-chat).
 ```
 @article{sheshadri2024targeted,
   title={Targeted Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMs},