We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bad_token_ids
llmart has the capability of banning "bad" tokens from the adversarial optimization.
llmart
Right now bad_token_ids implements a static policy for what is considered a "bad" token (non-printability, ascii-only):
LLMart/src/llmart/tokenizer.py
Lines 428 to 444 in c7bbef3
Being able to add configurable policies would help with non-ascii languages. Additionally, being able to ban a set of tokens would also be beneficial.
The text was updated successfully, but these errors were encountered:
No branches or pull requests
llmart
has the capability of banning "bad" tokens from the adversarial optimization.Right now
bad_token_ids
implements a static policy for what is considered a "bad" token (non-printability, ascii-only):LLMart/src/llmart/tokenizer.py
Lines 428 to 444 in c7bbef3
Being able to add configurable policies would help with non-ascii languages. Additionally, being able to ban a set of tokens would also be beneficial.
The text was updated successfully, but these errors were encountered: