Is your feature request related to a problem? Please describe. I am using LiteLLM models for agents and would like to use the same models for eval judges. atm, it appears only Google API models are ...
When evaluating text in other languages (e.g., Thai, etc.), the eval logic incorrectly returns mismatches (Match score: 0)— even when the evaluated expression ...
一些您可能无法访问的结果已被隐去。
显示无法访问的结果