🤔 Does num_batch do anything? If so what and how can we use it? I only know that setting it a value like 4096 makes the model so large that it does not even load at all into my CPU, but ends up 100% ...
Batch processing of embeddings is not supported in this fork yet. See this PR of the original binding which added multi-sequence support for embeddings.
一些您可能无法访问的结果已被隐去。
显示无法访问的结果