llm_fit_finder

Active

Tool of IA-QA — 130+ QA & Dev Tools for AI Agents

declared in 1.0.0

Find the best LLM for a given use case. Compares 30+ cloud API models and 12+ local models by cost, speed, benchmarks, features and VRAM requirements. Returns ranked recommendations with cost simulation. No API key needed.

Parameters schema

{
  "type": "object",
  "properties": {
    "mode": {
      "type": "string",
      "description": "cloud (API models) or local (Ollama/self-hosted). Default: cloud"
    },
    "top_n": {
      "type": "number",
      "description": "Number of recommendations to return (default: 5)"
    },
    "vram_gb": {
      "type": "number",
      "description": "GPU VRAM in GB (only for mode=local). Default: 16"
    },
    "features": {
      "type": "array",
      "items": {
        "type": "string"
      },
      "description": "Required features: vision, function_calling, json_mode, streaming, reasoning"
    },
    "use_case": {
      "type": "string",
      "description": "Primary use case: chatbot | code | rag | summarization | classification | reasoning | agents | multilingual"
    },
    "max_budget": {
      "type": "number",
      "description": "Maximum monthly budget in USD (based on tokens_per_day)"
    },
    "quantization": {
      "type": "string",
      "description": "Quantization (only for mode=local): Q4_K_M | Q8_0 | FP16. Default: Q4_K_M"
    },
    "tokens_per_day": {
      "type": "number",
      "description": "Estimated daily token volume (default: 100000)"
    }
  }
}

What this tool wraps· 0 endpoints

min confidence0.70 0.50

No endpoints wrapped at confidence ≥ 0.70.

Parent server

IA-QA — 130+ QA & Dev Tools for AI Agents

https://github.com/jcjamet/ia-qa

1/7 registries

View full server →