llm_generate
Active
Tool of IA-QA — 130+ QA & Dev Tools for AI Agents
Generate text using open-source LLMs hosted on Groq (ultra-fast) or HuggingFace Inference (serverless). No API key is required; the server provides its own keys. Supported models include Qwen3 32B, Gemma 4 27B, Gemma 3 27B, Llama 3.3 70B, Llama 4 Scout, DeepSeek R1, Mistral Small 24B, and more. Use list_llm_models to see the full catalog. Requests are rate-limited to prevent abuse.
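As a rough sketch of how a caller might assemble arguments for this tool, the Python helper below applies the defaults and limits stated in the parameter descriptions under Parameters schema (default model "qwen/qwen3-32b", max_tokens default 2048 and max 4096, temperature 0.0–1.5 with default 0.7). The helper name build_llm_generate_args is hypothetical, and two checks are assumptions not stated in the listing: that an empty prompt should be rejected, and that max_tokens must be at least 1. The server may enforce these limits differently.

```python
from typing import Any, Dict, Optional

# Defaults and limits copied from the parameter descriptions in the schema.
DEFAULT_MODEL = "qwen/qwen3-32b"
DEFAULT_MAX_TOKENS = 2048
MAX_TOKENS_LIMIT = 4096
DEFAULT_TEMPERATURE = 0.7
TEMPERATURE_MIN, TEMPERATURE_MAX = 0.0, 1.5


def build_llm_generate_args(
    prompt: str,
    model: str = DEFAULT_MODEL,
    system: Optional[str] = None,
    max_tokens: int = DEFAULT_MAX_TOKENS,
    temperature: float = DEFAULT_TEMPERATURE,
) -> Dict[str, Any]:
    """Hypothetical helper: check inputs client-side, return an arguments object."""
    # "prompt" is the only required field; rejecting empty text is an assumption.
    if not isinstance(prompt, str) or not prompt.strip():
        raise ValueError("prompt is required and must be a non-empty string")
    # The upper bound comes from the schema; the lower bound of 1 is an assumption.
    if not 1 <= max_tokens <= MAX_TOKENS_LIMIT:
        raise ValueError(f"max_tokens must be between 1 and {MAX_TOKENS_LIMIT}")
    if not TEMPERATURE_MIN <= temperature <= TEMPERATURE_MAX:
        raise ValueError(
            f"temperature must be between {TEMPERATURE_MIN} and {TEMPERATURE_MAX}"
        )

    args: Dict[str, Any] = {
        "prompt": prompt,
        "model": model,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }
    # "system" is optional, so it is only included when provided.
    if system is not None:
        args["system"] = system
    return args


example_args = build_llm_generate_args(
    "Summarize the failing test cases in two sentences.",
    system="You are a QA assistant.",
    temperature=0.2,
)
print(example_args)
```

Validating on the client side like this mainly saves a rate-limited request; the server remains the authority on which values it accepts.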
Parameters schema
{
  "type": "object",
  "required": [
    "prompt"
  ],
  "properties": {
    "model": {
      "type": "string",
      "description": "Model ID (default: \"qwen/qwen3-32b\"). Use list_llm_models tool with provider \"Groq\" or \"HuggingFace\" to see available models."
    },
    "prompt": {
      "type": "string",
      "description": "The user prompt / instruction to send to the model"
    },
    "system": {
      "type": "string",
      "description": "Optional system prompt to set context or persona"
    },
    "max_tokens": {
      "type": "number",
      "description": "Maximum tokens to generate (default: 2048, max: 4096)"
    },
    "temperature": {
      "type": "number",
      "description": "Sampling temperature 0.0–1.5 (default: 0.7)"
    }
  }
}
Parent server
IA-QA — 130+ QA & Dev Tools for AI Agents
https://github.com/jcjamet/ia-qa
1/7 registries
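The listing does not say which protocol the parent server speaks. Assuming it is a Model Context Protocol (MCP) server, a call to llm_generate would typically be framed as a JSON-RPC 2.0 request with the method tools/call. The sketch below only builds that request body; the function name make_tools_call is hypothetical, and transport details (stdio, HTTP, session setup) are left out because the listing does not describe them. The "provider" argument for list_llm_models is inferred from the model parameter description and may not match the server's actual schema.

```python
import json
from itertools import count
from typing import Any, Dict

# Simple incrementing request IDs; JSON-RPC only requires them to be unique per session.
_request_ids = count(1)


def make_tools_call(name: str, arguments: Dict[str, Any]) -> str:
    """Hypothetical helper: serialize an MCP-style tools/call request (assumed protocol)."""
    request = {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }
    return json.dumps(request)


# Discover model IDs first; the "provider" argument name is an assumption.
list_request = make_tools_call("list_llm_models", {"provider": "Groq"})

# Then generate text; only "prompt" is required by the schema.
generate_request = make_tools_call(
    "llm_generate",
    {
        "prompt": "Write three edge-case tests for a login form.",
        "model": "qwen/qwen3-32b",
        "max_tokens": 512,
    },
)

print(list_request)
print(generate_request)
```

In practice an MCP client library would normally build and send these messages, so this is mostly useful for seeing what goes over the wire.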