Back to marketplace
Compare different LLM responses
CREATED BY
36 Templates
9.8k Views
LAST UPDATED
November 11, 2025SOLUTION
Engineering
INTEGRATIONS USED
DESCRIPTION
LLM Response Comparison Tool
What does this flow do?
Compare responses generated by different large language models (LLMs) based on the same input prompts and/or uploaded files.
Here's how it works:
-
Input the prompt for which you want to compare outputs.
-
Optional: upload a file (if part of your prompt)
-
The LLM Comparison subflow runs the same prompt through 8 different LLM models. By default, the model tests:
-
GPT 4.1 Mini
-
GPT 5 Mini
-
OpenAI o4-mini
-
OpenAI o3-mini
-
Claude 4.5 Haiku
-
DeepSeek V3
-
Gemini 2.5
-
Grok 3 Mini
-
When to use
This tool helps you compare outputs and credit costs from different AI providers or model versions, helping you decide which models to use in different situations.
How to customize
- Change which models are being tested: Go into the "LLM Comparison" subflow and add a new node (or change an existing node) to test the model you're interested in. Don't forget to change the text in the "Combine Text" node and/or add new inputs if necessary.
HOW DO YOU SET THIS UP?
1.
Optional: configure the LLMs you want to test
You can do this by going into the LLM Comparison subflow. Don't forget the edit the Combine Text node.
2.
Input your prompt
Enter the prompt that you want each LLM to process.
3.
Optional: upload a file
If a file is part of your prompt, you can upload it here.
Hand-picked by the Gumloop team

