LLMs are the core artificial intelligence models for many business applications, such as AI agents, RAG-powered question-answering or customer service chatbots with automated text generation. Natural language processing (NLP) is the use of machine learning algorithms to understand and generate human language, and LLMs are a specific type of NLP model.
Notable LLMs include OpenAI’s GPT family—such as GPT-4o and GPT-3.5, some of the models behind ChatGPT—as well as Anthropic’s Claude, Google’s Gemini and Meta’s Llama 3. All LLMs are capable of handling complex tasks, but the specific needs of a machine learning project can help dictate the right LLM for the job.
Choosing the right LLM comes down to a range of factors including:
- Specific use case: The machine learning challenge directly affects the LLM selection process. One LLM might be better with lengthy document comprehension and summarization, while another might be easier to fine-tune for domain-specific uses.
- Performance: Just like other models, LLMs can be benchmarked against each other to evaluate performance. LLM benchmarks include metrics for reasoning, coding, math, latency, comprehension and general knowledge. Weighing the needs of a project versus benchmark performance can help determine the best LLM to choose for high-quality outputs.
- Open versus closed source: Open source models enable observers to monitor how the model reaches its decisions. Different LLMs can be prone to biases and hallucinations in various ways: when they generate predictions that do not reflect real-world outcomes. When content moderation and bias prevention are paramount, limiting choices to open source providers can help shape the LLM selection process.
- Resource use and cost: LLMs are resource-hungry models. Many LLMs are powered by hyperscale datacenters filled with hundreds of thousands of graphics processing units (GPUs) or more. LLM providers also charge differently for API connections to their models. The scalability of a model and its pricing system directly affects project scope.