"I Don't Understand LLM Internals": Good. You Don't Need To.
Feeling left behind by the LLM wave because you're not a deep learning expert? Good news: Your existing engineering skills + API calls are all you need.
TL;DR:
You don't need deep AI theory knowledge to build valuable AI applications; focus on effectively using existing tools and APIs.
Think of AI proficiency in four levels (Pioneers, Engineers, Developers, Users); this post is targeted towards Level 3: AI Developers (engineers integrating AI).
Leverage powerful APIs with techniques like prompt engineering, Retrieval-Augmented Generation (RAG), and In-Context Learning (ICL) – you often don't need custom fine-tuning, massive datasets, or expensive hardware.
The most valuable skill for AI Developers is framing problems correctly and understanding what LLMs can achieve, not their internal mechanics.
Note: For the rest of this post, I’ll use the term AI interchangeably with GenAI and LLMs for brevity.
The buzz around AI can feel overwhelming. I often get messages from friends and colleagues asking, "Do I need to learn neural networks and deep learning to work with LLMs?" Some developers dive into deploying open-source models locally, only to find the results underwhelming compared to state-of-the-art APIs. Others get stopped before they even start, worrying about costs: "Will I need a farm of expensive GPUs?" or "Are API bills going to break the bank?"
I've noticed this misconception is widespread. It's like thinking you need to understand internal combustion engines before driving a car, or how TCP/IP works before building a website. This might keep many talented developers from exploring the AI space.
With terms like "embedding vectors", "fine-tuning", and "prompt engineering" flying around, it’s hard to know where to start or how deep you need to go. Let me offer a framework I've found helpful.
The Four Levels of AI Engagement
I see engaging with AI at four distinct levels:
Level 1: AI Pioneers: These are the deep researchers, the academics, the theorists pushing the absolute boundaries. They develop novel architectures, create entirely new ML approaches, publish papers, and define the ethical guardrails.
Level 2: AI / ML Engineers: These are the ML engineers and data scientists who take the pioneering research and actually build, train, and optimize foundation models (like GPT, Claude, and Gemini). They fine-tune models, optimize inference, manage training infrastructure, and work at places like OpenAI, Google, Anthropic & Meta AI.
Level 3: AI Developers: This is where most software developers and product creators fit in. They take the powerful models created by Level 2 folks and integrate them into applications using APIs, build systems with Retrieval-Augmented Generation (RAG), implement prompting techniques, or maybe do some light fine-tuning. They solve real-world problems using AI as a tool.
Level 4: AI Users: This is basically everyone else. Professionals using AI tools (like ChatGPT, Copilot, Midjourney) to boost their productivity, students using them for research, artists for creation, and everyday people leveraging AI for convenience. This level requires knowing how to direct the tools, not technical expertise.
For most of us, especially engineers looking to add AI features to products, Level 3: AI Developers is the sweet spot.
So, You Want to Be an AI Developer (Level 3)? Let's Bust Some Myths.
Myth 1: "I need to understand how neural networks and transformers work to build applications with LLMs."
That's not the reality. As I said, it's like needing to understand the engine to drive the car. Having a good mental model of what LLMs are good at is far more important than knowing exactly how they work internally.
"Working with LLMs is like learning to be a good manager: you need to learn how to give clear instructions and interpret the results, not understand how your team members' brains work."
— Simon Willison
You need to understand the capabilities and limitations – things like context window limits (the amount of information the model can consider at once), the potential for hallucination (the model confidently making things up), and that they don't inherently know real-time information.
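To make that concrete, here's a minimal sketch of capability-aware coding: checking that your input fits the context window before you send it. The window size and tokenizer choice below are assumptions; check your provider's documentation for your model's actual numbers.

```python
# A rough pre-flight check that input fits the context window.
# The window size here is an assumption; look up your model's real limit.
import tiktoken

CONTEXT_WINDOW = 128_000  # tokens; varies by model

def fits_in_context(text: str, reserved_for_output: int = 1_000) -> bool:
    """True if `text` leaves enough room for the model's response."""
    enc = tiktoken.get_encoding("cl100k_base")  # an approximation for many models
    return len(enc.encode(text)) + reserved_for_output <= CONTEXT_WINDOW
```

That's the level of understanding you need: practical guardrails, not gradient math.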
Myth 2: "I need my own beefy GPU setup or a dedicated AI server farm."
That's absolutely not the case, especially when starting out. You can prototype and build significant applications entirely using hosted APIs. I built my first LLM application using nothing but OpenAI's API and a bit of Python, running locally. No GPU required.
Services like OpenRouter (an aggregator for various models), GitHub Models, and Google AI Studio / Gemini API provide access to powerful models for FREE, with no GPU or dedicated infrastructure required. The catch: they might use your data for training or research, which is fine for hobby projects or while learning. If that's a concern, you can opt for their paid offerings.
There are other hosted services as well. All of these platforms handle the massive infrastructure for you. You just send API requests.
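To show how little code this takes, here's a minimal sketch of calling a hosted model through OpenRouter's OpenAI-compatible endpoint. The model ID is just an example, not a recommendation; pick whichever fits your budget.

```python
# A minimal sketch of calling a hosted model with no local GPU.
# Assumes the `openai` Python package and an OpenRouter account.
from openai import OpenAI

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",  # OpenRouter is OpenAI-compatible
    api_key="YOUR_OPENROUTER_API_KEY",
)

response = client.chat.completions.create(
    model="meta-llama/llama-3.1-8b-instruct",  # example model ID
    messages=[{"role": "user", "content": "Summarize RAG in two sentences."}],
)
print(response.choices[0].message.content)
```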
Myth 3: "I must train or fine-tune my own custom model to get any real value."
Not necessarily, and in many cases it's not even desirable. This used to be true, but the power of today's foundation models combined with clever prompting techniques often gets you further, faster, and cheaper. Prompt engineering coupled with In-Context Learning (ICL) is surprisingly potent. ICL is the model's ability to learn from examples provided directly within the prompt, without changing its internal weights or re-training it through fine-tuning.
Fine-tuning still has its place for deep specialization, but consider the downsides:
Cost & Complexity: It requires curated datasets, compute resources for training, and ongoing maintenance.
Scalability Issues: Do you really want to fine-tune every time a significantly better base model comes out (which is happening constantly)? Maintaining multiple fine-tuned models gets complex and expensive quickly.
ICL Can Be Better: For many tasks, providing good examples in the prompt (few-shot or many-shot ICL) can match or even beat fine-tuning, especially when combined with Retrieval-Augmented Generation (RAG), which gives the model access to relevant, up-to-date knowledge it wasn't trained on.
Start with great prompting and RAG; that should cover 99% of your use cases. Only consider fine-tuning if you hit a performance ceiling that simpler methods can't overcome.
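Here's a minimal sketch of few-shot ICL in practice: a handful of labeled examples in the prompt, no training run anywhere. It reuses the `client` from the earlier sketch, and the ticket-classification task and labels are made up for illustration.

```python
# A minimal sketch of in-context learning: a few labeled examples in the
# prompt stand in for fine-tuning. `client` is defined in the Myth 2 sketch.
FEW_SHOT_PROMPT = """Classify the support ticket as 'billing', 'bug', or 'other'.

Ticket: "I was charged twice this month."
Label: billing

Ticket: "The export button crashes the app."
Label: bug

Ticket: "{ticket}"
Label:"""

def classify(ticket: str) -> str:
    response = client.chat.completions.create(
        model="meta-llama/llama-3.1-8b-instruct",  # same example model
        messages=[{"role": "user", "content": FEW_SHOT_PROMPT.format(ticket=ticket)}],
    )
    return response.choices[0].message.content.strip()

print(classify("How do I update my credit card?"))  # expected: billing
```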
Myth 4: "If I don't code in Python, I'm out of luck."
The reality is that while Python has a rich ecosystem (hello, LangChain), it's not the only game in town. Most major API providers offer SDKs in multiple languages. We now have excellent libraries like LangChain.js for JavaScript, ruby-openai for Ruby, LangChain4j for Java, and more.
And beyond code, there are increasingly capable low-code/no-code platforms like Flowise and Langflow that integrate LLM capabilities, allowing you to build flows visually.
Myth 5: "This is going to cost me hundreds or thousands in API bills!"
It can, but it doesn't have to if you design for cost-efficiency from day one. I recently built a document processing app analyzing hundreds of pages daily that costs less than $10/month in API fees due to careful optimization.
Start cheap: Use less expensive but capable models (like Gemini 2.5 Flash, DeepSeek R1, or GPT-4.1) for initial development. Only upgrade when needed.
Prompt caching: Most API providers support prompt caching, which can reduce your costs by up to 90%.
Cache results: Don't call the API for the exact same query repeatedly; see the sketch after this list.
Set token limits: Be mindful of input and output lengths.
Use RAG: Retrieval Augmented Generation can reduce the amount of context you need to stuff into prompts.
Monitor usage: Keep an eye on your API dashboards and configure billing alerts.
Evaluate carefully: Ensure the LLM is actually adding value commensurate with its cost.
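As promised above, here's a minimal sketch of client-side result caching, with a token limit thrown in for good measure. It reuses the `client` from the earlier sketch; a production version would likely use a persistent store like Redis with an expiry policy.

```python
# A minimal sketch of client-side result caching: the exact same query never
# hits the API twice. `client` is defined in the Myth 2 sketch.
_cache: dict[tuple[str, str], str] = {}

def cached_completion(model: str, prompt: str) -> str:
    key = (model, prompt)
    if key not in _cache:
        response = client.chat.completions.create(
            model=model,
            messages=[{"role": "user", "content": prompt}],
            max_tokens=500,  # also applies the 'set token limits' tip above
        )
        _cache[key] = response.choices[0].message.content
    return _cache[key]
```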
Myth 6: "AI development requires a deep understanding of complex algorithms and mathematical concepts."
That's not the reality for Level 3: AI Developers. While complex algorithms are the bedrock built by Level 1 & 2 folks, we can build incredibly useful applications by mastering the application layer. You can leverage prompt engineering and RAG here. Function Calling / Tool Use is another useful technique that allows the model to interact with external APIs & tools to get real-time information or perform actions.
Focusing on how to effectively implement these techniques and integrate them into your software is far more valuable than deriving backpropagation formulas.
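For a taste of what that looks like, here's a minimal sketch of function calling in the OpenAI-style tools format. The `get_weather` function and its schema are hypothetical, and not every model supports tool use, so check your provider's docs. It reuses the `client` from the earlier sketch.

```python
# A minimal sketch of function calling / tool use: the model decides when to
# call your function and with what arguments. `get_weather` is hypothetical.
tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

response = client.chat.completions.create(
    model="meta-llama/llama-3.1-8b-instruct",  # needs a tool-capable model
    messages=[{"role": "user", "content": "Do I need an umbrella in Oslo?"}],
    tools=tools,
)

tool_calls = response.choices[0].message.tool_calls
if tool_calls:  # the model chose to call our function
    print(tool_calls[0].function.name, tool_calls[0].function.arguments)
```

Notice there's no math here at all: you describe a function, the model asks you to run it, and you feed the result back. It's plumbing, not calculus.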
Myth 7: "You need massive datasets to train your own models for any useful application."
"I need terabytes of data just to get started, right?" Wrong. While foundational models do require massive datasets (that's OpenAI's or Google's problem, not yours as a Level 3 dev), adapting these models often doesn't.
And remember, for many applications, leveraging pre-trained models via APIs combined with prompt engineering and RAG is entirely sufficient, bypassing the need for custom training data altogether.
Myth 8: "Building AI applications is only for large companies with significant resources."
"This AI stuff looks fancy and expensive. Surely only Big Tech can afford it?" That might have been true a few years ago, but the landscape has dramatically changed. Democratizing factors include:
Accessible APIs: Major model providers (OpenAI, Google, Anthropic, etc.) offer powerful models via APIs with pay-as-you-go pricing, often with generous free tiers to start.
Competitive Pricing: Costs are falling, and aggregators like OpenRouter provide access to various models at different price points. You can even find tools to estimate costs.
Open Source: A thriving ecosystem of open-source models (e.g., Llama, Mistral) and frameworks (LangChain, LlamaIndex) lowers the barrier to entry.
Cloud Platforms: Services like Hugging Face, Google Vertex AI, or AWS Bedrock simplify deployment and management.
While building a foundational model is resource-intensive, building on top of them is increasingly feasible for individuals, startups, and small teams. Focus on cost optimization (Myth 5) and clever application design.
Myth 9: "The AI field is moving too fast, and anything I learn now will quickly become obsolete."
"It feels like there's a new model or technique every week! How can I possibly keep up? Won't my skills be useless next year?" It's a valid concern given the pace, but foundational knowledge and skills have surprising longevity.
While specific model names or the hottest new framework might change, the underlying principles remain relevant:
Core Technical Skills: Proficiency in programming (especially Python), data handling, APIs, and cloud basics are always valuable.
Understanding Capabilities: Developing a strong mental model of what LLMs can do, their strengths (reasoning, generation) and weaknesses (hallucination, knowledge cutoffs), is crucial for effective application.
Problem-Solving & Design: Knowing how to break down a problem and design a system using AI components (like RAG, function calling) is a durable skill.
Prompt Engineering: While the specifics evolve, the core principles of clear communication and providing context remain key.
Think of it like web development: frameworks and libraries change constantly, but understanding HTML, CSS, JavaScript, HTTP, and good design principles provides a stable foundation. The most important skill is learning how to learn and adapt.
Getting Started as an AI Developer (Level 3)
So, how do you actually begin?
Choose Your API Provider: Start with a reliable hosted API rather than self-hosting. Many have free tiers or credits to get you started. OpenRouter and GitHub Models are good starting points.
Pick a Framework (optional but helpful): While you can use APIs directly, frameworks like LangChain or LlamaIndex (available in Python, JS, etc.) provide useful abstractions for common patterns like conversation history, RAG, structured outputs, and tool use. OpenRouter's documentation covers integration with the most popular frameworks.
Start Simple, Then Iterate: Don't try to build the next ChatGPT immediately. Pick a small, well-defined problem. Maybe a chatbot for specific FAQs, a summarizer, or a tool to extract specific data from text. Build a basic version, see how it works, and then enhance it.
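To give you a flavor of "start simple", here's a minimal sketch of a FAQ answerer: crude keyword retrieval plus prompt stuffing, a poor man's RAG. The FAQ entries and scoring are purely illustrative, and a real system would use embeddings and a vector store. It reuses the `client` from the earlier sketch.

```python
# A minimal sketch of a small, well-defined problem: answer FAQs by stuffing
# the most relevant entries into the prompt. `client` is defined in the
# Myth 2 sketch; the FAQ content below is illustrative.
FAQS = [
    "Refunds are processed within 5 business days.",
    "We ship to the EU, UK, and US only.",
    "Support hours are 9am-5pm CET, Monday to Friday.",
]

def retrieve(question: str, k: int = 2) -> list[str]:
    """Rank FAQ entries by crude word overlap with the question."""
    words = set(question.lower().split())
    return sorted(FAQS, key=lambda f: -len(words & set(f.lower().split())))[:k]

def answer(question: str) -> str:
    context = "\n".join(retrieve(question))
    prompt = f"Answer using ONLY this context:\n{context}\n\nQuestion: {question}"
    response = client.chat.completions.create(
        model="meta-llama/llama-3.1-8b-instruct",
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content

print(answer("How long do refunds take?"))
```

Build something this basic first, then swap the keyword matching for proper embeddings once you've proven the idea works.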
Learning More About Prompting
Learn the Mental Models, Not the Math: Focus on what problems LLMs are good at and how to prompt them effectively. This comes from practice and experimentation. It's like learning to cook – you get better by trying recipes and adjusting ingredients, not by studying the chemistry of Maillard reactions.
Leading providers like OpenAI, Anthropic, and Google all publish prompt engineering guides in their official documentation; these are great places to learn more.
The Real Challenge: Problem Framing
The hardest part often isn't the code; it's framing the problem correctly. Understanding what LLMs can and can't do well is more valuable than knowing the model internals.
Ask yourself:
Is this task about recalling specific knowledge (good for RAG) or general reasoning?
Does it need multi-step thought or complex instruction following?
Does the AI need memory of past interactions?
Is the goal information retrieval, content creation, decision support, or autonomous action?
Answering these helps design your system and prompts effectively.
Conclusion: Just Start Building
I don't really have a silver bullet or perfect learning path. The field moves too fast. But I'm confident you'll learn more from building one simple application than from reading a dozen research papers.
Leverage the incredible work done by the Pioneers and AI Engineers through APIs. You're much better off starting with APIs than trying to deploy and manage open-source models yourself.
So, my advice to you is: stop worrying about becoming a deep learning expert overnight. Focus on identifying a problem where an LLM could help, pick an API, and build a small proof-of-concept.
Your likes and comments motivate me to write more. If you found this article helpful, please share it with another curious mind.
Are there topics you'd like to read about? Let me know in the comments.
Have you tried dabbling with LLMs? What challenges are you facing? Leave a comment and I’d be happy to discuss them with you.