Every major AI comparison in 2026 is written by someone in San Francisco with a corporate card. They benchmark on English creative writing, US-centric use cases, and dollar pricing — then call it a global analysis.
This one is different. I'm looking at GPT-5, Claude Opus 4.6, and Gemini 3 Pro specifically through the lens of what Indian developers care about: real ₹ cost, how to actually access these APIs without an international credit card, and latency from Mumbai and Bangalore. The benchmark scores matter, but they're the last thing I'd look at before making a decision.
The models at a glance
| Model | Maker | Context window | Input price (USD/1M) | Input price (INR/1M) | Int'l card required? | Free tier? |
|---|---|---|---|---|---|---|
| GPT-5 | OpenAI | 128K | $15.00 | ₹1,260 | Yes (via direct API) | No |
| Claude Opus 4.6 | Anthropic | 200K | $15.00 | ₹1,260 | No (via AICredits.in) | No |
| Claude Sonnet 4.6 | Anthropic | 200K | $3.00 | ₹252 | No (via AICredits.in) | No |
| Gemini 3 Pro | Google | 1M | $7.00 | ₹588 | Limited | Yes (Gemini Flash) |
| Gemini 3 Flash | Google | 1M | $0.075 | ₹6.30 | Limited | Yes |
A quick note on "limited" for Google: Google Pay India supports some Google Cloud billing, but the Gemini API billing has inconsistent acceptance of Indian payment methods. The free tier on Gemini Flash is real and generous — but the moment you need reliability or higher rate limits, you hit the same international card problem.
Quality benchmarks
Here are the headline benchmark scores for the flagship models as of early 2026:
| Benchmark | GPT-5 | Claude Opus 4.6 | Gemini 3 Pro |
|---|---|---|---|
| SWE-bench (coding) | 72% | 74% | 65% |
| MMLU (reasoning) | 91.2% | 90.8% | 89.7% |
| HumanEval (code) | 92% | 94% | 86% |
| MATH | 91% | 88% | 87% |
What benchmarks miss: Every one of these benchmarks is run in English, on English training data, under controlled conditions. They measure what models can do, not what they reliably do in production with real user prompts. Claude consistently outperforms its benchmarks on instruction following in my experience; GPT-5 benchmarks well but can be inconsistent on ambiguous prompts.
Honest verdict by use case:
- Best for code generation: Claude Opus 4.6 (best HumanEval, strong instruction following)
- Best for reasoning chains: GPT-5 (fastest, strong MMLU, good for chain-of-thought)
- Best for long document processing: Gemini 3 Pro (1M context is genuinely different from 200K)
- Best cost/quality balance: Claude Sonnet 4.6 or Gemini 3 Flash
Real ₹ cost breakdown for Indian developers
The numbers below use AICredits.in pricing for Claude models (10-11% markup over direct USD), approximate direct API pricing for GPT-5 (international card required), and Google Cloud billing for Gemini. Output tokens are typically 3-5x more expensive than input tokens.
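If you want to sanity-check the tables below or plug in your own volumes, the arithmetic is just tokens times rate. Here's a minimal sketch, assuming the INR-per-million rates from the table above and a flat 4x output-price multiplier (actual multipliers vary by provider, roughly 3-5x):

```python
def monthly_cost_inr(input_m_tokens: float, output_m_tokens: float,
                     inr_per_m_input: float, output_multiplier: float = 4.0) -> float:
    """Rough monthly API spend in INR; token volumes are given in millions."""
    input_cost = input_m_tokens * inr_per_m_input
    output_cost = output_m_tokens * inr_per_m_input * output_multiplier
    return input_cost + output_cost

# Startup-scale row from the table below: 15M input + 15M output tokens/month
# at GPT-5 / Claude Opus 4.6 rates (₹1,260 per 1M input tokens).
print(monthly_cost_inr(15, 15, 1260))  # ≈ ₹94,500
```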
For a side project
10K tokens/day (input + output), 30 days; at this volume the input/output split barely matters, so costs are approximated at input rates:
| Model | Monthly tokens | Monthly cost (INR) |
|---|---|---|
| Gemini 3 Flash (free tier) | 300K | ₹0 (within free limits) |
| Claude Sonnet 4.6 (AICredits) | 300K | ₹76 |
| Gemini 3 Flash (paid) | 300K | ₹2 |
| GPT-5 (direct, with int'l card) | 300K | ₹378 |
| Claude Opus 4.6 (AICredits) | 300K | ₹378 |
At this scale, model choice barely registers as a cost. Pick based on quality.
For a startup product
500K input tokens/day plus a roughly equal volume of output, 30 days:
| Model | Monthly input tokens | Input cost (INR) | Output cost (INR) | Total (INR) |
|---|---|---|---|---|
| Gemini 3 Flash | 15M | ₹95 | ₹378 | ~₹473 |
| Claude Sonnet 4.6 (AICredits) | 15M | ₹3,780 | ₹18,900 | ~₹22,680 |
| Gemini 3 Pro | 15M | ₹8,820 | ₹35,280 | ~₹44,100 |
| GPT-5 | 15M | ₹18,900 | ₹75,600 | ~₹94,500 |
| Claude Opus 4.6 (AICredits) | 15M | ₹18,900 | ₹75,600 | ~₹94,500 |
Note: the table assumes equal input and output volumes (15M tokens each), with output priced at roughly 4-5x the input rate per each provider's list pricing. Your actual input/output mix depends on response length.
At startup scale, the difference between Gemini Flash and Claude Sonnet 4.6 is ₹22,000/month. That's meaningful but manageable. The difference between Gemini Flash and the flagship models (GPT-5, Claude Opus) is ₹90,000+/month — which starts to be a real product decision.
For enterprise
At 10M tokens/day, you should be negotiating enterprise contracts directly with providers rather than paying listed API prices. OpenAI, Anthropic, and Google all offer volume pricing at this scale — typically 30-50% below list. Contact their enterprise sales teams; the process takes 2-4 weeks but the savings are significant.
The exception: if you need to stay in INR (for accounting, GST compliance, or avoiding forex risk), a gateway like AICredits.in may still be preferable even at enterprise volume. INR invoices from an Indian company are often easier to process than foreign invoices for Indian startups.
Accessibility without an international credit card
This is the part most comparison posts skip entirely.
GPT-5: Requires an international credit card to access OpenAI's API directly. No UPI, no domestic card, no net banking. There are workarounds (Wise card, virtual international cards), but none are as clean as UPI.
Claude Opus 4.6 / Sonnet 4.6: Same direct billing situation as OpenAI. However, both are available through AICredits.in with UPI payment. Minimum top-up ₹100, works with GPay, PhonePe, Paytm, and every Indian payment method.
Gemini 3 Pro / Flash: Google Cloud billing accepts some Indian payment methods, but acceptance is inconsistent — domestic credit cards sometimes work, UPI rarely works for API billing. The free tier (Gemini Flash, up to 1M tokens/day) is accessible without any billing setup, which makes it genuinely useful for prototyping.
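To make the access difference concrete, here's what calling Claude through a gateway typically looks like, assuming the gateway exposes an OpenAI-compatible endpoint. The base URL and model ID below are placeholders, not confirmed AICredits.in values; check the gateway's own docs.

```python
from openai import OpenAI

# Placeholder endpoint and model ID; assumes an OpenAI-compatible gateway.
client = OpenAI(
    base_url="https://api.example-gateway.in/v1",  # hypothetical gateway URL
    api_key="YOUR_GATEWAY_KEY",                    # key funded via UPI top-up
)

response = client.chat.completions.create(
    model="claude-sonnet-4.6",  # model ID as exposed by the gateway
    messages=[{"role": "user", "content": "Summarise this GST invoice in two lines."}],
)
print(response.choices[0].message.content)
```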
Try it now with AICredits.in
Access GPT-5, Claude Opus 4.6, Gemini 3, and 300+ models with UPI payment in ₹. No international card needed. Create free account →
Latency from India
API calls from a server in Mumbai or Bangalore to each provider's nearest endpoint:
| Model | Typical first token (ms) | Nearest endpoint |
|---|---|---|
| Gemini 3 Flash | 400–700ms | Mumbai (Google Cloud asia-south1) |
| Gemini 3 Pro | 800–1,400ms | Mumbai |
| Claude Sonnet 4.6 (via AICredits) | 1,500–2,000ms | US-based + gateway hop |
| GPT-5 (via direct API) | 1,200–2,000ms | US-based |
| Claude Opus 4.6 (via AICredits) | 2,500–4,000ms | US-based + gateway hop |
Gemini's clear advantage here comes from Google having inference infrastructure in Mumbai. Anthropic and OpenAI route Indian traffic through US data centers, which adds 200-300ms of round-trip latency before any model computation happens. The AICredits gateway adds another ~50-100ms on top.
For real-time user-facing features — search autocomplete, live transcription, conversational UI — Gemini's latency advantage is real and matters. For batch processing, RAG pipelines, or async agent tasks, the difference is irrelevant.
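These numbers are easy to reproduce yourself. Below is a minimal time-to-first-token check, assuming an OpenAI-compatible streaming endpoint; swap in whichever base URL and model ID you actually use, and run it from a Mumbai or Bangalore host.

```python
import time
from openai import OpenAI

client = OpenAI()  # point base_url/api_key at whichever provider or gateway you use

def time_to_first_token(model: str, prompt: str) -> float:
    """Seconds from request start until the first streamed content chunk arrives."""
    start = time.perf_counter()
    stream = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        stream=True,
    )
    for chunk in stream:
        if chunk.choices and chunk.choices[0].delta.content:
            return time.perf_counter() - start
    return time.perf_counter() - start

print(time_to_first_token("gpt-5", "ping"))  # model ID here is a placeholder
```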
Coding performance — Indian developer use cases
I tested two tasks specifically relevant to Indian developer workflows.
Test 1: Razorpay webhook handler in Python
Prompt: "Write a Flask endpoint that handles Razorpay payment webhooks. Verify the webhook signature, handle payment.captured and payment.failed events, and log them to a PostgreSQL database."
- Claude Opus 4.6: Produced a complete, correct implementation using `hmac` for signature verification, SQLAlchemy for the database layer, and proper error handling for each event type (a stripped-down sketch of that verification core follows after this list). The code was production-ready on first pass.
- GPT-5: Also correct, but the signature verification used a slightly outdated Razorpay SDK pattern. Required one revision.
- Gemini 3 Pro: Correct logic, but the PostgreSQL schema it defined wasn't normalized — it stored event data as a JSON blob without proper indexing. Fine for a prototype, not for production.
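For reference, the signature-verification core all three models were judged on looks roughly like this. It's a minimal sketch, not the full production version any model produced: Razorpay sends an HMAC-SHA256 hex digest of the raw request body in the X-Razorpay-Signature header, and the route path and environment variable name here are illustrative.

```python
import hashlib
import hmac
import os

from flask import Flask, abort, request

app = Flask(__name__)
WEBHOOK_SECRET = os.environ["RAZORPAY_WEBHOOK_SECRET"]  # illustrative env var name

@app.route("/razorpay/webhook", methods=["POST"])
def razorpay_webhook():
    # Razorpay signs the raw body with HMAC-SHA256 using the webhook secret
    # and sends the hex digest in the X-Razorpay-Signature header.
    received = request.headers.get("X-Razorpay-Signature", "")
    expected = hmac.new(
        WEBHOOK_SECRET.encode(), request.get_data(), hashlib.sha256
    ).hexdigest()
    if not hmac.compare_digest(received, expected):
        abort(400)

    event = request.get_json()
    if event["event"] == "payment.captured":
        ...  # persist to PostgreSQL, e.g. via SQLAlchemy
    elif event["event"] == "payment.failed":
        ...  # log the failure
    return "", 200
```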
Test 2: SQL query for NSE stock screener
Prompt: "Write a SQL query that finds all NSE-listed stocks where the 50-day moving average crossed above the 200-day moving average in the last 5 trading days, using a daily_prices table with columns: symbol, date, close, volume."
- GPT-5: Got the golden cross logic right on the first try, including the window functions. Clean query.
- Claude Opus 4.6: Correct, added a subquery for better readability, included a comment explaining the golden cross definition.
- Gemini 3 Pro: Produced a working query but had an off-by-one error in the date range filter. Correct after one revision.
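For context, the golden-cross logic the models were graded on can be expressed with window functions roughly as below. This is an illustrative sketch run through psycopg2 (the connection string is a placeholder), not the exact query any of the models returned.

```python
import psycopg2  # assumes a PostgreSQL daily_prices table as described in the prompt

GOLDEN_CROSS_SQL = """
WITH ma AS (
    SELECT symbol, date,
           AVG(close) OVER (PARTITION BY symbol ORDER BY date
                            ROWS BETWEEN 49 PRECEDING AND CURRENT ROW)  AS ma50,
           AVG(close) OVER (PARTITION BY symbol ORDER BY date
                            ROWS BETWEEN 199 PRECEDING AND CURRENT ROW) AS ma200
    FROM daily_prices
),
crossings AS (
    SELECT symbol, date, ma50, ma200,
           LAG(ma50)  OVER (PARTITION BY symbol ORDER BY date) AS prev_ma50,
           LAG(ma200) OVER (PARTITION BY symbol ORDER BY date) AS prev_ma200
    FROM ma
)
SELECT DISTINCT symbol
FROM crossings
WHERE ma50 > ma200
  AND prev_ma50 <= prev_ma200
  AND date IN (SELECT DISTINCT date FROM daily_prices
               ORDER BY date DESC LIMIT 5);
"""

conn = psycopg2.connect("dbname=markets")  # placeholder connection string
with conn, conn.cursor() as cur:
    cur.execute(GOLDEN_CROSS_SQL)
    print(cur.fetchall())
```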
Both GPT-5 and Claude Opus 4.6 are strong for code that touches Indian financial systems or payment infrastructure. The models trained on more Indian developer content — Stack Overflow questions about Razorpay, NSE data schemas, GST computation — handle these natively.
The honest recommendation
Best for most Indian developers: Claude Sonnet 4.6 via AICredits.in
₹252/M input, accessible with UPI, excellent code quality, 200K context. It's not the cheapest or the most capable, but it's the most consistent and the easiest to actually access in India. Its prompt library has copy-paste templates for common patterns that will cut your token usage by 20-30% on most tasks.
Best if cost is everything: Gemini 3 Flash free tier
1M tokens/day for free, with Google's Mumbai infrastructure for low latency. Use it for prototypes, dev tooling, or any application where model quality is secondary to availability and cost. The free tier is real and maintained.
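Getting started on the free tier is a few lines with the google-generativeai SDK. The model ID below follows this article's naming and is a placeholder; use whatever ID Google's docs list for the current Flash model.

```python
import os

import google.generativeai as genai

genai.configure(api_key=os.environ["GOOGLE_API_KEY"])  # free-tier key from AI Studio

model = genai.GenerativeModel("gemini-3-flash")  # placeholder model ID
response = model.generate_content("Draft a polite payment-reminder SMS in Hinglish.")
print(response.text)
```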
Best for serious production work on a USD budget: GPT-5 for speed-critical applications, Claude Opus 4.6 for quality-critical ones
If your company has international billing sorted, GPT-5 is faster and has the largest ecosystem of integrations. Claude Opus is the better choice when code correctness and instruction following matter more than speed.
Next steps
- AICredits.in review — the full breakdown of INR billing for AI APIs
- DeepSeek R1 vs Claude India 2026 — if you're considering open-source models for cost reduction
- Claude Code in India — no credit card — the fastest way to get agentic coding running with UPI payment



