Persian to English: AI Translation Comparison

Updated 2026-03-10

Persian (Farsi) is spoken by approximately 110 million people across Iran, Afghanistan (as Dari), and Tajikistan (as Tajik), with significant diaspora communities in the United States, Canada, Germany, and the United Kingdom. It is an Indo-Iranian language written in a modified Arabic script (Perso-Arabic), with Tajik using Cyrillic. Persian features relatively simple grammar for an Indo-European language — no grammatical gender, no case marking on nouns, and a consistent SOV word order — but has a rich literary tradition, complex politeness registers (taarof), and extensive Arabic vocabulary integration. Translation demand is driven by diaspora communication, academic and literary research, legal and immigration documentation, business relations, and media.

This comparison evaluates five leading AI translation systems on Persian-to-English accuracy, naturalness, and suitability for different use cases.

Translation comparisons are based on automated metrics and editorial evaluation. Quality varies by language pair and content type.

Accuracy Comparison Table

| System | BLEU Score | COMET Score | Editorial Rating (1-10) | Best For |
| --- | --- | --- | --- | --- |
| Google Translate | 33.4 | 0.828 | 7.1 | General-purpose, free access |
| DeepL | 31.2 | 0.811 | 6.8 | Basic functionality |
| GPT-4 | 36.8 | 0.851 | 7.8 | Contextual understanding, literary quality |
| Claude | 35.1 | 0.839 | 7.4 | Long-form, academic content |
| NLLB-200 | 30.5 | 0.804 | 6.6 | Free, self-hosted option |
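
For readers who want to work with these figures programmatically, the following is a minimal sketch that loads the scores from the table and ranks the systems by each metric. The `Score` type and the `rank_by` helper are illustrative names, not part of any published tooling; the numbers are copied from the table as given.

```python
from typing import NamedTuple


class Score(NamedTuple):
    system: str
    bleu: float
    comet: float
    editorial: float


# Values copied from the accuracy comparison table.
SCORES = [
    Score("Google Translate", 33.4, 0.828, 7.1),
    Score("DeepL", 31.2, 0.811, 6.8),
    Score("GPT-4", 36.8, 0.851, 7.8),
    Score("Claude", 35.1, 0.839, 7.4),
    Score("NLLB-200", 30.5, 0.804, 6.6),
]


def rank_by(metric: str) -> list[str]:
    """Return system names ordered from best to worst on one metric."""
    ordered = sorted(SCORES, key=lambda s: getattr(s, metric), reverse=True)
    return [s.system for s in ordered]


if __name__ == "__main__":
    for metric in ("bleu", "comet", "editorial"):
        print(f"{metric:>9}: {' > '.join(rank_by(metric))}")
```

On this table, all three metrics produce the same ordering, which suggests the automated scores and the editorial ratings agree for this language pair.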

Example Translations

Legal Content

Source: “Dadgah-e ali-ye keshvar ra’y-e khood ra dar mowred-e e’teraz-e matrah-shodeh be hoghoogh-e malekiyyat-e fekri sader kard.”

| System | Translation |
| --- | --- |
| Google | The Supreme Court of the country issued its ruling on the objection raised regarding intellectual property rights. |
| DeepL | The country’s Supreme Court has issued its ruling on the objection to intellectual property rights. |
| GPT-4 | The Supreme Court has issued its ruling on the appeal filed regarding intellectual property rights. |
| Claude | The Supreme Court of the country has issued its ruling on the objection raised concerning intellectual property rights. |
| NLLB-200 | The Supreme Court of the country issued its ruling on the objection raised about intellectual property rights. |

Assessment: GPT-4 correctly renders “e’teraz-e matrah-shodeh” as “the appeal filed” rather than “the objection raised” — in a Supreme Court context, “appeal” is the appropriate English legal term. DeepL’s “objection to intellectual property rights” misrepresents the meaning (it is an objection concerning IP rights, not against them). Claude’s “concerning” is a natural legal English preposition for this context.

Casual Conversation

Source: “Salam, chetori? Kheili vaghte nadidemit. Biya berim ye chai bokhorim.”

| System | Translation |
| --- | --- |
| Google | Hello, how are you? I haven’t seen you for a long time. Come, let’s go have a tea. |
| DeepL | Hello, how are you? It’s been a long time since I’ve seen you. Come, let’s go have tea. |
| GPT-4 | Hey, how are you? It’s been forever since I’ve seen you. Come on, let’s go grab a tea. |
| Claude | Hello, how are you? I haven’t seen you for a very long time. Come, let’s go have a tea. |
| NLLB-200 | Hello, how are you? I haven’t seen you for a long time. Come, let’s go drink a tea. |

Assessment: GPT-4 captures the casual warmth best with “Hey,” “It’s been forever,” and “grab a tea.” Persian casual speech is warmer and more effusive than English, and GPT-4’s translation best bridges this cultural gap. The phrase “chai bokhorim” (literally “drink tea”) is a social invitation, and tea is central to Persian social interaction: “grab a tea” captures the casualness, while “drink a tea” (NLLB-200) sounds unnatural in English.

Technical Content

Source: “In sakhtar-e narm-afzari az me’mari-ye microservice estefadeh mikonad va ba komak-e containerha moghayas-paziri-ye bala ra taamin mikonad.”

| System | Translation |
| --- | --- |
| Google | This software structure uses microservice architecture and provides high scalability with the help of containers. |
| DeepL | This software architecture uses microservices and provides high scalability using containers. |
| GPT-4 | This software architecture utilizes a microservices-based design and achieves high scalability through containerization. |
| Claude | This software structure uses microservice architecture and provides high scalability with the help of containers. |
| NLLB-200 | This software structure uses microservice architecture and provides high scalability with the help of containers. |

Assessment: GPT-4 stands out with “containerization” (the correct technical term rather than “containers”) and “achieves high scalability through” (more natural than “provides…with the help of”). DeepL correctly uses “microservices” (plural) and has clean sentence flow. Google, Claude, and NLLB-200 produce identical but less polished translations with the awkward “with the help of.”
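
The observation that several systems return identical output can be checked mechanically. The sketch below uses only the Python standard library and the five technical-content translations quoted above; `group_identical` and `similarity` are hypothetical helper names chosen for illustration, and word-level `difflib` similarity is a rough overlap measure, not a substitute for BLEU or COMET.

```python
from collections import defaultdict
from difflib import SequenceMatcher

# System outputs copied from the technical-content example.
OUTPUTS = {
    "Google": "This software structure uses microservice architecture and "
              "provides high scalability with the help of containers.",
    "DeepL": "This software architecture uses microservices and provides "
             "high scalability using containers.",
    "GPT-4": "This software architecture utilizes a microservices-based design "
             "and achieves high scalability through containerization.",
    "Claude": "This software structure uses microservice architecture and "
              "provides high scalability with the help of containers.",
    "NLLB-200": "This software structure uses microservice architecture and "
                "provides high scalability with the help of containers.",
}


def group_identical(outputs: dict[str, str]) -> list[list[str]]:
    """Group systems whose outputs match exactly (ignoring case and edge spaces)."""
    groups = defaultdict(list)
    for system, text in outputs.items():
        groups[text.strip().lower()].append(system)
    return [names for names in groups.values() if len(names) > 1]


def similarity(a: str, b: str) -> float:
    """Word-level similarity ratio between two translations, from 0.0 to 1.0."""
    return SequenceMatcher(None, a.split(), b.split()).ratio()


if __name__ == "__main__":
    print("Identical outputs:", group_identical(OUTPUTS))
    for name in ("DeepL", "GPT-4"):
        print(f"Google vs {name}: {similarity(OUTPUTS['Google'], OUTPUTS[name]):.2f}")
```

A check like this is mainly useful when comparing many sentences at once, where identical outputs across systems mean that running more than one of them adds no information.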

Strengths and Weaknesses

Google Translate

Strengths: Free and accessible. Handles Perso-Arabic script well. Substantial Persian web training data.

Weaknesses: Misses taarof (politeness) nuances. Literal translations. Less natural than GPT-4 or Claude.

DeepL

Strengths: Reasonable sentence restructuring. Acceptable for general content.

Weaknesses: Lower accuracy for Persian specifically. Occasional meaning distortion. Does not handle Dari or Tajik variants.

GPT-4

Strengths: Best overall quality. Excellent literary and contextual understanding. Handles taarof and register shifts. Strong technical and legal terminology.

Weaknesses: Higher cost. May occasionally mix Dari or Tajik influences with Iranian Persian.

Claude

Strengths: Strong quality for long documents. Good academic register. Consistent and reliable.

Weaknesses: Less dynamic with casual Persian. Sometimes overly literal with literary expressions.

NLLB-200

Strengths: Free and self-hostable. Covers Persian, Dari, and Tajik as separate languages. Reasonable baseline.

Weaknesses: Identical output to Google for many inputs. No register adaptation. Less fluent than GPT-4 or Claude.
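
As a rough illustration of the self-hosted route, the sketch below shows one way NLLB-200 might be run locally through the Hugging Face `transformers` library. Several things here are assumptions rather than details from this comparison: the checkpoint name `facebook/nllb-200-distilled-600M`, the choice of `transformers` and PyTorch as the runtime, and the `source_code` and `translate` helper names. The language codes are the FLORES-200 codes NLLB uses for Western Persian, Dari, and Tajik. Input would normally be in native script (Perso-Arabic or Cyrillic), not the romanized form used in the examples in this article.

```python
# FLORES-200 codes for the three varieties NLLB-200 treats as separate languages.
FLORES_CODES = {
    "persian": "pes_Arab",  # Western (Iranian) Persian, Perso-Arabic script
    "dari": "prs_Arab",     # Dari, Perso-Arabic script
    "tajik": "tgk_Cyrl",    # Tajik, Cyrillic script
}
TARGET_CODE = "eng_Latn"
MODEL_NAME = "facebook/nllb-200-distilled-600M"  # assumed checkpoint; larger ones exist


def source_code(variety: str) -> str:
    """Map a variety name to its FLORES-200 code, or raise ValueError."""
    key = variety.strip().lower()
    if key not in FLORES_CODES:
        raise ValueError(f"unsupported variety: {variety!r}")
    return FLORES_CODES[key]


def translate(text: str, variety: str = "persian", model_name: str = MODEL_NAME) -> str:
    """Translate native-script text to English with a locally hosted NLLB model."""
    # Imported lazily so the mapping above can be used without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name, src_lang=source_code(variety))
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
    inputs = tokenizer(text, return_tensors="pt")
    generated = model.generate(
        **inputs,
        # Force the decoder to start with the English language token.
        forced_bos_token_id=tokenizer.convert_tokens_to_ids(TARGET_CODE),
        max_new_tokens=128,
    )
    return tokenizer.batch_decode(generated, skip_special_tokens=True)[0]
```

Selecting the source code explicitly per variety is the design point: it is what lets a self-hosted setup keep Dari and Tajik input separate from Iranian Persian. A production setup would also load the model once and reuse it across calls instead of reloading it per request.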

Recommendations

| Use Case | Recommended System |
| --- | --- |
| Quick personal translation | Google Translate (free) |
| Legal and immigration docs | GPT-4 with human review |
| Literary and academic texts | GPT-4 or Claude |
| Business communication | GPT-4 |
| High-volume processing | NLLB-200 (self-hosted) |
| Diaspora communication | Google Translate or GPT-4 |
| News and media | Google Translate or Claude |
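
For teams that route translation requests automatically, the recommendations above could be encoded as a small lookup. This is only a sketch: the `recommend` function and the dictionary layout are illustrative choices, and the entries simply restate the table.

```python
# Use case -> recommended systems, restating the recommendations table.
RECOMMENDATIONS = {
    "quick personal translation": ("Google Translate",),
    "legal and immigration docs": ("GPT-4",),
    "literary and academic texts": ("GPT-4", "Claude"),
    "business communication": ("GPT-4",),
    "high-volume processing": ("NLLB-200",),
    "diaspora communication": ("Google Translate", "GPT-4"),
    "news and media": ("Google Translate", "Claude"),
}

# Use cases where the table pairs the system with human review.
HUMAN_REVIEW = frozenset({"legal and immigration docs"})


def recommend(use_case: str) -> dict:
    """Return the recommended systems and whether human review is advised."""
    key = use_case.strip().lower()
    if key not in RECOMMENDATIONS:
        raise KeyError(f"no recommendation for use case: {use_case!r}")
    return {
        "systems": list(RECOMMENDATIONS[key]),
        "human_review": key in HUMAN_REVIEW,
    }


if __name__ == "__main__":
    print(recommend("Legal and immigration docs"))
    print(recommend("News and media"))
```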

Key Takeaways

  • GPT-4 leads for Persian-to-English with the highest scores across all metrics and particularly strong performance on literary, legal, and contextually nuanced content.
  • Persian’s taarof (elaborate politeness) system creates translation challenges, as overly literal translations of polite phrases sound bizarre in English, while omitting them loses cultural meaning.
  • The Persian-Dari-Tajik continuum means training data from all three varieties contributes to AI quality, but can also introduce cross-variant confusion, particularly for Dari-specific or Tajik-specific terminology.
  • Persian’s relatively simple grammar (no gender, no case) makes it more amenable to AI translation than many languages of similar resource level, contributing to scores that approach high-resource pair quality.

Next Steps