Beyond the Benchmark: Arm’s length compliance is a valuation tool, not an immunity against criminal allegations of artificiality

Elena Mihaela Șolcă
Elena Mihaela Șolcă

Abstract: This article challenges the assumption that arm’s length transfer pricing compliance shields multinational groups from legal risk. While pricing methodologies and documentation may satisfy tax authorities in a technical sense, they do not neutralise criminal exposure where intra-group transactions lack economic substance or reflect deliberate manipulation of legal form. As enforcement shifts from quantitative limits toward purpose-based and composite transaction analysis, deductibility of management fees, royalties, and consultancy charges is increasingly assessed through the lens of intent and coherence. The paper argues that arm’s length pricing is a methodological tool, not a test of legitimacy, and that transfer pricing documentation may, in certain circumstances, function as evidentiary material in criminal proceedings.

1. Introduction: The Illusion of Technical Safety

In practice, transfer pricing is still treated as a kind of technical inoculation. Most companies follow a familiar ritual: find the right market data, make sure the numbers fall within the “arm’s length” range, and write up a detailed report. There is a widespread belief that once you file that report, you have reached a safe legal harbor.

That assumption is no longer just optimistic, it is dangerous.

While following the rules might settle a routine tax dispute or prevent a fine, the system was never meant to protect against criminal charges. Indeed, in certain configurations, a technically “correct” price may serve a counter-intuitive function: it may stabilize a misleading fiscal narrative, rendering an artificial arrangement institutionally durable and shielding it from early detection.

The reason for this vulnerability is conceptually straightforward yet persistently overlooked by practitioners: Criminal liability does not attach to prices; it attaches to conduct. Legal culpability is a function of intent (mens rea) and the deliberate deployment of legal forms to obscure economic reality.

Under increasing scrutiny from both tax authorities and prosecutors, it is becoming clear that a correctly benchmarked number cannot redeem a knowingly artificial structure. If the underlying transaction lacks economic substance, the precision of the mathematical models used to price it becomes legally irrelevant. The contemporary risk in transfer pricing is therefore not numerical or statistical, it is structural and evidentiary.[1]

II. The Arm’s Length Paradox

Intra-group transactions occupy a delicate space between legitimate corporate coordination and deliberate fiscal engineering. While this space is tolerated, it is not neutral. Here lies the arm’s length paradox: the arm’s length principle is a pricing methodology, not a truth test. Yet, it is routinely treated by practitioners as a definitive proxy for structural legitimacy.

This is a category error.

A transaction may be “arm’s length” in isolation, yet patently abusive when viewed in context. A management fee may be correctly priced according to market benchmarks, yet remain duplicative or entirely unnecessary for the recipient. Similarly, a corporate reorganization may be impeccably documented and economically benchmarked, but specifically designed to generate tax losses without the assumption of corresponding commercial risk.[2] In these instances, documentation is not merely detached from economic reality; it serves to stabilize a false fiscal narrative. Once formalized, this story propagates across filings, reports, and internal governance, effectively converting aggressive planning into a sustained and institutionally durable misrepresentation.

This is where criminal law enters, not through transfer pricing rules, but through doctrines of abuse of law, false accounting, or tax evasion.

Advance pricing agreements are powerful evidence of good faith. They demonstrate transparency and reliance on administrative guidance, and they substantially weaken allegations of intent. But they are not shields. APAs validate pricing methodologies, not economic substance. They do not immunise hollow structures, nor protect conduct that falls outside their assumptions.

III. From Tolerated Avoidance to Penal Relevance

For decades, sophisticated tax planning operated within a zone of uneasy tolerance. While politically contested, it was frequently characterized as a legitimate exercise in boundary testing rather than a categorical boundary violation. However, the contemporary landscape reveals that this permissible space is narrowing, fast. .

Modern anti-abuse frameworks increasingly reject formal legality as a sufficient condition for fiscal recognition. The recent introduction of expanded general anti-abuse rules (GAAR), most notably the framework established by Law 239/2025[3], reflects a decisive shift in regulatory intent. Under this new architecture, transactions are no longer assessed solely through the lens of technical compliance or mechanical correctness. Instead, they are subjected to a multi-dimensional evaluation of substance, coherence, and underlying purpose.[4]

Intra-group transactions sit at the epicenter of this paradigm shift.[5] Functionally, they are commercially indispensable; group-wide services represent rational responses to the requirements of the global market. Yet, from a legal perspective, these dealings inhabit a permanent grey zone. They are economically real, but they lack the “shield” of a real market test. Unlike a deal between two independent companies, where each side fights for the best price, intra-group deals happen in a vacuum of opposing interests. The price, the scope, and the very reason for the deal are decided internally by a single corporate voice, rather than through the friction of the open market.While this structural reality does not render such transactions abusive per se, it does render them uniquely susceptible to fiscal calibration, a fact that makes them a primary target for authorities looking beyond the paperwork.

The fundamental regulatory question has therefore evolved. It is no longer a matter of whether intra-group transactions exist as a formal matter of contract law; it is whether they warrant fiscal recognition in the specific form claimed by the taxpayer.[6] In this environment, the mere existence of a signed agreement and a transfer pricing study is no longer a terminal point of compliance, it is merely the starting point of a much more aggressive inquiry into economic intent.

IV. Deductibility as the Structural Fault Line

In the current enforcement climate, the most significant pressure point rarely emerges from revenue recognition. Instead, it resides within the mechanics of deductibility.

Intra-group flows, specifically management fees, royalty payments, and consultancy charges, are routinely leveraged as tools for shifting profits. These expenses are deducted in high-tax jurisdictions, while the corresponding income accrues to entities situated in more tax-friendly environments. While an isolated payment may appear modest and technically defensible, the cumulative effect is a significant reallocation of the taxable base. This systemic erosion has prompted a fundamental shift in the investigative methodology of tax authorities.

The focus of contemporary audits has moved beyond the “price-testing” paradigm. Authorities are increasingly bypassing the question of whether a price falls within an arm’s length range to ask a more existential question: Should the underlying transaction have existed at all? Under this new standard of review, the taxpayer is no longer merely required to defend a number; they must defend the very raison d’être of the service provided. If an administration concludes that a rational, independent actor would not have entered into the transaction, regardless of the price, the deduction is disallowed in its entirety.

In this context, a “perfect” transfer pricing study is useless if the transaction it describes is deemed commercially redundant or devoid of economic utility. The “structural fault line” is thus defined by the tension between formal contractual existence and demonstrable economic necessity.

V. Elasticity and Vulnerability: Management, IP, and Consultancy

The shift away from “technical inoculation” defense is most obvious in three categories of intra-group transactions: management, IP, and consultancy. These flows are characterized by a high degree of conceptual elasticity, which creates significant evidentiary gaps during a fiscal or criminal inquiry.

1. Management Services and Elastic Definitions

Management fees are uniquely exposed due to their inherent definitional fluidness. Under the guise of “management,” a group may aggregate strategic direction, operational coordination, regulatory compliance, and branding alignment.[7] However, modern audit standards, and increasingly, criminal prosecutors, apply a three-part test that transcends the arm’s length price:

– The “Rendered Service” Test: Does the transaction move beyond mere accounting entries to reflect a tangible activity?
– The “Benefit” Test: Did the recipient entity derive a specific, identifiable commercial advantage?
– The “Non-Duplication” Test: Does the service replicate functions already performed by local management?

Where documentation relies on generic descriptions or ex post cost allocations, deductibility is denied not because the price is excessive, but because the service itself dissolves under scrutiny.

2. Intellectual Property: The Shift from Legal Title to Value Creation

The tension within royalty structures reflects a broader shift in international tax law: the decoupling of legal ownership from economic substance. Historically, royalty payments were justified by the mere existence of a license agreement and a registered trademark. That formal logic is now largely obsolete.

Authorities now prioritize the DEMPE framework (Development, Enhancement, Maintenance, Protection, and Exploitation). Where IP ownership is detached from the functions of development and the assumption of risk, royalty flows begin to resemble unauthorized profit extraction rather than legitimate remuneration.

The inquiry is no longer whether the royalty rate is “within range,” but whether the economic narrative aligns with the operational reality of where value is actually created.[8]

3. Consultancy as Structural Solvent

Intra-group consultancy represents the most ambiguous terrain in transfer pricing. Because advisory services are inherently intangible and often episodic, they are frequently deployed as planning tools, yet they remain the most fragile legal constructs during a challenge.[9] Practitioners note that documentation shortcomings, especially with intangible, episodic consultancy arrangements, are a common focus of transfer pricing risk alerts, as authorities probe substance over form.[10]

Skepticism from authorities typically crystallizes when consultancy fees:

– Lack identifiable deliverables: No reports, memoranda, or tangible work products exist to evidence the service.
– Replicate internal expertise: The local entity already possesses the professional capacity for which it is purportedly “consulting” the parent.
– Are structural rather than situational: The fees recur with a suspicious regularity that does not correspond to actual operational changes or projects.

When consultancy becomes a permanent fixture of the balance sheet rather than a response to a specific business need, its legal character as a deductible expense collapses, leaving the taxpayer exposed to allegations of artificiality.

VI. Formal Legality and the Common Law Tradition

UK jurisprudence has long rejected the proposition that formal legality immunizes underlying conduct and it was one of the first pioneers in this matter. The logic underpinning the contemporary “single composite transaction” approach is not a modern enforcement aberration; rather, it is deeply embedded in the common law’s inherent suspicion of structures whose legal architecture obscures their operative purpose.

This analytical posture finds its most robust articulation in the criminal context in R v Dimsey; R v Allen.[11] In these proceedings, the Court of Appeal held that arrangements which were individually lawful could nonetheless ground criminal liability when their composite effect revealed a dishonest intent. The court’s focus was directed not at the legality of each discrete step, but at the coherence of the whole. Within this framework, formal compliance does not exculpate the actor; it merely contextualizes the intent.

The same structural reasoning is visible, albeit in civil form, in the landmark decision of Furniss v Dawson.[12] There, the House of Lords famously “collapsed” formally distinct transactions to expose their true economic purpose, establishing that tax consequences must follow the substance of a composite arrangement rather than its artificial components.

What has shifted in the intervening decades is not the analytical method, but the legal stakes. Where the “collapsing” of steps once served primarily to justify fiscal recharacterization, it now serves, in appropriate cases, to support criminal inference. This doctrinal continuity demonstrates that the transition from pricing to purpose, and from form to narrative, is not a contemporary novelty. It is a fundamental constant of the legal system, now operating in a significantly more punitive register.

VI. Transfer Pricing as Evidentiary Terrain

The convergence of tax compliance and criminal analysis is not a modern innovation; it was explicitly recognized at an institutional level as early as the 1980s. The Keith Committee[13] the foundational review of the enforcement powers of the UK revenue departments, expressly rejected any sharp bifurcation between tax administration and criminal enforcement. It emphasized that revenue powers operate within, and contribute to, the broader criminal justice system.

In this doctrinal light, tax compliance was never conceived as a technocratic silo. It has always been evidentiary terrain.

The contemporary treatment of transfer pricing documentation is, therefore, entirely consistent with this tradition. Documentation does not merely record a transaction; it fixes a version of events. Where that version is used to sustain a misleading economic narrative, its probative value does not dissipate, it increases. Under the scrutiny of a prosecutor, the transfer pricing file is no longer a compliance document; it is a primary piece of evidence used to demonstrate the deliberate alignment of legal form against operative reality.

VII. The Tyranny of the Benchmark

This pathology is not unique to the fiscal sphere. In the broader context of corporate governance, the Kay Review[14] diagnosed what it termed the “tyranny of the benchmark”: the systematic substitution of metrics for judgment, and of formal indicators for substantive truth. In such environments, compliance begins to “crowd out” reality, creating a veneer of legitimacy that masks underlying structural failures.

Arm’s length pricing exhibits this exact dynamic. Benchmarks become proxies for legitimacy, and the identification of “comparables” routinely displaces genuine inquiry into the substance of a transaction. Yet, the comfort of numerical alignment often masks a deeper vulnerability. When benchmarks are deployed to stabilize a narrative that is fundamentally divorced from economic substance, they cease to be safeguards. They become artefacts of intent, documents capable of corroborating, rather than dispelling, an inference of dishonesty.[15]

VIII. The retreat from quantitative limits

Recent legislative volatility illustrates this shift with unusual clarity.[16] The introduction, and subsequent rapid elimination, of fixed deductibility caps for intra-group payments, most notably the 1% threshold proposed in Law 239/2025 and adjusted by GO 6/2026[17], reflects a broader retreat from mechanical anti-avoidance tools. While administratively simple, these caps were never elegant; they were blunt instruments that often clashed with treaty principles and OECD standards.

However, their removal does not signal a return to a permissive fiscal era. It signals a sophisticated reallocation of legal risk.

Paradoxically, the liberalization of deductibility actually intensifies criminal exposure. When the law no longer relies on ex ante statutory percentages, scrutiny necessarily migrates ex post. By removing the numeric ceiling, the State relinquishes the ease of automatic disallowance but reserves a far more potent power: discretionary judgment. The focus shifts from the math of “the cap” toward the “darker”[18] questions of recharacterization, abuse-of-law, and criminal intent.

This transition has two critical consequences for the corporate actor:

1. Weakened Legal Anchoring: While taxpayers may technically deduct higher amounts, they do so with significantly less legal certainty. Compliance becomes a qualitative argument rather than a quantitative calculation.

2. The Personalization of Enforcement: Without the “safe harbor” or “hard ceiling” of a cap, the decisive question is no longer how much was deducted, but why. The state is now empowered to investigate whether a transaction reflects a genuine economic reality or a constructed fiscal narrative.

In this evolving landscape, arm’s length pricing no longer functions as the primary line of defense. With the removal of fixed deductibility caps, the assessment of intra-group transactions turns less on identifying a technically defensible number and more on evaluating the transaction’s underlying purpose and economic substance. Transfer pricing documentation, once largely a compliance instrument, now serves as a window into corporate intent, exposing the coherence, or fragility, of the arrangement as a whole.

This transformation reshapes the compliance paradigm. Tax authorities increasingly examine whether the allocation of functions, risks, and assets reflects commercial reality rather than post hoc justification. In that environment, formal benchmarking becomes merely one element of a broader evidentiary framework. The decisive factor is not numerical precision but structural credibility: only arrangements grounded in genuine economic logic can endure regulatory review.


[1] Ivey v Genting Casinos (UK) Ltd [2017] UKSC 67; OECD Transfer Pricing Guidelines (2022), Para 1.122.
[2] OECD Transfer Pricing Guidelines (2022); WT Ramsay Ltd v IRC [1981] STC 174.
[3] Law No. 239/2025 on the General Anti-Abuse Rule, Monitorul Oficial al României, pt. I (Rom.).
[4] KPMG Romania, Achiziții intragrup controversate – inspecții fiscale (Oct. 2018),
[5] Ibid. 2.
[6] Picciotto, S. (2011), Regulating Global Corporate Capitalism; OECD (2022), Chapter I, Section D.1: “Accurate Delineation of the Actual Transaction”.
[7] PwC Romania, Serviciile intragrup – de ce sunt ușor de contestat de către autoritățile fiscale? Ce pot face companiile?(Jan. 18, 2022).
[8] OECD (2022), Chapter VI: Special Considerations for Intangibles (DEMPE).
[9] EY România, Buletin fiscal – EY de prețuri de transfer, 1 iulie 2025.
[10] Ibid. 5.
[11] R v. Dimsey; R v. Allen, [2000] EWCA Crim 121, [2001] 1 Cr. App. R. 19 (Eng.).
[12] Furniss v. Dawson, [1984] A.C. 474 (H.L.) (Eng.).
[13] Enforcement Powers of the Revenue Departments (H.C. 1983–84, Cmnd. 8822) (U.K.).
[14] John Kay, The Kay Review of UK Equity Markets and Long-Term Decision Making (2012) (U.K.).
[15] Braithwaite, J. (2005), Markets in Vice, Markets in Virtue
[16] Emergency Government Ordinance no. 6/2026.
[17]  Ibid. 13.
[18] HMRC Fraud Investigation Service (FIS) Manual, CH27300.


Elena Mihaela Șolcă