PaperRRD

RE:Ghost Rank: Detecting Elliptic Curves with Anomalous Tate-Shafarevich Groups Across 1.9 Million Cremona Curves

RE:Ghost Rank: Detecting Elliptic Curves with Anomalous Tate-Shafarevich Groups Across 1.9 Million Cremona Curves

byAdam MurphyPublished 3/11/2026AI Rating: 3.7/5

A large-scale empirical study of the Birch and Swinnerton-Dyer (BSD) conjecture using 1.9 million elliptic curves from Cremona's database, revealing anomalous Tate-Shafarevich groups and establishing a power-law distribution with exponent α ≈ 2. The work provides numerical verification of BSD identities for rank-0 curves and identifies systematic patterns in Sha group sizes across unprecedented dataset scales.

Top 10% Internal Consistency
View Shareable Review Profile- permanent credential link for endorsements
Revisions Suggested
Internal Consistency4/5

The rebuttal is largely self-consistent in how it reframes claims: it retracts any implication that BSD is “proven,” narrows Kolyvagin’s role to finiteness results, and correctly reclassifies §4.1 as a pipeline consistency check given that Cremona’s |Ш| values are BSD-derived. The remaining tension is the strong statement “Within our dataset, there is no ambiguity” while also limiting the explicit Manin-constant verification to optimal curves with conductor ≤ 500,000 and treating non-optimal curves via “absorbed into Cremona’s tabulated values.” That is not a formal contradiction, but it is an overreach unless the dataset is explicitly restricted or the normalization pathway is made uniform and explicit.

Mathematical Validity2/5

A key mathematical vulnerability remains around normalization: removing c_E^2 from the BSD working formula and declaring “all subsequent equations inherit c_E = 1” is only valid if every curve’s invariants are computed in a manner consistent with c_E=1 (e.g., always using optimal curves within a verified range) or if the inputs are all already Cremona-normalized in a way that makes c_E irrelevant in practice. The rebuttal provides only a partial justification (verified c_E=1 for optimal curves up to a conductor bound; non-optimal curves handled by absorption into tabulated values), which is not sufficient for a 1.9M-curve study without a clear, uniform accounting. Additionally, the rebuttal references key equations ((1), (2), (6)) but does not present them, preventing audit of factors (period conventions across components, Tamagawa numbers, torsion, etc.) and making it difficult to assess whether any multiplicative-factor mistakes could contaminate derived statistics.

Falsifiability4/5

As an empirical paper, the central claims are testable: the reported distributional pattern for |Ш| (including the fitted exponent and sensitivity to cutoffs) and the pipeline-consistency reproduction of Cremona/LMFDB quantities can be independently replicated on the same database with the authors’ code and parameter settings. The work becomes less falsifiable only where it leans on quantities computed via BSD itself (e.g., using tabulated |Ш| to “verify” BSD), but the rebuttal now explicitly frames that as non-independent.

Clarity4/5

The rebuttal is clear about what was changed (language around “proven” vs “verified,” and the intended interpretation of §4.1), and it states the key methodological limitation (only 38 unique |Ш| values for power-law fitting). Clarity is still limited by the absence of the actual revised manuscript text and by not showing the referenced equations and exact normalization conventions in-line, which are central to interpreting the results.

Novelty4/5

The scale (≈1.9 million Cremona curves) and the focus on empirical patterns in anomalous Tate–Shafarevich group sizes across such a large corpus are meaningfully novel as a data-driven contribution, especially if accompanied by reproducible code and careful normalization. The conceptual framing (connections to Delaunay-type heuristics and tail behavior near α≈2) is not wholly new, but the dataset-scale synthesis and empirical characterization plausibly are.

Completeness4/5

The rebuttal addresses each reviewer concern directly, includes concrete action items (language revisions, added paragraph on Manin constant), and acknowledges methodological limitations rather than over-claiming. What remains incomplete for review is (i) the actual revised text and (ii) a rigorous, dataset-wide normalization/implementation description sufficient to rule out multiplicative-factor errors (especially concerning c_E and representative choice within isogeny classes).

Publication criteria: All dimensions must score at least 2/5 with an overall average of 3/5 or higher. The AI recommendation badge above is advisory - publication is determined by the numerical scores.

The rebuttal makes genuinely constructive improvements in scientific positioning. In particular, it corrects the strongest scope issue by removing “proven/theorem” language about BSD exactness and restricting Kolyvagin’s citation to finiteness results, which aligns well with established theory. It also appropriately acknowledges the circularity inherent in using Cremona-tabulated |Ш| values computed via BSD, reframing §4.1 as a pipeline consistency check rather than an independent validation.

Where the submission still appears fragile is the mathematical bookkeeping around normalization—especially the Manin constant. The move to delete c_E^2 from the working BSD equation and then declare that subsequent equations “inherit c_E=1” is a high-impact choice: if even a subset of the 1.9M curves are outside the explicitly verified regime (optimal, conductor ≤ 500k) or if different representatives/models are mixed, the resulting |Ш| or derived statistics can be off by rational factors. The rebuttal gestures at “absorption into Cremona’s tabulated values” for non-optimal curves, but that does not yet read like a dataset-wide proof of consistent normalization; it reads like a plausible practice that still needs to be specified precisely.

On the empirical side, the tail-fit discussion is reasonably cautious: the authors acknowledge that Clauset–Shalizi–Newman-style testing would be preferable and that n=38 unique |Ш| values limits statistical leverage. That restraint is appropriate. However, the headline claim of a power-law with α≈2 will be received as stronger than “notable observation” unless the manuscript very clearly separates (a) descriptive fit on binned/unique values from (b) inferential claims about an underlying distribution.

Overall, this looks like a potentially valuable large-scale empirical study, but it is not yet in a state where a reader can be confident that the key computed quantities are uniformly normalized across the entire dataset. Fixing that would substantially raise confidence in every downstream statistic and would likely upgrade both mathematical validity and the credibility of the empirical findings.

This review was generated by AI for research and educational purposes. It is not a substitute for formal peer review. All analyses are advisory; publication decisions are based on numerical score thresholds.

Key Equations (2)

L(E,1)ΩE+=Sha(E)Tam(E)E(Q)2\frac{L(E,1)}{\Omega_E^+} = \frac{|\text{Sha}(E)| \cdot \text{Tam}(E)}{|E(\mathbb{Q})|^2}

Birch and Swinnerton-Dyer conjecture identity for rank-0 curves with Manin constant c_E = 1

α=2.02±0.07\alpha = 2.02 \pm 0.07

Power-law exponent for Tate-Shafarevich group size distribution from OLS fitting

Testable Predictions (2)

Tate-Shafarevich group sizes follow a power-law distribution with exponent α ≈ 2

mathpending

Falsifiable if: Systematic deviation from power-law behavior in larger datasets or MLE analysis showing significantly different exponent

The exponent α ≈ 2 represents the convergence/divergence threshold for E[|Sha|]

mathpending

Falsifiable if: Theoretical proof that the true exponent differs significantly from 2 or empirical evidence of convergent moments

Tags & Keywords

BSD conjecture(math)elliptic curves(math)large-scale computation(methodology)number theory(math)power-law fitting(methodology)statistical analysis(methodology)Tate-Shafarevich groups(math)

Keywords: Birch and Swinnerton-Dyer conjecture, Tate-Shafarevich groups, elliptic curves, Cremona database, power-law distribution, L-functions, rank-0 curves, Manin constant, numerical verification, Kolyvagin

You Might Also Find Interesting

Semantically similar papers and frameworks on TOE-Share

Finding recommendations...