paper Review Profile
RE:Ghost Rank: Detecting Elliptic Curves with Anomalous Tate-Shafarevich Groups Across 1.9 Million Cremona Curves
A large-scale empirical study of the Birch and Swinnerton-Dyer (BSD) conjecture using 1.9 million elliptic curves from Cremona's database, revealing anomalous Tate-Shafarevich groups and establishing a power-law distribution with exponent α ≈ 2. The work provides numerical verification of BSD identities for rank-0 curves and identifies systematic patterns in Sha group sizes across unprecedented dataset scales.
Read the Full BreakdownThe rebuttal is largely self-consistent in how it reframes claims: it retracts any implication that BSD is “proven,” narrows Kolyvagin’s role to finiteness results, and correctly reclassifies §4.1 as a pipeline consistency check given that Cremona’s |Ш| values are BSD-derived. The remaining tension is the strong statement “Within our dataset, there is no ambiguity” while also limiting the explicit Manin-constant verification to optimal curves with conductor ≤ 500,000 and treating non-optimal curves via “absorbed into Cremona’s tabulated values.” That is not a formal contradiction, but it is an overreach unless the dataset is explicitly restricted or the normalization pathway is made uniform and explicit.
A key mathematical vulnerability remains around normalization: removing c_E^2 from the BSD working formula and declaring “all subsequent equations inherit c_E = 1” is only valid if every curve’s invariants are computed in a manner consistent with c_E=1 (e.g., always using optimal curves within a verified range) or if the inputs are all already Cremona-normalized in a way that makes c_E irrelevant in practice. The rebuttal provides only a partial justification (verified c_E=1 for optimal curves up to a conductor bound; non-optimal curves handled by absorption into tabulated values), which is not sufficient for a 1.9M-curve study without a clear, uniform accounting. Additionally, the rebuttal references key equations ((1), (2), (6)) but does not present them, preventing audit of factors (period conventions across components, Tamagawa numbers, torsion, etc.) and making it difficult to assess whether any multiplicative-factor mistakes could contaminate derived statistics.
As an empirical paper, the central claims are testable: the reported distributional pattern for |Ш| (including the fitted exponent and sensitivity to cutoffs) and the pipeline-consistency reproduction of Cremona/LMFDB quantities can be independently replicated on the same database with the authors’ code and parameter settings. The work becomes less falsifiable only where it leans on quantities computed via BSD itself (e.g., using tabulated |Ш| to “verify” BSD), but the rebuttal now explicitly frames that as non-independent.
The rebuttal is clear about what was changed (language around “proven” vs “verified,” and the intended interpretation of §4.1), and it states the key methodological limitation (only 38 unique |Ш| values for power-law fitting). Clarity is still limited by the absence of the actual revised manuscript text and by not showing the referenced equations and exact normalization conventions in-line, which are central to interpreting the results.
The scale (≈1.9 million Cremona curves) and the focus on empirical patterns in anomalous Tate–Shafarevich group sizes across such a large corpus are meaningfully novel as a data-driven contribution, especially if accompanied by reproducible code and careful normalization. The conceptual framing (connections to Delaunay-type heuristics and tail behavior near α≈2) is not wholly new, but the dataset-scale synthesis and empirical characterization plausibly are.
The rebuttal addresses each reviewer concern directly, includes concrete action items (language revisions, added paragraph on Manin constant), and acknowledges methodological limitations rather than over-claiming. What remains incomplete for review is (i) the actual revised text and (ii) a rigorous, dataset-wide normalization/implementation description sufficient to rule out multiplicative-factor errors (especially concerning c_E and representative choice within isogeny classes).
The rebuttal makes genuinely constructive improvements in scientific positioning. In particular, it corrects the strongest scope issue by removing “proven/theorem” language about BSD exactness and restricting Kolyvagin’s citation to finiteness results, which aligns well with established theory. It also appropriately acknowledges the circularity inherent in using Cremona-tabulated |Ш| values computed via BSD, reframing §4.1 as a pipeline consistency check rather than an independent validation. Where the submission still appears fragile is the mathematical bookkeeping around normalization—especially the Manin constant. The move to delete c_E^2 from the working BSD equation and then declare that subsequent equations “inherit c_E=1” is a high-impact choice: if even a subset of the 1.9M curves are outside the explicitly verified regime (optimal, conductor ≤ 500k) or if different representatives/models are mixed, the resulting |Ш| or derived statistics can be off by rational factors. The rebuttal gestures at “absorption into Cremona’s tabulated values” for non-optimal curves, but that does not yet read like a dataset-wide proof of consistent normalization; it reads like a plausible practice that still needs to be specified precisely. On the empirical side, the tail-fit discussion is reasonably cautious: the authors acknowledge that Clauset–Shalizi–Newman-style testing would be preferable and that n=38 unique |Ш| values limits statistical leverage. That restraint is appropriate. However, the headline claim of a power-law with α≈2 will be received as stronger than “notable observation” unless the manuscript very clearly separates (a) descriptive fit on binned/unique values from (b) inferential claims about an underlying distribution. Overall, this looks like a potentially valuable large-scale empirical study, but it is not yet in a state where a reader can be confident that the key computed quantities are uniformly normalized across the entire dataset. Fixing that would substantially raise confidence in every downstream statistic and would likely upgrade both mathematical validity and the credibility of the empirical findings.
Strengths
- +Corrected overstatement about BSD being “proven”; claims are now framed as numerical verification/consistency rather than theorem-level results
- +Explicit recognition that reproducing BSD identities using BSD-derived tabulated |Ш| is not an independent test, and sensible framing of §4.1 as a pipeline integrity check
- +Large-scale empirical scope (1.9M curves) with potentially novel aggregate patterns in |Ш| behavior
- +Appropriately cautious language around power-law fitting given limited unique |Ш| support
Areas for Improvement
- -Provide a dataset-wide, unambiguous normalization/implementation specification: exactly which curve representative is used (optimal vs non-optimal; within isogeny class), which quantities come from Cremona vs recomputed, and how factors (c_E, Ω_E conventions, Tamagawa numbers, torsion) are handled consistently
- -Do not assert “no ambiguity” unless the paper proves the needed conditions hold for the full dataset; otherwise qualify the claim (e.g., ‘under Cremona’s conventions’ or ‘for the subset with …’)
- -Include the explicit forms of the referenced equations (1), (2), (6) and define all symbols so readers can audit multiplicative factors
- -Strengthen the distributional analysis presentation: clearly separate descriptive fits from inferential statements; consider adding robustness checks beyond R² (even if CSN09 MLE/KS is underpowered, you can still report alternative fits, bootstrap stability, or Bayesian model comparisons with clear caveats)
- -Discuss computational error propagation and numerical precision choices (L(E,1) evaluation, period computation, rounding/thresholding for |Ш|) and how these could affect tail counts
Share this Review
Post your AI review credential to social media, or copy the link to share anywhere.
theoryofeverything.ai/review-profile/paper/82edb350-3026-474e-9754-1367ae7d3279This review was conducted by TOE-Share's multi-agent AI specialist pipeline. Each dimension is independently evaluated by specialist agents (Math/Logic, Sources/Evidence, Science/Novelty), then synthesized by a coordinator agent. This methodology is aligned with the multi-model AI feedback approach validated in Thakkar et al., Nature Machine Intelligence 2026.
TOE-Share — theoryofeverything.ai