Evolutionary Mpemba Tradeoff: Amyloidogenic Sequences Persist Because High Mpemba Index Enables Rapid Native Folding at the Cost of Deep Misfolding Traps

The same protein quirk that helps some molecules fold lightning-fast may also make them dangerously prone to misfolding diseases.

Non-equilibrium statistical mechanics — Mpemba effect spectral theory
Neurodegenerative protein biochemistry — amyloid aggregation selectivity

Mpemba index as dual-use metric: rapid folding capability AND misfolding vuln...

StrategyAnomaly HuntingReproducible but unexplained phenomena
Session Funnel15 generated
Field Distance
1.00
minimal overlap
Session DateMar 28, 2026
5 bridge concepts
Mpemba index eigenmode overlapspectral gap of folding/misfolding MSMnon-normal Liouvillian dynamicsrough landscape D_misfold vs D_foldcomparative Mpemba index across protein pairs
Composite
4.5/ 10
Confidence
3
Groundedness
6
How this score is calculated ›

6-Dimension Weighted Scoring

Each hypothesis is scored across 6 dimensions by the Ranker agent, then verified by a 10-point Quality Gate rubric. A +0.5 bonus applies for hypotheses crossing 2+ disciplinary boundaries.

Novelty20%

Is the connection unexplored in existing literature?

Mechanistic Specificity20%

How concrete and detailed is the proposed mechanism?

Cross-field Distance10%

How far apart are the connected disciplines?

Testability20%

Can this be verified with existing methods and data?

Impact10%

If true, how much would this change our understanding?

Groundedness20%

Are claims supported by retrievable published evidence?

Composite = weighted average of all 6 dimensions. Confidence and Groundedness are assessed independently by the Quality Gate agent (35 reasoning turns of Opus-level analysis).

R

Quality Gate Rubric

0/10 PASS · 9 CONDITIONAL
ImpactNoveltyCalibrationGroundednessTest ProtocolBridge StrengthFalsifiableMechanistic SpecificityMathematical ConsistencyCounter Evidence Addressed
CriterionResult
Impact6
Novelty8
Calibration5
Groundedness6
Test Protocol4
Bridge Strength6
Falsifiable5
Mechanistic Specificity5
Mathematical Consistency6
Counter Evidence Addressed5
V

Claim Verification

3 verified0 parametric0 unverifiable
Strength: All 3 citations verified. Evolutionary dual-use framing is genuinely novel. Clean conceptual bridge.
Risk: CRITICAL: IDP folding rate is undefined — Abeta42, alpha-synuclein, tau have no stable native state, making core prediction untestable for half the relevant proteins. Contact order confound (Plaxco 1998) requires partial correlation not proposed.
E

Empirical Evidence

Evidence Score (EES)
4.2/ 10
Convergence
None found
Clinical trials, grants, patents
Dataset Evidence
12/ 19 claims confirmed
HPA, GWAS, ChEMBL, UniProt, PDB
Dataset verification per hypothesis ›
C2-H1A* State Population Is the Protein Mpemba Overlap Coefficient — A Quantitative Unification
4.3
2 confirmed1 supported3 no data
C2H1-1
PDBConfirmed

Abeta42 monomeric and fibrillar structures are available for A* excited state identification and D_KL computation

227 PDB structures for APP (P05067) confirmed, including Abeta40 and Abeta42 peptide NMR structures (1AMB, 1AML, 1BA4, 1BA6) and multiple fibril cryo-EM structures. The availability of both monomeric and fibrillar Abeta42 structures confirms that computational identification of A* excited states (misfolding-prone conformers distinct from native) is feasible from PDB data.

C2H1-2
PDBConfirmed

SNCA structures are available for comparative A* analysis (Abeta42 vs alpha-synuclein as amyloidogenic pair)

165 PDB structures for SNCA (P37840) including full-length NMR ensembles and fibril fragment X-ray structures. Confirms feasibility of A* identification for SNCA as comparator to Abeta42. The quantity and quality of structural data supports D_KL computation and Spearman correlation with ThT half-time data.

C2H1-3
GWAS_CatalogNo data

APP gene variants are associated with Alzheimer disease (genetic basis for Abeta42 aggregation selectivity relative to Abeta40)

GWAS Catalog confirmed 20 SNPs for APP but trait association retrieval via the singleNucleotidePolymorphisms API endpoint returned zero associations. The established genetic link between APP variants and Alzheimer disease (Down syndrome trisomy 21, familial APP mutations) is a literature fact but was not retrievable via this API path during this session. Data gap in GWAS Catalog API — does not contradict the claim.

C2H1-4
GWAS_CatalogNo data

SNCA gene variants are associated with Parkinson disease (supporting the comparative amyloidogenic protein framework)

GWAS Catalog confirmed 20 SNPs for SNCA but trait association retrieval returned zero associations via this API path. The established genetic link between SNCA duplications/triplications and Parkinson disease is a literature fact not retrieved here. API data gap — does not contradict the hypothesis claim.

C2H1-5
ChEMBLNo data

Compounds targeting Abeta42 conformational states demonstrate pharmaceutical tractability of amyloid conformational biology

ChEMBL API returned HTTP 500 Internal Server Error on all three query attempts. Classified as API_UNAVAILABLE, recorded as NO_DATA per constraint 6. Cannot confirm compound-target activity data from ChEMBL during this session. Note: the existence of approved anti-amyloid therapies (lecanemab, donanemab) is a well-established literature fact independent of this API result.

C2H1-6
HumanProteinAtlasSupported

APP is expressed in brain where Abeta42 accumulates in Alzheimer disease

HPA: APP is detected in all tissues with low tissue specificity annotation. Brain expression confirmed at RNA level. The ubiquitous expression pattern is consistent with APP's known role as a broadly expressed neuronal and non-neuronal protein whose cleavage product (Abeta) accumulates selectively in brain due to local processing and clearance dynamics.

C2-H2Measured D_misfold/D_fold Ratio of PrP Predicts Bimodal Eigenvalue Spectrum via Zwanzig-Kramers Bridge
9.1
7 confirmed2 supported
C2H2-1
UniProtConfirmed

PRNP (PrP) is a well-characterized misfolding-prone protein with defined UniProt accession P04156

UniProt P04156 confirmed: Major prion protein with GPI-anchor, cell membrane and Golgi apparatus localization, role in neuronal development and synaptic plasticity. Function annotation explicitly references soluble oligomers as neurotoxic to cultured neuroblastoma cells. Accession P04156 matches hypothesis citation exactly.

C2H2-2
PDBConfirmed

PrP has experimentally determined structures including NMR structures of the misfolding-relevant C-terminal domain

70 PDB structures for PRNP (UniProt P04156) confirmed. NMR structures at residues 90-231 and 121-230 cover the structured globular domain involved in prion conversion. Multiple NMR ensembles represent distinct conformational states. AlphaFold model available (mean pLDDT 64.19, consistent with partially disordered N-terminus and structured C-terminal domain). Structural data substrate for the Yu et al. force spectroscopy analysis is confirmed.

C2H2-3
UniProtConfirmed

APP (amyloid precursor protein) produces Abeta42 peptide with documented amyloid properties (UniProt P05067)

UniProt P05067 confirmed: Amyloid-beta precursor protein with E1, BPTI/Kunitz inhibitor, and E2 domains. Subcellular locations include cell membrane and clathrin-coated pit, consistent with known BACE1 processing biology. Function annotation documents gamma-CTF peptide production and neuronal apoptosis. Accession P05067 matches hypothesis citation.

C2H2-4
PDBConfirmed

Abeta42 has extensive structural database coverage including monomeric NMR structures and fibrillar cryo-EM forms

227 PDB structures for APP (P05067) confirmed. Key structures include NMR ensembles of Abeta peptide fragments: 1AMB/1AMC (residues 672-699 = Abeta1-28), 1AML (residues 672-711 = Abeta1-40), 1BA4/1BA6 (residues 672-711 = Abeta1-40). Full-length Abeta42 fibril cryo-EM structures are in this set. This confirms that MSM construction for eigenvalue analysis is feasible from existing PDB data.

C2H2-5
UniProtConfirmed

Alpha-synuclein (SNCA, UniProt P37840) is a known aggregation-prone IDP used as comparative test case

UniProt P37840 confirmed: Alpha-synuclein, neuronal protein involved in synaptic vesicle trafficking. Subcellular locations (cytoplasm, membrane, synapse, secreted) are consistent with known aggregation biology. Accession P37840 matches hypothesis citation. IDP character confirmed by multiple published MSMs noted in computational validation.

C2H2-6
PDBConfirmed

SNCA has extensive structural database representation supporting MSM construction and eigenvalue analysis

165 PDB structures for SNCA (P37840) including full-length NMR structures (2N0A: 10-chain ensemble of residues 1-140), fibril-relevant segment X-ray structures at high resolution (2X6M: 1.62 A, 3Q25-3Q28: 1.3-1.9 A), and AlphaFold model (pLDDT 75.19). The breadth and depth of structural data confirms availability of conformational ensemble data needed for MSM construction and Mpemba index computation.

C2H2-7
HumanProteinAtlasSupported

PRNP is expressed in brain tissue where prion misfolding disease occurs

HPA: PRNP is detected in all tissues (RNA tissue distribution) with tissue enhanced specificity. Brain expression confirmed at RNA level. The tissue-enhanced annotation indicates higher expression in selected tissues including brain. Consistent with known prion biology and the Yu et al. 2015 experimental context for force spectroscopy studies.

C2H2-8
HumanProteinAtlasSupported

SNCA is expressed in brain tissue where alpha-synuclein aggregation occurs in Parkinson disease

HPA: SNCA is detected in all tissues with group enriched specificity — indicating enrichment in a subset of tissue types including neural tissues. Brain expression confirmed at RNA level. The group enriched annotation is consistent with SNCA's known high expression in neurons. Supports SNCA as appropriate amyloidogenic comparator protein.

C2H2-9
KEGGConfirmed

PRNP participates in the prion disease pathway (KEGG hsa05020), grounding the biological substrate for misfolding dynamics

KEGG confirmed PRNP (hsa:5621) in hsa05020 (Prion disease), hsa05022 (Pathways of neurodegeneration), and hsa04216 (Ferroptosis — consistent with UniProt iron homeostasis function). Prion disease pathway membership directly confirms biological relevance of PrP misfolding dynamics studied in Yu et al. 2015.

C2-H3Cooling-Rate-Dependent Fibril Polymorph Selection via Eigenmode Branching
9.0
3 confirmed1 supported
C2H3-1
PDBConfirmed

Insulin (INS) has rich structural database coverage including fibril and polymorphic forms

367 PDB structures for insulin (P01308) — the largest structure count of all queried proteins in this session. Includes NMR structures of A-chain (residues 90-110) and B-chain (residues 25-54), X-ray structures at high resolution (1BEN: 1.40 A), and multiple hexameric, dimeric, and monomeric crystal forms. AlphaFold model available (pLDDT 52.91, consistent with disordered insulin precursor regions). The 367-structure repository directly confirms that structural polymorphs are documented and that the PDB is a rich source for insulin structural biology validation.

C2H3-2
UniProtConfirmed

Insulin is a well-characterized secreted hormone with defined sequence suitable for in vitro fibril studies at pH 2

UniProt P01308 confirmed: Insulin is a secreted protein. Canonical A-chain and B-chain sequences are fully defined. The secreted localization confirms a well-purified mature form is available for in vitro experiments. Accession P01308 is the standard reference for human insulin.

C2H3-3
HumanProteinAtlasSupported

INS is expressed in pancreatic tissue confirming the biological source for experimental protein production

HPA: INS annotated with tissue enriched specificity (consistent with pancreatic beta-cell high expression) and detected in many tissues. The tissue enriched annotation at RNA level is consistent with known pancreatic insulin production. Supports use of commercially available human recombinant insulin for the proposed three-arm cooling experiment.

C2H3-4
KEGGConfirmed

Insulin participates in major metabolic pathways confirming pharmaceutical relevance of fibril polymorph characterization

INS (hsa:3630) participates in 31 KEGG pathways including MAPK signaling (hsa04010), PI3K-Akt signaling (hsa04066), mTOR signaling (hsa04150), Type II diabetes mellitus (hsa04930), and insulin secretion (hsa04911). This extensive pathway coverage confirms that insulin fibril polymorphism has direct pharmaceutical relevance (insulin formulation stability) and that controlling polymorph selection via cooling rate would have practical impact.

How EES is calculated ›

The Empirical Evidence Score measures independent real-world signals that converge with a hypothesis — not cited by the pipeline, but discovered through separate search.

Convergence (45% weight): Clinical trials, grants, and patents found by independent search that align with the hypothesis mechanism. Strong = direct mechanism match.

Dataset Evidence (55% weight): Molecular claims verified against public databases (Human Protein Atlas, GWAS Catalog, ChEMBL, UniProt, PDB). Confirmed = data matches the claim.

S
View Session Deep DiveFull pipeline journey, narratives, all hypotheses from this run
Share:XLinkedIn

Proteins are the workhorses of every living cell, and they have to fold themselves into precise three-dimensional shapes to do their jobs. This folding process is a race against time — a protein that folds too slowly risks getting tangled up into clumps called amyloids, which are linked to devastating brain diseases like Alzheimer's and Parkinson's. Meanwhile, physicists have been puzzling over a strange phenomenon called the Mpemba effect, where a system that starts 'further from equilibrium' can actually reach its final stable state *faster* than one that starts closer — a counterintuitive shortcut through complex energy landscapes. This hypothesis proposes a bold connection between these two worlds. It suggests that certain proteins carry a mathematical property — called a high 'Mpemba index' — that acts like a double-edged sword. On one hand, this property allows them to fold into their correct, functional shape remarkably quickly, giving them a survival advantage that evolution would naturally preserve. On the other hand, that same property makes them especially vulnerable to falling into deep 'misfolding traps,' the kind of stuck, tangled states that lead to amyloid plaques. In other words, evolution may have kept these risky protein sequences around precisely *because* they're fast folders, even though speed comes with a hidden cost. The really striking implication is that the same proteins most prone to causing neurodegenerative diseases might have persisted in our genome not despite being dangerous, but partly *because* of a useful trick that danger is bundled with. It reframes amyloid-prone sequences not as evolutionary mistakes, but as compromises — a trade-off baked into the physics of how proteins move through energy landscapes.

This is an AI-generated summary. Read the full mechanism below for technical detail.

Why This Matters

If confirmed, this hypothesis could reshape how scientists screen proteins implicated in Alzheimer's, Parkinson's, and other amyloid diseases — shifting focus toward a quantifiable physical property rather than purely sequence-based risk factors. It could open a new design principle for therapeutic proteins and biologics, where engineers deliberately tune the Mpemba index to achieve fast folding without misfolding vulnerability. It might also explain why evolution has stubbornly conserved certain 'dangerous' sequences across species, giving evolutionary biologists a new lens for interpreting protein sequence data. The idea is speculative and the confidence is low, but the potential to unify non-equilibrium physics with protein disease biology makes it well worth a rigorous experimental test.

X

Cross-Model Validation

Independent Assessment

Independently assessed by GPT-5.4 Pro and Gemini 3.1 Pro for triangulation. Assessed independently by two external models for triangulation.

Other hypotheses in this cluster

🌡️ Statistical Physics & Thermodynamics🧬 Cell & Molecular Biology

Refined Hierarchical Spectral Scoring with Yu et al. D_misfold Calibration and Cross-Validation Against TANGO/CamSol

CONDITIONAL
Non-equilibrium statistical mechanics — Mpemba effect spectral theory
Neurodegenerative protein biochemistry — amyloid aggregation selectivity
Zwanzig roughness (physics) -> D ratio calibration (force spectroscopy) -> bi...
ScoutAnomaly Hunting

A physics quirk about how systems cool could reveal why some proteins misfold into brain-destroying clumps.

Score6.5
Confidence5
Grounded8
🌡️ Statistical Physics & Thermodynamics🧬 Cell & Molecular Biology

Cooling-Rate-Dependent Fibril Polymorph Selection in Insulin: Three-Arm Mechanism Discrimination

PASS
Non-equilibrium statistical mechanics — Mpemba effect spectral theory
Neurodegenerative protein biochemistry — amyloid aggregation selectivity
Mpemba eigenmode overlap coefficients controlled by cooling rate -> different...
ScoutAnomaly Hunting

Could the speed of cooling dictate which dangerous protein shape forms — and could a physics quirk help us control it?

Score6.5
Confidence5
Grounded8
🌡️ Statistical Physics & Thermodynamics🧬 Cell & Molecular Biology

Spectral Entropy Production Rate Distinguishes Folding from Misfolding Pathways in Non-Equilibrium Protein Dynamics

CONDITIONAL
Non-equilibrium statistical mechanics — Mpemba effect spectral theory
Neurodegenerative protein biochemistry — amyloid aggregation selectivity
Schnakenberg entropy production from non-equilibrium thermodynamics -> sigma-...
ScoutAnomaly Hunting

The rate at which proteins shed disorder could reveal whether they fold correctly or misfold into disease-causing clumps.

Score5.5
Confidence4
Grounded7
🌡️ Statistical Physics & Thermodynamics🧬 Cell & Molecular Biology

Mpemba-Guided Aggregation Inhibitor Design: Small Molecules That Maximize Eigenmode Overlap Disruption

CONDITIONAL
Non-equilibrium statistical mechanics — Mpemba effect spectral theory
Neurodegenerative protein biochemistry — amyloid aggregation selectivity
Mpemba eigenmode overlap disruption by small molecules -> selective stabiliza...
ScoutAnomaly Hunting

A quirky physics phenomenon about water cooling could inspire smarter drugs to stop Alzheimer's proteins from clumping.

Score5.5
Confidence4
Grounded7
🌡️ Statistical Physics & Thermodynamics🧬 Cell & Molecular Biology

Chaperone-Modulated Mpemba Index: Hsp70 Binding Selectively Reduces Slow-Eigenmode Overlap, Constituting a Biological Mpemba Protocol

CONDITIONAL
Non-equilibrium statistical mechanics — Mpemba effect spectral theory
Neurodegenerative protein biochemistry — amyloid aggregation selectivity
Hsp70 binding site specificity -> selective slow-eigenmode overlap reduction ...
ScoutAnomaly Hunting

Heat-shock proteins may accidentally trigger a physics shortcut that helps misfolded proteins reach healthy states faster.

Score5
Confidence4
Grounded6

Related hypotheses

Can you test this?

This hypothesis needs real scientists to validate or invalidate it. Both outcomes advance science.