Excess Entropy Production as a Candidate Universal Cost of Persistence: A Thermodynamic Foundation for the Attractor Framework; Robert Galida (July 2026) [F]

Abstract

Every dissipative system maintains its attractor through continuous reconfiguration. Reconfiguration requires work; work generates entropy. The recovery rate κκ — corrective permeability — is the rate at which a system reconfigures to return to its attractor after perturbation. This paper proposes that κκ is a measure of excess entropy generation rate.

We develop an abstract persistence cost framework and prove its equivalence to Lyapunov theory. We then identify entropy production as a physical realization of this cost, deriving:κ=infxδ(x)0σexcess(ϕt(x))dtκ=xinf​∫0∞​σexcess​(ϕt​(x))dtδ(x)​

where σexcess=σσssσexcess​=σσss​ is the excess entropy production rate above the system’s steady-state baseline. For physical systems, the baseline is zero (equilibrium); for biological, cognitive, and social systems, the baseline is the steady-state dissipation rate of the healthy, well-coordinated attractor.

This unifies physical, biological, cognitive, and social systems. The framework is grounded in the second law of thermodynamics and non-equilibrium steady-state thermodynamics, not analogy. Empirical predictions are provided for each domain.

Keywords: entropy generation, excess entropy production, corrective permeability, attractor framework, dissipative structures, reconfiguration, Lyapunov theory, free energy principle, allostatic load


1. Introduction

The attractor framework defines persistence as the ability of a system to maintain its attractor under perturbation. Historically, persistence has been measured kinematically — as distance traveled or time spent away from equilibrium. This paper proposes that the true cost of persistence is thermodynamic: it is the excess entropy generated during reconfiguration and recovery.

Every dissipative system maintains its attractor through continuous reconfiguration. A bacterium reconfigures its metabolism to maintain homeostasis. A brain reconfigures its synaptic connections to maintain predictive models. A society reconfigures its institutions to maintain order. Reconfiguration requires work; work generates entropy. The second law of thermodynamics applies at every level of organization.

We develop an abstract persistence cost framework first, establishing its equivalence to Lyapunov theory. We then identify entropy production as a physical realization of this cost, deriving the relationship between corrective permeability and excess entropy generation.

The framework unifies physical, biological, cognitive, and social systems. It is grounded in the second law of thermodynamics and non-equilibrium steady-state thermodynamics, not analogy.


2. The Persistence Cost Functional

Let XX be a state space, ϕt(x)ϕt​(x) the flow of a dynamical system, and AXA⊆X an attractor set. Let δ(x)=d(x,A)δ(x)=d(x,A) be the distance from xx to the attractor. For a treatment of state-space constraints in viability theory, see Aubin (1991).

Definition 1 (Persistence Cost Functional): A persistence cost functional C(x)C(x) is a scalar function on XX satisfying:

  1. C(x)0C(x)≥0 for all xx
  2. C(x)=0C(x)=0 if and only if xAx∈A
  3. C(ϕt(x))L1([0,))C(ϕt​(x))∈L1([0,∞)) for all xx in the basin

Definition 2 (Cumulative Persistence Cost): For a finite horizon T>0T>0:DT(x)=0TC(ϕt(x))dtDT​(x)=∫0TC(ϕt​(x))dt

For trajectories that converge to the attractor:D(x)=0C(ϕt(x))dtD∞​(x)=∫0∞​C(ϕt​(x))dt


3. Existence and Lyapunov Equivalence

Theorem 1 (Existence of the Persistence Functional): Assume C(x)0C(x)≥0, C=0C=0 only on AA, and C(ϕt(x))L1([0,))C(ϕt​(x))∈L1([0,∞)) for all xx in the basin. Assume ff is locally Lipschitz, the flow is continuously differentiable in the initial condition, and CC is continuous and locally bounded. Then:

  1. D(x)=0C(ϕt(x))dtD∞​(x)=∫0∞​C(ϕt​(x))dt exists and is finite.
  2. DD∞​ is continuous.
  3. DD∞​ satisfies the transport equation:

D(x)f(x)=C(x)D∞​(x)⋅f(x)=−C(x)

Proof: The integral exists and is finite by the L1L1 assumption. Continuity follows from the dominated convergence theorem under the stated regularity assumptions. To derive the transport equation, compute:D(ϕh(x))=hC(ϕt(x))dt=D(x)0hC(ϕt(x))dtD(ϕh​(x))=∫h∞​C(ϕt​(x))dt=D(x)−∫0hC(ϕt​(x))dt

Then:D(ϕh(x))D(x)h=1h0hC(ϕt(x))dtC(x)hD(ϕh​(x))−D(x)​=−h1​∫0hC(ϕt​(x))dt→−C(x)

as h0h→0. By the chain rule:D(x)f(x)=C(x)D(x)⋅f(x)=−C(x)

Corollary (Equivalence to Lyapunov Theory): Any Lyapunov function V(x)V(x) (with V0V≥0, V=0V=0 on the attractor, and V˙0V˙≤0) yields a persistence cost C(x)=V˙(x)C(x)=−V˙(x). Conversely, any persistence cost C(x)C(x) satisfying Df=CDf=−C defines a Lyapunov function D(x)D(x).

Proof: If VV is a Lyapunov function, then V˙=Vf0V˙=∇Vf≤0. Define C=V˙C=−V˙. Then C0C≥0, C=0C=0 on the attractor, and DT=C=V(x)V(ϕT(x))DT​=∫C=V(x)−V(ϕT​(x)). Conversely, if Df=CDf=−C, then D˙=C0D˙=−C≤0, so DD is a Lyapunov function.

Interpretation: The persistence cost framework is mathematically equivalent to classical Lyapunov stability theory. For the connection to contraction analysis, see Lohmiller & Slotine (1998). For control Lyapunov functions, see Freeman & Kokotovic (1996). Entropy production is one physically meaningful realization of the cost function CC. For a detailed treatment of Lipschitz continuity of DD∞​ under a Lipschitz-flow hypothesis, see Galida (2026a), Proposition 4.


4. Entropy Production as Persistence Cost

4.1 Entropy Balance

For an open system, the entropy balance equation is:dSsystemdt=σΦdtdSsystem​​=σ−Φ

where σ0σ≥0 is the entropy production rate (always non-negative by the second law) and ΦΦ is the entropy export rate to the environment. For foundational treatments of stochastic thermodynamics and entropy production, see Seifert (2012) and Sekimoto (2010).

For a system in a steady state:dSsystemdt=0    σ=ΦdtdSsystem​​=0⟹σ

4.2 Excess Entropy Production

Define the steady-state entropy production rate σssσss​ as the rate when the system is at its attractor.

Define the excess entropy production rate:σexcess(x)=σ(x)σss(x)σexcess​(x)=σ(x)−σss​(x)

Assumption (Excess Entropy Decay): For all trajectories in the basin, there exist constants C<C<∞ and μ>0μ>0 such that:σexcess(ϕt(x))Ceμtσexcess(x)σexcess​(ϕt​(x))≤Ceμtσexcess​(x)

for all t0t≥0. This ensures D(x)<D∞​(x)<∞ and is the standard hypothesis under which the persistence functional and its associated bounds are well-defined, consistent with Galida (2026a, 2026b). The decay rate μμ may be domain-specific and is empirically measurable.

Note on generalization: The exponential decay assumption is adopted here to ensure finiteness of DD∞​ and to maintain consistency with the prior papers in this series. Generalization to L1L1 integrable decays (e.g., algebraic) is a priority for future work.

4.3 The Entropy Persistence Functional

Definition 3 (Cumulative Excess Entropy Functional): For a finite horizon T>0T>0:DT(x)=0Tσexcess(ϕt(x))dtDT​(x)=∫0Tσexcess​(ϕt​(x))dt

For trajectories that converge to the attractor:D(x)=0σexcess(ϕt(x))dtD∞​(x)=∫0∞​σexcess​(ϕt​(x))dt

Interpretation: The persistence functional is the total excess entropy generated during reconfiguration and recovery.

4.4 Corrective Permeability

Definition 4 (Corrective Permeability):κ=infxBAδ(x)D(x)κ=x∈B∖Ainf​D∞​(x)δ(x)​

where δ(x)=d(x,A)δ(x)=d(x,A) is the distance to the attractor.

Interpretation: κκ is the minimum excess entropy cost per unit distance. It measures the efficiency of reconfiguration: a system that returns with minimal excess entropy generation has high κκ; a system that generates excess entropy has low κκ.


4.5 Basin Depth

Proposition 1 (Properties of Basin Depth): Define B=D(saddle)B=D∞​(saddle), where saddlesaddle is the lowest point on the basin boundary (the separatrix between attractors). For the connection to large-deviation theory and escape rates, see Freidlin & Wentzell (2012). Then:

  1. B0B≥0, with equality iff the basin has no barrier (i.e., the boundary coincides with the attractor).
  2. For gradient systems x˙=V(x)x˙=−∇V(x), B=V(saddle)V(A)B=V(saddle)−V(A) (the classical energy barrier).
  3. BB is invariant under smooth coordinate changes (coordinate invariance).
  4. BB depends on the chosen persistence cost functional CC; different costs yield different barriers.

Proof: (1) follows from non-negativity of DD∞​. (2) follows from the transport equation Df=CDf=−C and the identity f=Vf=−∇V. (3) follows from the invariance of the integral under diffeomorphisms. (4) is self-evident.


5. Domain-Specific Realizations

5.1 Physical Systems: Thermodynamic Excess Entropy

For a thermodynamic system, S(x)=kBlogΩ(x)S(x)=kB​logΩ(x), where Ω(x)Ω(x) is the number of microstates. For an isolated system, σss=0σss​=0 (equilibrium), so σexcess=σ=S˙σexcess​=σ=S˙.κ=infxδ(x)S(A)S(x)κ=xinf​S(A)−S(x)δ(x)​

Example: A gas returning to equilibrium after compression. The entropy generated is ΔS=nRlog(Vf/Vi)ΔS=nRlog(Vf​/Vi​).

5.2 Biological Systems: Metabolic Excess Entropy

For a biological system, S(x)S(x) is the metabolic entropy. The baseline σssσss​ is the resting metabolic rate (homeostasis). The excess is:σexcess=metabolic rateresting metabolic rateσexcess​=metabolic rate−resting metabolic rateκ=infxδ(x)0σexcess(ϕt(x))dtκ=xinf​∫0∞​σexcess​(ϕt​(x))dtδ(x)​

Example: A cell returning to homeostasis after a nutrient shock. The excess entropy generated is the metabolic cost of restoring homeostasis above baseline. For the dissipative-structures framework underlying biological self-organization, see Nicolis & Prigogine (1989).

5.3 Cognitive Systems: Free Energy Dissipation

For a cognitive system, variational free energy F=logp(yx)+DKL[q()p(x)]F=−logp(yx)+DKL​[q(⋅)∥p(⋅∣x)] is adopted here as one candidate persistence functional. We do not claim variational free energy is uniquely correct; it is adopted as the most developed existing candidate persistence functional for cognitive systems. Other candidates (Bayesian surprise, expected free energy, predictive information) are possible; this paper focuses on FF due to its established role in the free-energy principle (Friston, 2010). For the thermodynamics of information and its connection to free-energy minimization, see Parrondo, Horowitz & Sagawa (2015) and Sagawa & Ueda (2008).

The baseline σssσss​ is the baseline neural dissipation rate (resting brain activity). The excess is:σexcess=F˙F˙ssσexcess​=F˙−F˙ssκ=infxδ(x)0σexcess(ϕt(x))dtκ=xinf​∫0∞​σexcess​(ϕt​(x))dtδ(x)​

Example: A cognitive system updating its beliefs after a prediction error. The excess entropy generated is the free energy dissipated during belief updating above baseline.

5.4 Social Systems: Coordination Excess Entropy

For a social system, define the aggregate social entropy production rate as:σsocial(t)=i(S˙i(t)S˙irest)σsocial(t)=i∑​(S˙i​(t)−S˙irest​)

where S˙i(t)S˙i​(t) is the total entropy production rate of individual ii, and S˙irestS˙irest​ is the individual’s baseline entropy production rate in a resting, minimally socially constrained state. This is measured via physiological proxies such as basal metabolic rate, resting allostatic load, or cortisol baseline (McEwen, 1998; Sterling & Eyer, 1988).

Interpretation: σsocialσsocial measures the excess dissipation attributable to social constraints: the additional entropy generated by coordination, communication, conflict, norm enforcement, and institutional friction.

Non-Negativity: Unlike total entropy production S˙i0S˙i​≥0 (which follows from the second law), σisocialσisocial​ is not guaranteed to be non-negative. Division of labor, infrastructure, and specialization may reduce an individual’s metabolic burden relative to a solitary baseline. The hypothesis is that during recovery from social disruption, σisocial0σisocial​≥0; in steady-state, σisocial0σisocial​→0. This is an empirical claim, not a theorem.

The baseline σssσss​ is the steady-state social entropy production rate (well-coordinated society). The excess is:σexcess=σsocialσssσexcess​=σsocial−σssκ=infxδ(x)0σexcess(ϕt(x))dtκ=xinf​∫0∞​σexcess​(ϕt​(x))dtδ(x)​

Example: A society recovering from a shock (economic crisis, political upheaval). The excess entropy generated is the coordination cost of restructuring above baseline. A harmonious society has σexcess=0σexcess​=0; a turbulent society has σexcess>0σexcess​>0; a chronically turbulent society may have settled into a new attractor with a higher σssσss​. This illustrates the framework’s central distinction: the attractor is the state of minimum entropy generation for that class of system.


6. The Unified Framework

6.1 Summary Table

Domain Entropy Functional Baseline σssσss Excess σexcessσexcess​ Recovery Rate κκ
Physical Thermodynamic entropy 0 (equilibrium) S˙S˙ infδΔSinfΔSδ
Biological Metabolic entropy Resting metabolic rate Metabolic rate — resting infδσexcessdtinf∫σexcess​dtδ
Cognitive Free energy Baseline neural dissipation F˙F˙ssF˙−F˙ss infδσexcessdtinf∫σexcess​dtδ
Social Social entropy production Steady-state social dissipation σsocialσssσsocial−σss infδσexcessdtinf∫σexcess​dtδ

6.2 The Universal Structure

Every domain follows the same mathematical structure:

Component Expression
Excess entropy production σexcess(x)=σ(x)σssσexcess​(x)=σ(x)−σss
Cumulative cost D(x)=0σexcess(ϕt(x))dtD∞​(x)=∫0∞​σexcess​(ϕt​(x))dt
Recovery rate κ=infxδ(x)/D(x)κ=infxδ(x)/D∞​(x)
Basin depth B=D(saddle)B=D∞​(saddle)
Transport equation Df=σexcessDf=−σexcess​

6.3 The Low-Energy Attractor Benchmark (Proposed Hypothesis)

We propose the following benchmark as an additional hypothesis: the attractor is the state of minimum entropy generation for that class of system.

Domain Attractor Entropy Generation at Attractor
Physical Equilibrium σ=0σ=0
Biological Homeostasis σ=σss>0σ=σss​>0 (resting metabolism)
Cognitive Settled Belief σ=σss>0σ=σss​>0 (baseline neural dissipation)
Social Coordinated Order σ=σss>0σ=σss​>0 (baseline institutional friction)

Interpretation:

  1. For equilibrium systems (gases, isolated systems), the attractor is the state where entropy generation reaches zero — the system has nowhere lower to go.
  2. For dissipative systems (cells, brains, societies), the attractor is the state where entropy generation reaches its lowest non-zero steady-state value — the minimum entropy generation the system can sustain while maintaining its functional organization.

Important caveats:

  • This is a proposed benchmark, not a derived theorem.
  • For cognitive systems in particular, minimizing entropy production rate (a thermodynamic quantity) and minimizing free energy/surprise (the actual claim in the free-energy principle) are distinct minimization principles. The framework does not establish a bridge between them; this is an open question.
  • The benchmark is an empirical hypothesis that requires domain-specific validation.

In all cases, the attractor is the lowest entropy-generating state that system can have while remaining itself.


7. Testable Predictions

7.1 Core Prediction

Prediction: The recovery rate κκ is inversely proportional to the excess entropy generated during reconfiguration:κ1DκD∞​1​

Falsification: If a system returns to its attractor with high excess entropy generation but high recovery rate, the prediction is falsified.

7.2 Secondary Prediction

Prediction: Systems that maintain their attractor with minimal excess entropy generation are more “efficient.” Systems that generate excess entropy are “inefficient” or “stressed.”

Falsification: If an inefficient system has lower excess entropy generation than an efficient system, the prediction is falsified.

7.3 Domain-Specific Predictions

Domain Prediction Falsification
Physical κκ correlates with thermal efficiency κκ high but efficiency low
Biological κκ correlates with metabolic efficiency κκ high but metabolic cost high
Cognitive κκ correlates with learning efficiency κκ high but learning cost high
Social κκ correlates with institutional efficiency κκ high but coordination cost high

8. Experimental Design

8.1 Physical Systems

  • System: Gas in a piston
  • Perturbation: Compression
  • Measurement: Excess entropy generation (heat measurement) and recovery time
  • Test: Correlation between κκ and 1/D1/D∞​

8.2 Biological Systems

  • System: Cell culture
  • Perturbation: Nutrient shock
  • Measurement: Metabolic rate above resting (oxygen consumption) and recovery time
  • Test: Correlation between κκ and metabolic cost

8.3 Cognitive Systems

  • System: Human participants in a learning task
  • Perturbation: Prediction error
  • Measurement: Free energy dissipation above baseline (EEG complexity, pupil dilation) and belief updating rate
  • Test: Correlation between κκ and free energy dissipation

8.4 Social Systems

  • System: Institutional response to shocks
  • Perturbation: Economic or political crisis
  • Measurement: Social entropy production above baseline (allostatic load, cortisol, institutional friction) and recovery time
  • Test: Correlation between κκ and social entropy production

9. Open Questions

Question Status Difficulty
Q1: Uniqueness of S(x)S(x) Are there multiple valid entropy functionals for a given domain? Hard
Q2: Variational principle Is there a universal variational principle that yields S(x)S(x)? Hard
Q3: Social second law Does σsocial0σsocial≥0 always hold during recovery? Very Hard
Q4: Cross-level entropy How does entropy generation at one level relate to entropy generation at another? Hard
Q5: Measurement Can we measure excess entropy generation in cognitive and social systems directly? Moderate
Q6: Unification Can all domain-specific entropy functionals be derived from a single universal functional? Very Hard

10. Conclusion

Every dissipative system maintains its attractor through continuous reconfiguration. Reconfiguration requires work; work generates excess entropy. The recovery rate κκ — corrective permeability — is the rate at which a system reconfigures to return to its attractor after perturbation. We have proposed that κκ is a measure of excess entropy generation rate.

We developed an abstract persistence cost framework and proved its equivalence to Lyapunov theory. We then identified entropy production as a physical realization of this cost, deriving:κ=infxδ(x)0σexcess(ϕt(x))dtκ=xinf​∫0∞​σexcess​(ϕt​(x))dtδ(x)​

where σexcess=σσssσexcess​=σσss​ is the excess entropy production rate above the system’s steady-state baseline — thermodynamic entropy for physical systems, metabolic entropy for biological systems, free energy dissipation for cognitive systems, and social entropy production for social systems.

We proposed a unified benchmark: the attractor is the state of minimum entropy generation for that class of system — zero for equilibrium systems, non-zero steady-state for dissipative systems. This provides a unified criterion for identifying attractors across domains: an attractor is a state from which the system cannot reduce its entropy generation further without losing its defining structure or function.

This unifies physical, biological, cognitive, and social systems. In each domain, persistence requires reconfiguration; reconfiguration generates excess entropy; κκ measures the entropy cost of that reconfiguration. The framework is grounded in the second law of thermodynamics and non-equilibrium steady-state thermodynamics, not analogy.

Social Application: The framework provides a thermodynamic interpretation of social dynamics: harmony is a low-entropy attractor state; turbulence is a high-entropy state generated by excess dissipation during reconfiguration. The recovery rate κκ measures how efficiently a society transitions from turbulence back to harmony — that is, how quickly it reduces its excess entropy production to zero.


11. Limitations

This paper establishes an abstract persistence cost framework with a proposed thermodynamic realization. Several limitations should be explicitly acknowledged:

  1. Uniqueness. Entropy production is not proved to be the unique persistence cost. Many positive functionals C(x)C(x) satisfy Df=CDf=−C. The identification of entropy production as the canonical cost is a physically motivated hypothesis, not a mathematical theorem.
  2. Scope. The framework does not imply that all domains obey thermodynamics literally. The cognitive and social realizations are proposed hypotheses requiring empirical validation.
  3. Decay assumption. Exponential decay of σexcessσexcess​ is a sufficient assumption to ensure finiteness of DD∞​, not a necessary one. Generalization to L1L1 integrable decays (e.g., algebraic) is a priority for future work.
  4. Basin depth. Basin depth B=D(saddle)B=D∞​(saddle) is defined in terms of the persistence cost functional. Its relationship to classical energy barriers is established only for gradient systems.
  5. Empirical validation. The predictions of the framework — particularly the inverse relationship between κκ and DD∞​ — remain to be tested empirically across domains.
  6. Low-energy attractor benchmark. The benchmark proposed in §6.3 is a hypothesis, not a derived theorem. For cognitive systems, it risks conflating thermodynamic entropy production with free-energy minimization — distinct principles whose relationship remains open.

References

Aubin, J. P. (1991). Viability Theory. Birkhäuser.

Boltzmann, L. (1877). “Über die Beziehung zwischen dem zweiten Hauptsatz der mechanischen Wärmetheorie und der Wahrscheinlichkeitsrechnung.” Wiener Berichte, 76, 373-435.

Clausius, R. (1865). “Über verschiedene für die Anwendung bequeme Formen der Hauptgleichungen der mechanischen Wärmetheorie.” Annalen der Physik, 125(7), 353-400.

Freeman, R. A., & Kokotovic, P. V. (1996). Robust Nonlinear Control Design: State-Space and Lyapunov Techniques. Birkhäuser.

Freidlin, M. I., & Wentzell, A. D. (2012). Random Perturbations of Dynamical Systems (3rd ed.). Springer.

Friston, K. (2010). “The free-energy principle: a unified brain theory?” Nature Reviews Neuroscience, 11(2), 127-138.

Galida, R. (2026a). “The Persistence Functional: A Candidate Formal Foundation for the Attractor Framework.” Fantasy Attractor.

Galida, R. (2026b). “Deriving Corrective Permeability from the Cumulative Deviation Functional.” Fantasy Attractor.

Jaynes, E. T. (1957). “Information Theory and Statistical Mechanics.” Physical Review, 106(4), 620-630.

Khalil, H. K. (2002). Nonlinear Systems (3rd ed.). Prentice Hall.

Kondepudi, D., & Prigogine, I. (1998). Modern Thermodynamics: From Heat Engines to Dissipative Structures. Wiley.

Lohmiller, W., & Slotine, J. J. E. (1998). “On contraction analysis for non-linear systems.” Automatica, 34(6), 683-696.

Lyapunov, A. M. (1892). The General Problem of the Stability of Motion.

McEwen, B. S. (1998). “Stress, Adaptation, and Disease: Allostasis and Allostatic Load.” Annals of the New York Academy of Sciences, 840(1), 33-44.

Nicolis, G., & Prigogine, I. (1989). Exploring Complexity: An Introduction. W. H. Freeman.

Parrondo, J. M. R., Horowitz, J. M., & Sagawa, T. (2015). “Thermodynamics of information.” Nature Physics, 11(2), 131-139.

Prigogine, I. (1947). Étude Thermodynamique des Phénomènes Irréversibles. Dunod.

Prigogine, I., & Nicolis, G. (1977). Self-Organization in Non-Equilibrium Systems. Wiley.

Sagawa, T., & Ueda, M. (2008). “Second law of thermodynamics with discrete quantum feedback control.” Physical Review Letters, 100(8), 080403.

Seifert, U. (2012). “Stochastic thermodynamics, fluctuation theorems and molecular machines.” Reports on Progress in Physics, 75(12), 126001.

Sekimoto, K. (2010). Stochastic Energetics. Springer.

Shannon, C. E. (1948). “A Mathematical Theory of Communication.” Bell System Technical Journal, 27(3), 379-423.

Sterling, P., & Eyer, J. (1988). “Allostasis: A New Paradigm to Explain Arousal Pathology.” In Handbook of Life Stress, Cognition and Health, 629-649.


Suggested citation: Galida, R. S. (2026). Excess Entropy Production as a Candidate Universal Cost of Persistence: A Thermodynamic Foundation for the Attractor Framework. Fantasy Attractor.




The Persistence Functional: A Candidate Formal Foundation for the Attractor Framework; Robert Galida (July 2026) [F]

Abstract

The attractor framework provides a domain-general vocabulary for describing persistence and change across physical, biological, cognitive, and social systems. However, its core variables—κκ (corrective permeability), BB (basin depth), and RR (reality alignment)—have been defined inconsistently across application papers, and their formal relationships have remained implicit. This paper proposes a candidate mathematical formalization for the framework.

The central mathematical innovation of this paper is treating persistence as a functional defined over trajectories—DT(x)=0Td(ϕτ(x),A)dτDT​(x)=∫0Td(ϕτ​(x),A)dτ—rather than as a scalar property of states. We prove several mathematical properties of DTDT​, including non-negativity, monotonicity in TT, additivity, Lipschitz continuity with respect to initial conditions, and a bound relating DD∞​ to the recovery rate κκD(x)Cκd(x,A)D∞​(x)≤κCd(x,A). We establish connections to dynamic programming and ergodic theory via occupation measures. We introduce a complementary topological persistence functional Ptopo(t)Ptopo​(t), which measures the lifetime of topological features in the trajectory’s state-space geometry, and the topological evolution rate E(t)E(t).

We unify the framework’s variable set: κκ is the recovery rate (operationalized as 1/τ1/τ); γγ is a proposed drift rate for persistent chaos, grounded in the literature on high-dimensional neural networks; BB is the energy barrier (basin depth); B~B~ is a complementary persistence depth; RR is the expected log predictive likelihood. We propose testable predictions linking E(t)E(t) to κκ and γγ, and provide a falsifiable experimental protocol using neural network training and persistent homology.

The paper offers a candidate formal foundation, with explicit definitions, mathematical properties, and empirical grounding. All unverified sources are clearly labeled as such.

Keywords: attractor framework, persistence functional, cumulative deviation, topological persistence, corrective permeability, basin depth, reality alignment, persistent homology


1. Introduction

The attractor framework has been applied across physics (hydrogen decay, Jeans instability), biology (ECM mechanics, HRV), cognition (belief updating, performance attractors), and social systems (religious attractors, civilizational dynamics). A common vocabulary has emerged: κκ (corrective permeability), BB (basin depth), and RR (reality alignment). However, these variables have been defined inconsistently across papers, and their formal relationships have remained implicit. This paper proposes a candidate mathematical formalization that addresses these inconsistencies.

The central mathematical innovation of this paper is treating persistence as a functional defined over trajectories rather than as a scalar property of states. DT(x)=0Td(ϕτ(x),A)dτDT​(x)=∫0Td(ϕτ​(x),A)dτ can be understood as a type of action functional (carefully qualified). Like the classical action L(q,q˙)dtL(q,q˙​)dt, it assigns a scalar to an entire trajectory, is additive under concatenation, and suggests variational and optimal-control interpretations. However, it is not the mechanical action; it is a cumulative deviation functional that measures time away from equilibrium. This moves the framework into the domain of trajectory-level analysis, aligning it with modern dynamical systems and geometric control theory.

We introduce the cumulative deviation functional DT(x)DT​(x) as this central object, and we establish its mathematical properties, including its relationship to the recovery rate κκ. We introduce a complementary topological persistence functional Ptopo(t)Ptopo​(t) and the topological evolution rate E(t)E(t). We unify the framework’s variable set with operational definitions and propose testable predictions with falsification criteria.

1.1 Scope and Status

This paper is a candidate formalization—it provides definitions, mathematical properties, and empirical hypotheses. It is not a completed empirical validation; that is the subject of future work. All claims are labeled as definitions (part of the formal structure), propositions/theorems (proved), hypotheses (testable predictions), or heuristics (suggestive connections not yet formalized). This distinction is maintained throughout.


2. Formal Definitions

Let XX be a metric space with distance function ∥⋅∥. Let ϕτ(x)ϕτ​(x) be the flow of a dynamical system starting from state xXx∈X at time τ=0τ=0. Let AXA⊆X be an attractor set (a compact, invariant set to which trajectories converge). Assume the flow is continuous and measurable so that d(ϕτ(x),A)d(ϕτ​(x),A) is measurable. The flow ϕτϕτ​ satisfies the semigroup property ϕt+s=ϕtϕsϕt+s​=ϕt​∘ϕs​ for all t,s0t,s≥0, with ϕ0=idϕ0​=id. We assume d(ϕτ(x),A)L1([0,T])d(ϕτ​(x),A)∈L1([0,T]) for all finite TT, so the integral defining DTDT​ is well-defined.

Define the distance from a point to the attractor:d(x,A)=infaAxad(x,A)=a∈Ainf​∥xa

The definition applies to any metric space; for infinite-dimensional spaces, the usual measurability and integrability conditions are assumed.

2.1 Cumulative Deviation Functional

Definition 1 (Cumulative Deviation Functional): For a finite horizon T>0T>0, the cumulative deviation functional is:DT(x)=0Td(ϕτ(x),A)dτDT​(x)=∫0Td(ϕτ​(x),A)dτ

Interpretation: DT(x)DT​(x) is the total accumulated deviation from the attractor over the interval [0,T][0,T]. It measures integrated error, residence-time-weighted distance, or accumulated regret. This is not a path length; it measures time spent away from equilibrium, whereas path length ϕ˙τ(x)dτ∫∥ϕ˙​τ​(x)∥dτ measures distance traveled.

Domain generality: This definition applies to any system with a well-defined state space, a flow, and an attractor set. It does not require linearity, differentiability, or specific functional forms.

Empirical note: DTDT​ is the fundamental object for empirical work; DD∞​ is primarily an analytical limit used for theoretical bounds.

Note: DTDT​ is not a Lyapunov function. A Lyapunov function is a scalar function of the current state; DTDT​ is a functional of the entire trajectory. It does not decrease monotonically along trajectories, and it does not provide pointwise stability information. Its purpose is to measure accumulated history, not instantaneous energy.

Occupation measure connection: Define the occupation measure of the trajectory up to time TT as:μT(B)=0T1B(ϕτ(x))dτμT​(B)=∫0T1B​(ϕτ​(x))dτ

for measurable BXB⊆X. Then:DT(x)=Xd(y,A)dμT(y)DT​(x)=∫X​d(y,A)dμT​(y)

Thus DTDT​ is the expected distance to the attractor under the occupation measure. This connects the functional directly to ergodic theory and occupation measure analysis. For foundational treatments of occupation measures and invariant measures, see Ruelle (1989) and Bowen (1975).


2.1.1 Why the L¹ Trajectory Functional?

The choice of the L¹ integral over alternatives is motivated by the following properties:

  • Linearity: Each moment contributes equally; accumulation is additive over time.
  • Physical units: For systems with a natural distance metric, DTDT​ has units of distance × time, which is interpretable as accumulated deviation.
  • Simplicity: It is the simplest nontrivial trajectory functional that is not a path length.
  • Analogy: It mirrors cumulative regret and occupation measures in control theory and ergodic theory.
  • Avoidance of overweighting: Unlike d2d2, it does not disproportionately weight large deviations; unlike max, it is sensitive to the full trajectory.

This is one natural choice; other functionals (e.g., dpdp, exponentially weighted integrals) could be substituted without changing the framework’s structure.


2.2 Topological Persistence Functional

Let Xτ={ϕs(x):s[0,τ]}Xτ​={ϕs​(x):s∈[0,τ]} be the trajectory segment up to time ττ. Let PHk(Xτ)PHk​(Xτ​) be the kk-dimensional persistent homology of the point cloud XτXτ​ at scale ϵϵ. Each feature (component, loop, void) has a birth scale bb and a death scale dd, with persistence dbdb. For foundational treatments of persistent homology, see Edelsbrunner & Harer (2010) or Carlsson (2009).

Definition 2 (Topological Persistence Functional): We define the following complementary topological persistence functional. For t0t≥0:Ptopo(t)=0tk0(b,d)PHk(Xτ)(db)dτPtopo​(t)=∫0tk≥0∑​(b,d)∈PHk​(Xτ​)∑​(db)dτ

The map τPHk(Xτ)τ↦PHk​(Xτ​) is piecewise constant on intervals where the trajectory does not cross a homology-critical threshold. Assuming the trajectory crosses such thresholds at discrete times, the integral is well-defined as a sum of piecewise continuous segments. This is the standard assumption in time-varying persistent homology (see Carlsson & Zomorodian, 2009).

Interpretation: Ptopo(t)Ptopo​(t) is the total lifetime of all topological features in the trajectory’s state-space geometry up to time tt. This is a separate mathematical object from DTDT​; the relationship between them is an empirical hypothesis. This is one possible choice among several topological summaries (e.g., persistence landscapes, persistence images) and is selected because it mirrors the cumulative interpretation of DTDT​, rather than because it is uniquely canonical. Other stable summaries—such as persistence landscapes, persistence images, or Betti curves—could be substituted for the present functional without changing the framework’s structure.

Measurement: In practice, Ptopo(t)Ptopo​(t) is computed by sampling the trajectory at discrete times, computing persistent homology on latent activation manifolds, and summing the persistence of all features using standard libraries (e.g., GUDHI, Ripser). Turner & Barak (2023) demonstrated that trained RNNs develop attractors sequentially during training; the topological structure of these attractors can be analyzed using persistent homology.

Falsification: If persistent homology features do not correlate with any behavioral or dynamical measure in a given system, PtopoPtopo​ is not a useful construct for that domain.


2.3 Topological Evolution Rate

Definition 3 (Topological Evolution Rate): For a learning system with time-dependent topological persistence, the topological evolution rate is defined as:E(t)=ddtPtopo(t)E(t)=dtdPtopo​(t)

where differentiable, and experimentally as E(t)ΔPtopoΔtE(t)≈ΔtΔPtopo​​ over finite intervals.

Interpretation: E(t)E(t) measures how quickly the system’s topological complexity changes during learning. Negative E(t)E(t) indicates topological simplification (compression); positive E(t)E(t) indicates increasing complexity (expansion); E(t)0E(t)≈0 indicates stagnation. Learning is one possible cause of topological change; random drift, noise, or chaotic wandering can also change topology.

Empirical anchor: Karuppiah, Nazreen Banu et al. (2026) examine the evolution of topological signatures during training. Turner & Barak (2023) show that RNNs develop attractors sequentially, which may correspond to phases of topological simplification. We hypothesize that successful learning corresponds to negative average values of E(t)E(t) over defined phases, but this is a testable claim, not a definition.


3. Mathematical Properties of the Cumulative Deviation Functional

This section establishes the mathematical behavior of DTDT​, providing the foundation for its use in the framework.

3.1 Non-negativity

Proposition 1 (Non-negativity): For any xXx∈X and any T0T≥0:DT(x)0DT​(x)≥0

with equality iff ϕτ(x)Aϕτ​(x)∈A for almost all τ[0,T]τ∈[0,T].

Proof: The integrand is a distance function d(ϕτ(x),A)d(ϕτ​(x),A), which is non-negative by definition. The integral of a non-negative function is non-negative. Equality holds only if the integrand is zero almost everywhere.


3.2 Monotonicity in TT

Proposition 2 (Monotonicity): For fixed xxDT(x)DT​(x) is monotonically non-decreasing in TT:DT2(x)DT1(x)for T2T1DT2​​(x)≥DT1​​(x)for T2​≥T1​

Proof: For T2T1T2​≥T1​:DT2(x)=0T1d(ϕτ(x),A)dτ+T1T2d(ϕτ(x),A)dτDT2​​(x)=∫0T1​​d(ϕτ​(x),A)dτ+∫T1​T2​​d(ϕτ​(x),A)dτ

The second integral is non-negative by Proposition 1. Therefore DT2(x)DT1(x)DT2​​(x)≥DT1​​(x).

Corollary: If the trajectory converges exactly to the attractor at time τ0<Tτ0​<T, then:DT(x)=Dτ0(x)for all Tτ0DT​(x)=Dτ0​​(x)for all Tτ0​


3.3 Additivity

Proposition 3 (Additivity): For any T,S0T,S≥0:DT+S(x)=DT(x)+DS(ϕT(x))DT+S​(x)=DT​(x)+DS​(ϕT​(x))

Proof:DT+S(x)=0T+Sd(ϕτ(x),A)dτ=0Td(ϕτ(x),A)dτ+TT+Sd(ϕτ(x),A)dτ=DT(x)+0Sd(ϕτ+T(x),A)dτ=DT(x)+0Sd(ϕτ(ϕT(x)),A)dτ(by the semigroup property)=DT(x)+DS(ϕT(x))DT+S​(x)​=∫0T+Sd(ϕτ​(x),A)dτ=∫0Td(ϕτ​(x),A)dτ+∫TT+Sd(ϕτ​(x),A)dτ=DT​(x)+∫0Sd(ϕτ+T​(x),A)dτ=DT​(x)+∫0Sd(ϕτ​(ϕT​(x)),A)dτ(by the semigroup property)=DT​(x)+DS​(ϕT​(x))​

This connects DTDT​ naturally to Bellman equations, dynamic programming, and occupation measures.


3.4 Heuristic Connection: Dynamic Programming

The additivity property DT+S(x)=DT(x)+DS(ϕT(x))DT+S​(x)=DT​(x)+DS​(ϕT​(x)) suggests a natural connection to dynamic programming. For a controlled system X˙=f(X,u)X˙=f(X,u) with control uUu∈U, the value function V(x)=infuD(x)V(x)=infuD∞​(x) would formally satisfy the Hamilton-Jacobi-Bellman equation:0=infu{d(x,A)+V(x)f(x,u)}0=uinf​{d(x,A)+∇V(x)⋅f(x,u)}

This is a standard result for additive cost functionals. A full derivation for the specific functional DTDT​ is left for future work. This section is a heuristic connection, not a formal result.


3.5 Lipschitz Continuity with Respect to Initial Conditions

Proposition 4 (Lipschitz Continuity of DTDT​): Suppose the flow ϕτϕτ​ is Lipschitz continuous in xx with constant LL, i.e., ϕτ(x)ϕτ(y)eLτxyϕτ​(x)−ϕτ​(y)∥≤exy∥. Then for any x,yx,y in the basin of AA:DT(x)DT(y)0TeLτdτxy=eLT1LxyDT​(x)−DT​(y)∣≤∫0Tedτxy∥=LeLT−1​∥xy

Proof: First, note that the distance function d(,A)d(⋅,A) is 1-Lipschitz: for any x,yXx,y∈X,d(x,A)d(y,A)xyd(x,A)−d(y,A)∣≤∥xy

This follows from the triangle inequality and the definition of the infimum. Then, using the Lipschitz property of the flow:DT(x)DT(y)0Td(ϕτ(x),A)d(ϕτ(y),A)dτ0Tϕτ(x)ϕτ(y)dτ0TeLτxydτ=eLT1LxyDT​(x)−DT​(y)∣​≤∫0T​∣d(ϕτ​(x),A)−d(ϕτ​(y),A)∣dτ≤∫0T​∥ϕτ​(x)−ϕτ​(y)∥dτ≤∫0Texydτ=LeLT−1​∥xy∥​

Interpretation: This proposition guarantees that empirical estimates of DTDT​ are robust under small perturbations of initial conditions and establishes that DTDT​ defines a continuous functional on the basin of attraction. This is essential for numerical estimation and experimental measurement.


3.6 Instantaneous Growth Rate

Remark 1 (Instantaneous Growth Rate): If the integrand d(ϕτ(x),A)d(ϕτ​(x),A) is continuous in ττ, then:ddTDT(x)=d(ϕT(x),A)dTdDT​(x)=d(ϕT​(x),A)

This follows directly from the Fundamental Theorem of Calculus.


3.7 Ergodic Limit

Proposition 5 (Ergodic Limit): Suppose the normalized occupation measure νT=μT/TνT​=μT​/T converges weakly to an invariant probability measure μμ as TT→∞. Then:limT1TDT(x)=Xd(y,A)dμ(y)T→∞lim​T1​DT​(x)=∫X​d(y,A)dμ(y)

Proof: From the occupation measure representation DT(x)=d(y,A)dμT(y)=Td(y,A)dνT(y)DT​(x)=∫d(y,A)dμT​(y)=Td(y,A)dνT​(y), weak convergence of νTνT​ to μμ and boundedness/continuity of d(,A)d(⋅,A) gives the result.

This is the pointwise ergodic theorem applied to the observable d(,A)d(⋅,A). For the ergodic theory of dynamical systems, see Bowen (1975) and Ruelle (1989).


3.8 Bound under Exponential Stability

Theorem 2 (Bound under Exponential Stability): Suppose the flow ϕτ(x)ϕτ​(x) converges to the attractor AA with exponential rate κ>0κ>0:d(ϕτ(x),A)Ceκτd(x,A)d(ϕτ​(x),A)≤Ceκτd(x,A)

for some constant C<C<∞, for all τ0τ≥0. Then:D(x)=0d(ϕτ(x),A)dτCκd(x,A)D∞​(x)=∫0∞​d(ϕτ​(x),A)dτκCd(x,A)

Proof:D(x)=0d(ϕτ(x),A)dτ0Ceκτd(x,A)dτD∞​(x)=∫0∞​d(ϕτ​(x),A)dτ≤∫0∞​Ceκτd(x,A)dτ=Cd(x,A)0eκτdτ=Cκd(x,A)=Cd(x,A)∫0∞​eκτdτ=κCd(x,A)

Corollary: For linearly stable systems with recovery rate κκD(x)1κd(x,A)D∞​(x)≤κ1​d(x,A) (when C=1C=1).

Important: Exponential stability implies D<D∞​<∞. The converse is not claimed; polynomial convergence can also yield finite DD∞​.


3.9 Recovery Rate Bound

Corollary 1 (Recovery Rate Bound): For a system satisfying the exponential stability hypothesis with constant CC, the recovery rate κκ satisfies:κCd(x,A)D(x)κD∞​(x)Cd(x,A)​

For systems with C=1C=1 (e.g., normal/symmetric linearizations with no transient overshoot), this reduces to:κd(x,A)D(x)κD∞​(x)d(x,A)​

Proof: From Theorem 2, we have D(x)Cκd(x,A)D∞​(x)≤κCd(x,A). Rearranging gives κCd(x,A)D(x)κD∞​(x)Cd(x,A)​. When C=1C=1, this reduces to κd(x,A)D(x)κD∞​(x)d(x,A)​.

Interpretation: Small cumulative deviation implies rapid recovery (large κκ). Large cumulative deviation implies slow recovery (small κκ). This formalizes the intuitive link between DTDT​ and κκ. The CC factor accounts for possible transient overshoot in non-normal systems.


3.10 Finite Horizon Approximation

Proposition 6 (Finite Horizon): For any ϵ>0ϵ>0, there exists a finite TϵTϵ​ such that for all T>TϵT>Tϵ​:DT(x)D(x)ϵDT​(x)−D∞​(x)∣≤ϵ

Proof: This follows directly from Theorem 2 under the exponential stability hypothesis. Since the integrand decays exponentially, the tail integral Td(ϕτ(x),A)dτT∞​d(ϕτ​(x),A)dτ can be made arbitrarily small by choosing TT sufficiently large.


3.11 Summary of Properties

Property Statement
Non-negativity DT(x)0DT​(x)≥0
Monotonicity DT2(x)DT1(x)DT2​​(x)≥DT1​​(x) for T2T1T2​≥T1​
Additivity DT+S(x)=DT(x)+DS(ϕT(x))DT+S​(x)=DT​(x)+DS​(ϕT​(x))
Lipschitz continuity ( D_T(x) – D_T(y) \leq \frac{e^{LT} – 1}{L} |x – y| )
Instantaneous growth ddTDT(x)=d(ϕT(x),A)dTdDT​(x)=d(ϕT​(x),A)
Ergodic limit limT1TDT(x)=d(y,A)dμ(y)limT→∞​T1​DT​(x)=∫d(y,A)dμ(y)
Exponential stability implies finite D∞D∞​ D(x)Cκd(x,A)D∞​(x)≤κCd(x,A)
Recovery bound (general) κCd(x,A)D(x)κD∞​(x)Cd(x,A)​
Recovery bound (C=1) κd(x,A)D(x)κD∞​(x)d(x,A)​
Finite horizon approximation DT(x)D(x)DT​(x)→D∞​(x) as TT→∞

4. The Unified Variable Set

The following variables are defined operationally. Where a variable is a proposal, that is stated explicitly.

4.1 Corrective Permeability (κκ)

Definition 4 (Corrective Permeability): κκ is the recovery rate of the system to its attractor after a small perturbation. Operationally estimated as κ=1/τκ=1/τ under approximately exponential relaxation, where ττ is the characteristic recovery time constant. This coincides with the exponential convergence exponent in the linearized regime and is consistent with the original definition in the attractor framework.

Relationship to DTDT​: From Corollary 1, for a system with initial deviation d(x,A)d(x,A), κCd(x,A)D(x)κD∞​(x)Cd(x,A)​.

Note on κ’s status: In this paper, κ is treated as a primitive empirical regime parameter. A stronger theory would derive κ from DTDT​ and system geometry; this remains an open direction for future work.


4.2 Drift Rate (γγ) — A Proposed Distinction

Definition 5 (Drift Rate): We propose the following operational distinction between dynamical regimes, based on the dominant Lyapunov exponent λmaxλmax​:

Regime λmaxλmax​ κκ γγ Behavior
Stable attractor <0.01<−0.01 >0>0 00 Converges to fixed point
Persistent chaos 0≈0 0≈0 >0>0 Wanders without convergence
Full chaos >0>0 undefined >0>0 Diverges

Thresholds: λmax<0.01λmax​<−0.01, λmax0.01λmax​∣≤0.01, and λmax>0.01λmax​>0.01 (pre-registered, measured in units of 1/epoch). These numerical thresholds are illustrative defaults rather than theoretically privileged constants.

Grounding: This distinction is inspired by the literature on chaos in high-dimensional neural networks (Engelken, Wolf & Abbott, 2023; Sompolinsky, Crisanti & Sommers, 1988; Clark, Abbott & Litwin-Kumar, 2023; Fournier & Urbani, 2023). For the treatment of stochastic and random perturbations, see Arnold (1998).

Falsification: If κκ and γγ are perfectly correlated (i.e., systems with small κκ always have small γγ), the distinction is not useful.


4.3 Basin Depth (BB) and Persistence Depth (B~B~)

Definition 6a (Basin Depth — Energy Barrier): BB is the energy barrier required to escape the basin, measured as the potential difference between the attractor and the saddle point on the basin boundary:B=V(saddle)V(attractor)B=V(saddle)−V(attractor)

This preserves the original definition from earlier papers.

Definition 6b (Persistence Depth): As a complementary measure, we define:B~=minxBDT(x)B~=x∈∂Bmin​DT​(x)

This is the cumulative deviation required to reach the basin boundary. The relationship between BB and B~B~ remains an open mathematical question.

Operational alternative: In practice, the basin boundary may not be well-defined. Estimate BB via the Arrhenius relationship PescapeeB/TPescape​∝eB/T, where TT is the noise level.


4.4 Reality Alignment (RR)

Definition 7 (Reality Alignment): RR is the expected log predictive likelihood:R=E[logp(yX)]R=E[logp(yX)]

where p(yX)p(yX) is the system’s predictive distribution over outcomes yy given state XX. Higher RR indicates better predictive accuracy. This is a standard measure of predictive performance; the label “reality alignment” is a philosophical interpretation.

Direction-dependence: The framework interprets RR as potentially direction-dependent: RABRBARAB​=RBA​. This captures the asymmetry found in Berglund et al. (2024), where models trained on “A is B” fail to generalize to “B is A.” This interpretation is a framework-level claim.

Note on integration: Among the core variables, RR is the least integrated with the trajectory-based formalism. Unlike κκBB, and B~B~, which are directly derived from or related to DTDT​, RR is imported from Bayesian statistics. A more complete theoretical derivation of RR from the same dynamical principles—perhaps as an information-theoretic functional of the occupation measure—remains an open direction for future work.


5. Theoretical Framework

5.1 Relationship Between DTDT​, PtopoPtopo​, and E(t)E(t)

Functional What It Measures Regime
DT(x)DT​(x) Cumulative deviation from attractor All systems
Ptopo(t)Ptopo​(t) Topological feature lifetime Systems with topological structure
E(t)E(t) Rate of topological change Learning systems

Hypothesis: In learning systems, DTDT​ and PtopoPtopo​ are positively correlated early in learning and negatively correlated late in learning. Turner & Barak (2023) demonstrate that RNNs develop attractors sequentially during training, which may correspond to phases of topological simplification. This is a testable prediction.


5.2 Relationship Between κκγγ, and E(t)E(t)

Hypothesis: In a learning system, the topological evolution rate E(t)E(t) is monotonically related to κκ only if the system is not in persistent chaos: E/κ>0E/∂κ>0 (with EE and κκ measured on appropriate scales) in convergent regimes. In persistent chaos, E(t)E(t) is monotonically related to γγE/γ>0E/∂γ>0. Correlation analysis provides a statistical test of these monotonicity relationships.


5.3 Adaptive Landscape (Heuristic Note)

The adaptive landscape V(X,t)V(X,t) evolves as:V˙=g(X,V)λV+ξ(t)V˙=g(X,V)−λV+ξ(t)

For gradient systems with X˙=XV(X)X˙=−∇XV(X), and assuming the dynamics remain within the basin where higher-order nonlinearities are negligible, the cumulative deviation functional can be approximated as:DT(x)0TXV(ϕτ(x),τ)dτDT​(x)≈∫0T​∥∇XV(ϕτ​(x),τ)∥dτ

This is a local heuristic. A full derivation and integration into the core formalism is left for future work.


6. Testable Predictions

6.1 Core Prediction

Prediction: In a learning system, E(t)E(t) is monotonically related to κκ in convergent regimes: E/κ>0E/∂κ>0 (with EE and κκ measured on appropriate scales), and E/γ>0E/∂γ>0 in persistent chaos. Correlation analysis provides a statistical test of this monotonicity:Corr(E(t),κ)>0    λmax<0Corr(E(t),κ)>0⟺λmax​<0Corr(E(t),γ)>0    λmax0Corr(E(t),γ)>0⟺λmax​≈0

Falsification: If E(t)E(t) correlates with κκ in all regimes, or with γγ in all regimes, the prediction is falsified.


6.2 Secondary Prediction

Prediction: In systems with high RRDTDT​ and PtopoPtopo​ are negatively correlated late in learning; in systems with low RR, they are uncorrelated or positively correlated.

Falsification: If DTDT​ and PtopoPtopo​ are negatively correlated in both high-R and low-R systems, the prediction is falsified.


6.3 Boundary Condition and Global Falsifier

Conjecture: We conjecture that the framework applies to any system satisfying:

  • A. Well-defined state space.
  • B. Subject to perturbations.
  • C. Exhibits at least one identifiable attractor.
  • D. Dynamics are observable and measurable.

Global Falsifier: The unified ontology claim collapses if a system is found where DTDT​, κκ, and topological persistence are mutually independent across all regimes, and where RR cannot be expressed as a functional of the trajectory or occupation measure. If such a system exists, the framework’s claim to unify persistence, stability, and reality alignment would be falsified.


7. Experimental Design

7.1 System Choice

Train a CNN on MNIST or CIFAR-10. Use latent activation manifolds for topological analysis.

Justification: Karuppiah, Nazreen Banu et al. (2026) demonstrate the use of persistent homology on activations to study feature learning and generalization. Turner & Barak (2023) show that RNNs develop attractors sequentially, providing a controlled setting for studying topological evolution during learning.

7.2 Variable Measurement

Variable Protocol
DT(x)DT​(x) Sample weights; compute distance to final attractor; integrate.
Ptopo(t)Ptopo​(t) Compute persistent homology on latent activations; sum feature lifetimes.
E(t)E(t) Finite differences of Ptopo(t)Ptopo​(t).
κκ Perturb weights; measure recovery time ττκ=1/τκ=1/τ.
γγ Compute average drift rate during training.
RR Cross-domain generalization accuracy.

7.3 Statistical Analysis

  • Correlate E(t)E(t) with κκ and γγ conditional on regime.
  • Pre-register thresholds and sample size.

Note on future empirical work: A full empirical validation would require pre-registration with specified sample size, significance thresholds, power analysis, and robustness checks. These are planned for subsequent work.


8. Discussion

8.1 Implications

The paper provides a candidate formalization with defined variables, mathematical properties, and testable predictions. The mathematical properties of DTDT​ establish its relationship to κκ and provide a foundation for the framework’s core claims.

8.2 Limitations

  • PtopoPtopo​ is computationally expensive.
  • The framework is a meta-theory, not a complete domain-specific theory.
  • Variables may be confounded; causal inference requires controlled experiments.
  • The κ/γκ/γ regime distinction is proposed and requires empirical validation.

8.3 Future Work

  • Empirical validation of predictions.
  • Formal derivation of relationships from first principles.
  • Extension to other domains.
  • Computational efficiency improvements.

9. Conclusion

This paper proposes a candidate formalization for the attractor framework. The central mathematical innovation is treating persistence as a functional defined over trajectories—DT(x)=0Td(ϕτ(x),A)dτDT​(x)=∫0Td(ϕτ​(x),A)dτ—rather than as a scalar property of states. We defined the cumulative deviation functional DTDT​, the topological persistence functional Ptopo(t)Ptopo​(t), and the topological evolution rate E(t)E(t). We proved several mathematical properties of DTDT​, including non-negativity, monotonicity, additivity, Lipschitz continuity, and a bound relating DD∞​ to κκD(x)Cκd(x,A)D∞​(x)≤κCd(x,A). We established connections to dynamic programming and ergodic theory. We unified the variable set with operational definitions. We derived testable predictions and provided a falsifiable experimental protocol.

The framework now admits formal definitions, operational variables, and empirical tests. The next step is empirical validation.


Appendix A: Possible Extensions from Larose (2025) — Unverified Source

Note: The following source has not been independently verified. It is included for completeness and as a potential direction for future exploration, but should not be treated as established.

Larose (2025) develops a framework for recursive deformation systems. Two constructs are potentially relevant:

Constraint Functional: C(X)=trajectoryΦdτC(X)=∫trajectory​∥∇Φ∥dτ, measuring cumulative irreversible deformation.

Persistence Invariant: Ip=RdΦIp​=∮RdΦ, a topological invariant.

These are not yet integrated into the core framework and are presented here for completeness and future exploration. They should be treated as unverified candidate extensions.


References

Arnold, L. (1998). Random Dynamical Systems. Springer.

Berglund, L., et al. (2024). “The Reversal Curse: LLMs Trained on ‘A is B’ Fail to Learn ‘B is A’.” arXiv:2309.12288.

Bowen, R. (1975). Equilibrium States and the Ergodic Theory of Anosov Diffeomorphisms. Springer.

Carlsson, G. (2009). “Topology and data.” Bulletin of the American Mathematical Society, 46(2), 255-308.

Carlsson, G., & Zomorodian, A. (2009). “The theory of multidimensional persistence.” Discrete & Computational Geometry, 42(1), 71-93.

Clark, D. G., Abbott, L. F., & Litwin-Kumar, A. (2023). “Dimension of activity in random neural networks.” Physical Review Letters, 131, 118401.

Edelsbrunner, H., & Harer, J. (2010). Computational Topology: An Introduction. American Mathematical Society.

Engelken, R., Wolf, F., & Abbott, L. F. (2023). “Lyapunov spectra of chaotic recurrent neural networks.” Physical Review Research, 5, 043044.

Fournier, S. J., & Urbani, P. (2023). “Statistical physics of learning in high-dimensional chaotic systems.” Journal of Statistical Mechanics: Theory and Experiment, 2023(11), 113301.

Karuppiah, K., Nazreen Banu, M., et al. (2026). “Topological Data Analysis (TDA) as a Framework for Understanding Deep Learning Behavior.” 2025 IEEE 5th International Conference on ICT in Business Industry & Government (ICTBIG), Indore, India, December 12-13, 2025. IEEE Xplore. DOI: 10.1109/ICTBIG68706.2025.11323998.

Larose, H. (2025). “A Mathematical Theory of Frame-Independent Persistence.” Academia.edu. [Unverified source.]

Ruelle, D. (1989). Chaotic Evolution and Strange Attractors. Cambridge University Press.

Sompolinsky, H., Crisanti, A., & Sommers, H. J. (1988). “Chaos in Random Neural Networks.” Physical Review Letters, 61(3), 259-262.

Turner, E., & Barak, O. (2023). “The Simplicity Bias in Multi-Task RNNs: Shared Attractors, Reuse of Dynamics, and Geometric Representation.” Advances in Neural Information Processing Systems (NeurIPS).


Suggested citation: Galida, R. S. (2026). The Persistence Functional: A Candidate Formal Foundation for the Attractor Framework (Foundational Edition). Fantasy Attractor.