TakensFold | Protein Structure Prediction via Phase Space Reconstruction

A Different Lens on Protein Folding

Standard approaches treat protein folding as a learned sequence-to-structure mapping. TakensFold treats it as a problem in nonlinear dynamical systems — recovering a hidden attractor geometry from an observed time series.

Standard View

Protein folding is framed as a sequence-to-structure mapping. Large neural networks learn statistical correlations between amino-acid sequences and known 3D coordinates. Attention mechanisms require O(N²) computation and growing memory — demanding GPU clusters and massive datasets.

Dynamical View

Protein folding is a temporal dynamical process. A newly synthesised polypeptide explores conformational space, converging to a stable geometric attractor. The amino-acid sequence is the observable. Takens' theorem lets us reconstruct the attractor geometry from exponential delay coordinates of that sequence.

Phase Space Reconstruction for Proteins

The amino-acid sequence is not merely a list of letters. It is a one-dimensional symbolic construction signal from which a three-dimensional molecular object is built.

Takens' theorem states that, under mild conditions, the full state space of a deterministic dynamical system can be reconstructed from delayed observations of a single time series. For a protein, the observable is the residue embedding processed position-by-position. The hidden state is the evolving conformation in 3D space.

Delay coordinates of the form:

z(t) = [e(t), e(t−1), e(t−2), e(t−4), e(t−8), e(t−16), e(t−32), e(t−64), e(t−128)]

produce a trajectory diffeomorphic to the original conformational attractor. Exponential spacing captures the natural multi-scale organisation of proteins:

Short delays: local backbone geometry and side-chain contacts
Medium delays: secondary structure (helices, sheets)
Long delays: tertiary topology and domain organisation
Fixed memory: O(1) circular buffer regardless of sequence length

Multi-Scale Structure via Delay Spacing

τ = 1–2 · Local backbone

τ = 4–8 · Secondary structure

τ = 16–32 · Domain contacts

τ = 64–128 · Tertiary topology

Delays = [1, 2, 4, 8, 16, 32, 64, 128]

Triplication Training Strategy

Training proteins are triplicated in the preprocessing pipeline. In a statistical model, repetition adds no new information. In a Takens-based architecture, repeated exposure deepens the learned attractor basins and thickens conformational trajectory filaments in phase space — directly improving prediction accuracy on structurally similar proteins.

MARINA Architecture

Manifold-Aware Reconstruction and Inference Network Architecture — four core components, no attention, no positional encodings, scales linearly with sequence length.

1

Residue Encoding

Each of the 20 standard amino acids (plus non-standard extension vocabulary) is mapped to a learned embedding vector of dimension embed_dim = 128. No positional encodings are used — temporal order is encoded implicitly through the delay structure.

2

Exponential Takens Embedding

At each position t, a delay-coordinate vector is constructed using a circular buffer of size 2^k+1 (k = 7 for the longest delay of 128). This yields O(1) memory usage independent of sequence length. The resulting state vector has dimension (8 + 1) × 128 = 1,152.

3

Adaptive Manifold Projection

The high-dimensional delay vector is projected onto a lower-dimensional manifold via a learned projection matrix W_p followed by LayerNorm. This matrix is the geometric core of the model — its rows encode which combinations of temporal scales are most informative for structure prediction.

4

Temporal Mixing & Coordinate Head

A stack of 6 feedforward residual layers (hidden dimension 512) performs non-linear mixing within each position's manifold state. Three independent linear heads then predict the x, y, z coordinates of the Cα atom. Training uses MSE loss in Ångström space.

Architecture Comparison

Property	AlphaFold2 / ESMFold	MARINA (TakensFold)
Context mechanism	Multi-head attention (Evoformer)	Exponential delay coordinates
Complexity per position	O(N²)	O(log N)
Memory footprint	O(N) growing	O(1) fixed circular buffer
Positional encodings	Yes (multiple forms)	None — implicit in delays
Attention	Central mechanism	None
Hardware requirement	GPU cluster (hundreds of TPUs)	CPU sufficient (i7, 32 GB RAM)
Training data scale	Hundreds of millions of sequences	~300–400 proteins (proof of concept)
Interpretability	Attention weights (indirect)	Manifold geometry (direct)
Open reproducibility	Partial	Full (Mozilla Public License 2.0)

Results

A proof-of-concept trained from scratch on modest hardware — demonstrating that the MARINA architecture can reconstruct coherent protein geometry from residue sequences without attention.

Structure Prediction

PDB 1A7S — 227 Residues

1.01 Å Overall RMSD

0.62 Å Mean per-residue RMSD

The model reconstructs coherent backbone geometry from the amino-acid sequence alone. The N-terminal region shows elevated error due to greater conformational freedom of termini; the remainder of the chain is predicted at ~0.5 Å.

In-training example: demonstrates architectural viability
~15M parameters, Intel i7 CPU, 32 GB RAM
~300–400 training proteins (triplicated)
Fully reproducible via open-source repository

Binding Affinity

PDB 1E2F — Binding Programme

1.39 Å Aligned RMSD (1E2F)

Multiscale Correspondence model

Protein-ligand affinity is reframed as multiscale correspondence between two construction signals — the amino-acid sequence (protein) and the SMILES string (ligand). Both are observed symbolic time series from which hidden geometric constraints are reconstructed by Takens-style delay embeddings.

Affinity is not a local property — it is a global multiscale relationship
SMILES treated as a one-dimensional symbolic construction code
Binding pocket emerges from whole-protein multiscale construction
Theoretical programme detailed in Parts 1 & 2

Key Insight: Attractor Reconstruction, Not Pattern Matching

Standard protein models learn correlations between sequences and known structures. MARINA instead reconstructs the attractor geometry of the folding dynamical system. Repeated training exposure does not add statistical redundancy — it deepens geometric basin structure in phase space, directly analogous to the "memory fibre" phenomenon observed in the language modelling experiments. The projection matrix W_p and manifold trajectories offer direct geometric interpretability: rows of W_p reveal learned temporal scales; phase-space analysis can probe attractor stability and mutation effects.

The Papers

Paper 1

Takens-Based Transformer for Protein Structure Prediction

Kevin R. Haylett | May 2026 | Selected Communications

Protein folding is reframed as phase-space reconstruction using Takens' delay embedding theorem. Rather than relying on attention or statistical pattern matching, the MARINA model reconstructs the folded geometry directly from exponential delay coordinates of the amino-acid sequence. Achieves 1.01 Å overall RMSD on 1A7S (227 residues), trained on modest hardware. Complete implementation released as open-source code.

Highlights

Theoretical foundation: Protein folding as convergence to a conformational attractor
MARINA architecture: Four components, attention-free, encoding-free
Triplication strategy: Deepens attractor basins rather than adding statistical redundancy
Full reproducibility: Training pipeline, inference scripts, and results released
Consumer hardware: Intel i7 CPU, 32 GB RAM sufficient

Download PDF View Online

Paper 2

Protein-Ligand Affinity as Multiscale Correspondence

Kevin R. Haylett | May 2026 | Selected Communications · Parts 1 & 2

A Takens-based programme for sequence-to-structure and affinity modelling. The central argument: binding affinity is not a local property of a ligand touching a binding pocket. It is a measured scalar imposed on a multiscale correspondence between two construction signals — a protein sequence and a ligand SMILES string. Takens-style delay embeddings offer a principled method for reconstructing hidden geometric constraints from both signals.

Highlights

Conceptual reframing: Affinity as multiscale correspondence, not local docking score
Ligand as signal: SMILES string treated as one-dimensional symbolic construction code
Data clarity: Systematic reconstruction of what is measured, inferred, and compressed
Programme scope: Theoretical foundations and experimental directions for future scaling
Illustrated: 1E2F in-training example (1.39 Å RMSD)

Download Part 1 Download Part 2

Citation

Kevin R. Haylett, "Takens-Based Transformer for Protein Structure Prediction: A Proof-of-Concept Implementation with Open-Source Code," Selected Communications (May 2026), code available at https://github.com/KevinHaylett/takens-protein-prediction

Quick Start

The complete codebase enables full reproducibility on consumer hardware. All training and inference commands are documented in the repository README.

Install

pip install torch numpy pandas \
    matplotlib biopython

Prepare Data

python pipeline/pdb_to_csv_batch.py
python pipeline/pdb_to_training.py

Train & Predict

python train.py
python inference.py

KevinHaylett/takens-protein-prediction

Mozilla Public License 2.0 · Results folder includes 1A7S example outputs

Further Exploration

TBT Language Modelling

The same MARINA architecture applied to language: replacing attention with exponential delay coordinates for O(N) complexity text generation. Three experiments across general, factual, and creative domains.

Takens Language Site →

Geofinitism

The philosophical framework underlying this work. Meaning, mathematics, and measurement treated as finite, relational, and dynamical — symbols as finite marks embedded in trajectories of use, not Platonic abstractions.

Visit FiniteMechanics.com →

Substack

Essays and updates on geometric thinking, dynamical systems, protein structure, and the broader TBT research programme.

Read on Substack →

Related Papers

About

Kevin R. Haylett, PhD is an independent researcher with 25+ years of experience in medical engineering, neural networks, and nonlinear dynamical systems. His work applies dynamical systems theory to language models, protein structure, and the foundations of AI — emphasising finite measurement, geometric interpretability, and reproducibility on modest hardware.

TakensFold is one application of the broader Takens-Based Transformer programme, which has also been applied to language modelling and time-series tasks. The goal is not to replace existing large-scale methods, but to demonstrate that attractor reconstruction offers a principled, interpretable, and computationally efficient alternative route through the problem space.

For more background: A Journey Through Rhythms — from Heartbeats to Language Models

Manchester, UK · kevin.haylett@gmail.com

Rethinking Protein Folding

A Different Lens on Protein Folding

Standard View

Dynamical View

Phase Space Reconstruction for Proteins

Multi-Scale Structure via Delay Spacing

Triplication Training Strategy

MARINA Architecture

Residue Encoding

Exponential Takens Embedding

Adaptive Manifold Projection

Temporal Mixing & Coordinate Head

Architecture Comparison

Results

PDB 1A7S — 227 Residues

PDB 1E2F — Binding Programme

Key Insight: Attractor Reconstruction, Not Pattern Matching

The Papers

Takens-Based Transformer for Protein Structure Prediction

Highlights

Protein-Ligand Affinity as Multiscale Correspondence

Highlights

Citation

Quick Start

Install

Prepare Data

Train & Predict

Further Exploration

TBT Language Modelling

Geofinitism

Substack

Related Papers

About