MARINA · Open Source · CPU-Trainable

Rethinking Protein Folding

From sequence statistics to conformational attractors

TakensFold applies Takens' delay embedding theorem to protein structure prediction, treating the amino-acid sequence as the observable of a dynamical system converging to a stable folded attractor. The MARINA architecture achieves O(N) complexity and O(1) fixed memory — no attention, no positional encodings.

Simul Pariter — Together Equally
1.01 Å Overall RMSD
0.62 Å Mean per-residue RMSD
~15M Parameters
CPU No GPU required

A Different Lens on Protein Folding

Standard approaches treat protein folding as a learned sequence-to-structure mapping. TakensFold treats it as a problem in nonlinear dynamical systems — recovering a hidden attractor geometry from an observed time series.

Standard View

Protein folding is framed as a sequence-to-structure mapping. Large neural networks learn statistical correlations between amino-acid sequences and known 3D coordinates. Attention mechanisms require O(N²) computation and growing memory — demanding GPU clusters and massive datasets.

Sequence → Structure (statistical)

Dynamical View

Protein folding is a temporal dynamical process. A newly synthesised polypeptide explores conformational space, converging to a stable geometric attractor. The amino-acid sequence is the observable. Takens' theorem lets us reconstruct the attractor geometry from exponential delay coordinates of that sequence.

Trajectory → Attractor (geometric)

Phase Space Reconstruction for Proteins

The amino-acid sequence is not merely a list of letters. It is a one-dimensional symbolic construction signal from which a three-dimensional molecular object is built.

Takens' theorem states that, under mild conditions, the full state space of a deterministic dynamical system can be reconstructed from delayed observations of a single time series. For a protein, the observable is the residue embedding processed position-by-position. The hidden state is the evolving conformation in 3D space.

Delay coordinates of the form:

z(t) = [e(t), e(t−1), e(t−2), e(t−4), e(t−8), e(t−16), e(t−32), e(t−64), e(t−128)]

produce a trajectory diffeomorphic to the original conformational attractor. Exponential spacing captures the natural multi-scale organisation of proteins:

  • Short delays: local backbone geometry and side-chain contacts
  • Medium delays: secondary structure (helices, sheets)
  • Long delays: tertiary topology and domain organisation
  • Fixed memory: O(1) circular buffer regardless of sequence length

Multi-Scale Structure via Delay Spacing

τ = 1–2 · Local backbone
τ = 4–8 · Secondary structure
τ = 16–32 · Domain contacts
τ = 64–128 · Tertiary topology

Delays = [1, 2, 4, 8, 16, 32, 64, 128]

Triplication Training Strategy

Training proteins are triplicated in the preprocessing pipeline. In a statistical model, repetition adds no new information. In a Takens-based architecture, repeated exposure deepens the learned attractor basins and thickens conformational trajectory filaments in phase space — directly improving prediction accuracy on structurally similar proteins.

MARINA Architecture

Manifold-Aware Reconstruction and Inference Network Architecture — four core components, no attention, no positional encodings, scales linearly with sequence length.

1

Residue Encoding

Each of the 20 standard amino acids (plus non-standard extension vocabulary) is mapped to a learned embedding vector of dimension embed_dim = 128. No positional encodings are used — temporal order is encoded implicitly through the delay structure.

2

Exponential Takens Embedding

At each position t, a delay-coordinate vector is constructed using a circular buffer of size 2k+1 (k = 7 for the longest delay of 128). This yields O(1) memory usage independent of sequence length. The resulting state vector has dimension (8 + 1) × 128 = 1,152.

3

Adaptive Manifold Projection

The high-dimensional delay vector is projected onto a lower-dimensional manifold via a learned projection matrix Wp followed by LayerNorm. This matrix is the geometric core of the model — its rows encode which combinations of temporal scales are most informative for structure prediction.

4

Temporal Mixing & Coordinate Head

A stack of 6 feedforward residual layers (hidden dimension 512) performs non-linear mixing within each position's manifold state. Three independent linear heads then predict the x, y, z coordinates of the Cα atom. Training uses MSE loss in Ångström space.

Architecture Comparison

Property AlphaFold2 / ESMFold MARINA (TakensFold)
Context mechanism Multi-head attention (Evoformer) Exponential delay coordinates
Complexity per position O(N²) O(log N)
Memory footprint O(N) growing O(1) fixed circular buffer
Positional encodings Yes (multiple forms) None — implicit in delays
Attention Central mechanism None
Hardware requirement GPU cluster (hundreds of TPUs) CPU sufficient (i7, 32 GB RAM)
Training data scale Hundreds of millions of sequences ~300–400 proteins (proof of concept)
Interpretability Attention weights (indirect) Manifold geometry (direct)
Open reproducibility Partial Full (Mozilla Public License 2.0)

Results

A proof-of-concept trained from scratch on modest hardware — demonstrating that the MARINA architecture can reconstruct coherent protein geometry from residue sequences without attention.

Structure Prediction

PDB 1A7S — 227 Residues

1.01 Å Overall RMSD
0.62 Å Mean per-residue RMSD

The model reconstructs coherent backbone geometry from the amino-acid sequence alone. The N-terminal region shows elevated error due to greater conformational freedom of termini; the remainder of the chain is predicted at ~0.5 Å.

  • In-training example: demonstrates architectural viability
  • ~15M parameters, Intel i7 CPU, 32 GB RAM
  • ~300–400 training proteins (triplicated)
  • Fully reproducible via open-source repository
Binding Affinity

PDB 1E2F — Binding Programme

1.39 Å Aligned RMSD (1E2F)
Multiscale Correspondence model

Protein-ligand affinity is reframed as multiscale correspondence between two construction signals — the amino-acid sequence (protein) and the SMILES string (ligand). Both are observed symbolic time series from which hidden geometric constraints are reconstructed by Takens-style delay embeddings.

  • Affinity is not a local property — it is a global multiscale relationship
  • SMILES treated as a one-dimensional symbolic construction code
  • Binding pocket emerges from whole-protein multiscale construction
  • Theoretical programme detailed in Parts 1 & 2

Key Insight: Attractor Reconstruction, Not Pattern Matching

Standard protein models learn correlations between sequences and known structures. MARINA instead reconstructs the attractor geometry of the folding dynamical system. Repeated training exposure does not add statistical redundancy — it deepens geometric basin structure in phase space, directly analogous to the "memory fibre" phenomenon observed in the language modelling experiments. The projection matrix Wp and manifold trajectories offer direct geometric interpretability: rows of Wp reveal learned temporal scales; phase-space analysis can probe attractor stability and mutation effects.

The Papers

Paper 1

Takens-Based Transformer for Protein Structure Prediction

Kevin R. Haylett | May 2026 | Selected Communications

Protein folding is reframed as phase-space reconstruction using Takens' delay embedding theorem. Rather than relying on attention or statistical pattern matching, the MARINA model reconstructs the folded geometry directly from exponential delay coordinates of the amino-acid sequence. Achieves 1.01 Å overall RMSD on 1A7S (227 residues), trained on modest hardware. Complete implementation released as open-source code.

Highlights

  • Theoretical foundation: Protein folding as convergence to a conformational attractor
  • MARINA architecture: Four components, attention-free, encoding-free
  • Triplication strategy: Deepens attractor basins rather than adding statistical redundancy
  • Full reproducibility: Training pipeline, inference scripts, and results released
  • Consumer hardware: Intel i7 CPU, 32 GB RAM sufficient
Paper 2

Protein-Ligand Affinity as Multiscale Correspondence

Kevin R. Haylett | May 2026 | Selected Communications · Parts 1 & 2

A Takens-based programme for sequence-to-structure and affinity modelling. The central argument: binding affinity is not a local property of a ligand touching a binding pocket. It is a measured scalar imposed on a multiscale correspondence between two construction signals — a protein sequence and a ligand SMILES string. Takens-style delay embeddings offer a principled method for reconstructing hidden geometric constraints from both signals.

Highlights

  • Conceptual reframing: Affinity as multiscale correspondence, not local docking score
  • Ligand as signal: SMILES string treated as one-dimensional symbolic construction code
  • Data clarity: Systematic reconstruction of what is measured, inferred, and compressed
  • Programme scope: Theoretical foundations and experimental directions for future scaling
  • Illustrated: 1E2F in-training example (1.39 Å RMSD)

Citation

Kevin R. Haylett, "Takens-Based Transformer for Protein Structure Prediction: A Proof-of-Concept Implementation with Open-Source Code," Selected Communications (May 2026), code available at https://github.com/KevinHaylett/takens-protein-prediction

Quick Start

The complete codebase enables full reproducibility on consumer hardware. All training and inference commands are documented in the repository README.

Install

pip install torch numpy pandas \
    matplotlib biopython

Prepare Data

python pipeline/pdb_to_csv_batch.py
python pipeline/pdb_to_training.py

Train & Predict

python train.py
python inference.py

Further Exploration

TBT Language Modelling

The same MARINA architecture applied to language: replacing attention with exponential delay coordinates for O(N) complexity text generation. Three experiments across general, factual, and creative domains.

Takens Language Site →

Geofinitism

The philosophical framework underlying this work. Meaning, mathematics, and measurement treated as finite, relational, and dynamical — symbols as finite marks embedded in trajectories of use, not Platonic abstractions.

Visit FiniteMechanics.com →

Substack

Essays and updates on geometric thinking, dynamical systems, protein structure, and the broader TBT research programme.

Read on Substack →

About

Kevin R. Haylett, PhD is an independent researcher with 25+ years of experience in medical engineering, neural networks, and nonlinear dynamical systems. His work applies dynamical systems theory to language models, protein structure, and the foundations of AI — emphasising finite measurement, geometric interpretability, and reproducibility on modest hardware.

TakensFold is one application of the broader Takens-Based Transformer programme, which has also been applied to language modelling and time-series tasks. The goal is not to replace existing large-scale methods, but to demonstrate that attractor reconstruction offers a principled, interpretable, and computationally efficient alternative route through the problem space.

For more background: A Journey Through Rhythms — from Heartbeats to Language Models

Manchester, UK · kevin.haylett@gmail.com