You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Ancient.AI.V โ€” Recursive Language Model

Architecture: Recursive Language Model (RLM) Not a Large Language Model โ€” a fundamentally different architecture built from scratch.

Property Value
Parameters 1.147B
Context Window 64,000 tokens
Layers 24
Hidden Size 2,048
Attention Heads 16 (GQA, 8 KV heads)
FFN Dimension 8,192
Vocab Size 64,000
Activation SwiGLU
Position Encoding YaRN-extended RoPE (base 500k, scale 8ร—)
Weight Format safetensors
Precision bfloat16 (fine-tune target)

What Makes It Different From an LLM

Standard LLMs run one forward pass: input โ†’ output.

Ancient.AI.V runs a Recursive Outer Loop: the model refines its own output recursion_depth times per call, with a learned halting gate that stops early when confident. This is the core of the Recursive Language Model paradigm.


Integrated Self-Automated (SA) Modules

All 17 SA modules operate simultaneously within each decoder layer as parallel residual paths โ€” not sequential post-processing steps.

Module Implementation
SA Meta-Learning Per-sample fast-weight delta generation (learned MAML inner loop)
SA Reinforcement Learning Per-token value estimation + policy gate (actor-critic in forward pass)
SA Continual Learning EWC-inspired importance weighting from initial representations
SA Adaptive Learning Learned depth-gating; tokens can exit processing early
SA Rewriting Cross-attention from current โ†’ earlier hidden states (in-context revision)
SA NLP Bigram/trigram convolutions + semantic role projection
SA Problem Solving Multi-step latent chain-of-thought scratchpad (3 internal steps)
SA Innovation Novelty-promoting repulsion in embedding space
SA Debugging Anomaly detection + learned correction on hidden state norms
SA Long/Short-Term Memory 512 persistent learnable memory slots with read/write gating
SA Recursive Seed Learning Compress โ†’ refine โ†’ expand self-representation cycle
SA Self-Evaluation & Reward Per-token reward MLP; plugs directly into PPO/GRPO fine-tuning
SA Goal & Constraint Engine Learned goal embedding cross-attends to steer generation
SA Memory Consolidation Bidirectional GRU trace encoder with hippocampal replay
SA Introspection Interface Uncertainty + confidence mapping over hidden states
SA Recursive Outer Loop Post-stack self-refinement with learned halting
SA Conversational Intelligence Dialogue state tracker (turn, topic shift, emotion, formality)

Multimodal Support

Native encoders for all four modalities, fused before the decoder stack:

  • Text โ€” BPE tokenizer, 64k vocab
  • Image โ€” ViT-style patch encoder (16ร—16 patches, up to 224ร—224)
  • Audio โ€” Whisper-style mel-spectrogram encoder (80 mel bins)
  • Video โ€” Frame-by-frame ViT + temporal self-attention

Training / Fine-Tuning

This checkpoint contains randomly initialized weights โ€” it is an architecture shell ready for fine-tuning.

Recommended fine-tuning approaches:

  • SFT (Supervised Fine-Tuning) with causal LM loss
  • RLHF/PPO โ€” plug training reward into the SASelfEvaluation reward head
  • GRPO โ€” the sa_eval reward signal is already shaped for group-relative optimization
  • LoRA / QLoRA โ€” compatible with standard PEFT adapters

Training the self-reward head jointly with SFT gives Ancient.AI.V self-improvement capability without a separate reward model.


Usage

# AutoTokenizer available after fine-tuning with a trained tokenizer
from ancient_ai import AncientConfig, AncientAIV  # after registering custom class
import torch

cfg   = AncientConfig()
model = AncientAIV(cfg)
# Load weights:
# model = AncientAIV.from_pretrained("GODsStrongestSoldier/Ancient.AI.V")

tokenizer = AutoTokenizer.from_pretrained("GODsStrongestSoldier/Ancient.AI.V")
input_ids = tokenizer("Hello Ancient.AI", return_tensors="pt").input_ids

generated = model.generate_text(input_ids, max_new=200, temperature=0.8)
print(tokenizer.decode(generated[0]))

Architecture Citation

Ancient.AI.V โ€” Recursive Language Model (RLM)
Author: GODsStrongestSoldier
Year: 2025
Architecture: Custom RLM with 17 integrated SA modules
Repo: https://huggingface.co/GODsStrongestSoldier/Ancient.AI.V

License

Apache 2.0 โ€” free for research and commercial fine-tuning.

Downloads last month
-
Safetensors
Model size
6B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Collection including WithinUsAI/Ancient.AI.V