Appearance
LSA / LSI / LDA Geometry
This guide records the platform's intended interpretation of LSA, LSI, and LDA.
Integer substrate
The semantic stack begins on a discrete substrate: characters, tokens, counts, and document-term structure.
We can represent the corpus as a weighted matrix:
where
LSA as a closed latent basis
For LSA we compute a truncated singular value decomposition:
In this platform's conceptual model, LSA is the closed latent basis. Once
This makes LSA well suited for:
- deterministic linear-algebraic compression
- similarity
- clustering
- reconstruction-aware semantic analysis
LSI as the bridge regime
In this platform's model, LSI is not identical to LSA in role, even though both depend on truncated SVD ideas.
LSI is the bridge between local closed structure and broader search/retrieval dynamics. It projects queries and documents into the latent space for operational retrieval:
and document embeddings can be represented in the latent basis by:
So in our conceptual language:
- LSA = closed basis
- LSI = curved bridge between closed basis and broader retrieval space
LDA as the open topic family
For LDA, documents are mixtures of topics and topics are distributions over words:
In this platform's conceptual model, LDA is the open topic family. It is the unbounded topic simplex beyond the fixed closed basis regime.
Projection, annealing, diffusion
The diagrams imply three directional transformations:
Projection
Move from discrete observed substrate into latent structure.
Annealing
Smooth local noisy structure into stable closed latent basis.
Diffusion
Open the closed latent basis outward into broader topic families and semantic drift.
The aligned 23rd channels
The 22-basis regime remains the contributing latent family. The comparison happens at the 23rd channel:
These are aligned variants:
- LSA23 = closed summary channel over the latent LSA-22 basis
- LSI23 = bridge channel from local closed structure toward broader retrieval drift
- LDA23 = open topic extension beyond the fixed latent basis family