Mental Models: Mathematical Structures as Mind

Basis & Transformation

The same object, viewed in a different set of axes, can make a hard problem dissolve — many problems are hard only because you're in the wrong basis

Explanation

The heart of linear algebra isn't "crunching matrices" — it's an epistemological claim: an object is invariant, but the basis (set of coordinate axes) you describe it in decides how complex it looks. The same vector has messy coordinates in one basis and a single component in another. A change of basis is a change of perspective, and the right one makes a tangled problem decouple on its own.

Non-trivial points: ① Eigenvectors are the directions a transformation only stretches, never rotates — a system's natural axes. Find the eigenbasis and a coupled transformation becomes independent scalings of each component (diagonalization). Finding eigenvectors is asking "in what directions does this system keep itself?" ② Seemingly unrelated techniques — Fourier transforms, principal component analysis (PCA), word embeddings — are all the same move: re-express the data in a basis where the problem becomes linear and separable. ③ The whole deep-learning revolution is one sentence: automatically learn a basis where entangled data becomes linearly separable. "Representation" just means "choice of basis."

Same vector, right basis — the description compresses hard. The problem didn't change; its complexity did

Classic example

The Fourier transform: a messy sound wave is a writhing curve in the time domain, nearly intractable; switch to the frequency basis (sinusoids as axes) and the same signal becomes a few clean vertical lines — denoising, compression, and tuning turn trivial. The signal didn't change; a change of basis made the hard problem vanish.

Scenario · BigCat

LLM word embeddings: words live in a high-dimensional space, and "king − man + woman ≈ queen" works because the learned basis aligns semantic axes like "gender" and "royalty" into independent coordinates. Likewise a stuck product decision — if you keep weighing "cost vs experience" on entangled axes, try another basis (say "reversibility vs learning value"), and the dilemma often decouples into two separately decidable problems.

AI Prompt

English Prompt

I'm stuck in a tradeoff: [describe the problem and the 2-3 dimensions I currently weigh it on]. Use the "change of basis" lens: 1. Name the coordinate axes I'm implicitly using, and whether they're entangled (moving one drags the other). 2. Propose 2 alternative bases (sets of dimensions); say which one makes the dilemma decouple into separately decidable subproblems. 3. Identify the "eigen-direction" of this problem — the axis along which the system stays invariant and is most worth betting on.

Continuity & Invariants

A coffee mug and a donut are "the same" — topology teaches you to ignore size and detail and watch only what can't be torn or pulled off

Explanation

Topology studies the properties that survive continuous deformation (stretching, bending — but no tearing, no gluing). A handled coffee mug can be continuously squeezed into a donut; to topology they are the same object, because both have exactly one hole. The power: once you stop fussing over distance, angle, and size — all mutable — and instead ask "what can't change no matter how I deform this," you've grasped the system's essential skeleton.

Non-trivial points: ① Topological invariants (connectedness, number of holes, boundaries) jump only when something tears — so they're robust to noise and perturbation: stable under any wiggle, and when they change, it's a qualitative leap. This is the mathematical language of phase transitions, identity persistence, and critical points. ② "Can you get from A to B" is purely topological, independent of distance — that's why it's more fundamental than metric. ③ A Buddhist resonance: a river, a "self" — endlessly deforming, yet seemingly carrying some invariant (the continuing stream) through it all; topology also allows the other answer — perhaps there is no invariant, only the deformation itself (non-self). The discipline is to first separate the mutable details from the skeleton that breaks only when torn.

Classic example

The Seven Bridges of Königsberg: can you walk every bridge exactly once? Euler's insight was to erase the shapes and distances of islands and bridges, keeping only the graph of "what connects to what." Once reduced to pure connectivity, the answer is immediate (impossible). This was the founding moment of topological thinking: the key to the problem is first throwing away the irrelevant metric detail.

Scenario · BigCat

In distributed systems, a "network partition" is essentially a topological event: connectivity doesn't degrade gradually — at some instant it breaks discontinuously, forcing the CAP tradeoff (you can keep consistency or availability, not both). In parenting: a child's core temperament is a topological invariant — it gets continuously stretched and shaped by growth, but you can't "tear" it off by force. The wise move isn't to fight the invariant but to deform along with it — channel the same energy toward a different outlet.

AI Prompt

English Prompt

I'm analyzing this system / relationship / plan: [describe]. Use the "topological invariant" lens to separate skeleton from detail: 1. Which parts are continuously deformable details (numbers, scale, shape — can change without harming the essence)? 2. Which are invariants (the core that only breaks when torn — connectivity, dependencies, identity, underlying structure)? 3. Is anything approaching a "tearing point" (a discontinuous jump / phase change in an invariant)? How can I detect it early?

Composition & Isomorphism

Don't look at what a thing is made of — look at how it connects and chains to other things. The essence lives in the arrows, not the boxes

Explanation

Category theory proposes a radical shift: don't study an object's internal construction; study the arrows (maps) between objects and how those arrows compose. A thing's essence lies not in its material but in how it relates to and chains with other things. This is exactly the mathematical origin of software engineering's "program to the interface, not the implementation" — what a module is matters less than its input/output types and whether it composes cleanly with others.

Non-trivial points: ① Composability is the whole game: if A→B and B→C reliably chain into A→C, you can stack arbitrarily complex systems without losing control. This is the mathematics of modularity — Lego is Lego only because its interfaces compose. ② Isomorphism is the rigorous form of analogy: when an invertible, composition-preserving map exists between two structures, they are mathematically the same thing — every theorem proven in one domain transfers intact to the other. Recognizing "my scheduling problem is isomorphic to graph coloring" means inheriting that domain's entire solved body of results for free. ③ A functor is a structure-preserving translation — a "reliable analogy" that maps the whole web of relations across seamlessly.

Watch the arrows and composition: only if A→B→C chains cleanly into A→C can a system stack without end

Classic example

Function composition and "interface over implementation": the Unix pipe cat | grep | sort is category theory's spirit made live — each command is an arrow (input stream → output stream); you don't care how it's written internally, only that the types connect, so you can chain them into powerful pipelines at will. The entire Unix philosophy is one line: make small programs composable.

Scenario · BigCat

Designing an AI agent pipeline: each agent is an arrow (input type → output type), and whether the system runs depends on whether one's output type cleanly connects to the next's input — designing agents is designing composable morphisms. And when you notice your "team task-scheduling problem" is isomorphic to graph coloring, you can import that field's mature algorithms wholesale, saving the cost of thinking from scratch. Spotting an isomorphism is the sharpest move in cross-disciplinary transfer.

AI Prompt

English Prompt

I'm designing / decomposing this system: [describe]. Use the "composition & isomorphism" lens: 1. Break it into arrows (each component's input type → output type); flag the two interfaces where types don't line up cleanly and will be fragile. 2. Which parts' internal implementation should I deliberately ignore, watching only the interface contract? 3. Is this problem isomorphic to a well-studied classic one? If so, what mature solution can I import wholesale?

Dimensionality & Similarity

Intelligence is finding the few axes that truly matter and flattening a high-dimensional world — but "similar" is never absolute; it depends on which space you flattened into

Explanation

High-dimensional data almost always curls up on a low-dimensional "manifold": it looks like hundreds of variables, yet only a handful of directions truly vary. A core act of intelligence is finding those few axes that matter and compressing away the rest — that is compression, and it is abstraction. And "similarity" hides an often-missed trap: without specifying a space and a metric, "similar" is meaningless. Two things sit close together in one projection and opposite in another.

Non-trivial points: ① Good dimensionality reduction keeps the signal and drops the noise — the same coin as Card 1's "choosing a basis" (e.g. PCA). ② The curse of dimensionality: in high dimensions, volume concentrates near the shell and "the distance between any two points tends to equalize" — so "nearest neighbor" loses meaning, and intuitions that work in low dimensions fail en masse. Counterintuitive, but unavoidable in high-dimensional data. ③ For BigCat: embedding vectors, semantic retrieval, vector databases all "define similarity in some learned low-dimensional space" — retrieval quality equals reduction quality. ④ But never forget: reduction is lossy — a map, not the territory itself. Compressing a person into three labels is efficient; the cost is that the dimension you compressed away is often exactly the one you most needed to see this time.

Classic example

Any world map: projecting a 3D earth onto a 2D sheet must distort something — the Mercator projection, to preserve angles, draws high-latitude Greenland far larger than it really is. Every reduction preserves one thing while sacrificing another; the question is always "which dimension should I preserve this time?"

Scenario · BigCat

RAG (retrieval-augmented generation) and semantic search: the system compresses both documents and the query into one embedding space and retrieves relevant chunks by cosine similarity — recall is only as good as whether this reduction preserved the few semantic axes that actually determine relevance. Everyday decisions work the same: evaluating a candidate or plan, you mentally reduce it to two or three dimensions and score it; the real risk is that you happened to project away the one dimension that mattered most. Just asking "did I compress away an important axis?" often rescues a bad judgment.

AI Prompt

English Prompt

I'm evaluating / comparing these options: [describe], aiming for [desired outcome]. Use the "dimensionality & similarity" lens to check me: 1. Which 2-3 axes am I implicitly scoring them on? Does this set omit the dimension most critical to the goal? 2. In the space I chose, "similar / better" holds — would the conclusion flip in another reasonable space? 3. Name one dimension I've projected away that could be decisive, and how to fold it back into the decision.