Day 3 · 2026.05.23

The Essence of Calculus

Seeing calculus through 3Blue1Brown's eyes
"Calculus is the mathematics of change, and the universe is changing." — Steven Strogatz, Infinite Powers

The Limit

The Bedrock of Analysis
Analysis
Intuition

Imagine you want to measure the area under a curve. Slice it into 10 rectangles — not accurate. 100 — better. 1,000, 10,000 — every additional cut shrinks the error a little more. What a limit wants to say is: "if I can imagine slicing forever — slicing to infinite thinness — the answer approaches a definite number."

The subtle point is this: we don't actually slice "to infinity." We say instead "as close as you want." That is the language of $\varepsilon$–$\delta$: give me any tiny error $\varepsilon$, and I can give you a step number $N$ past which the result is within $\varepsilon$ of the target. The limit isn't "the value at some mysterious point infinitely far away." It is "the value we can arbitrarily closely approach." Translating "infinity" into "finite, controllable approximation" is the great grammatical reconstruction performed by Cauchy and Weierstrass in the 19th century.

n=4 n=10 n→∞ ∫f
Why it's beautiful

The limit settles a question the Greeks couldn't answer in two thousand years: Zeno's paradox. Achilles cannot catch the tortoise, because each time he reaches the tortoise's previous position, the tortoise has crept a little further... The Greeks stayed stuck on "infinitely many steps cannot be completed." The limit answers them: infinitely many shrinking distances can add up to a finite sum ($\frac{1}{2}+\frac{1}{4}+\frac{1}{8}+\cdots = 1$). It is the first time in human history that "infinity" was tamed. Strogatz, in Infinite Powers, calls it "the deepest invention in mathematics."

Applications

Limits underlie every modern application of physics, engineering, and machine learning. "Let the step size go to zero" in gradient descent; "let the depth or width go to infinity" for the limiting behavior of a deep network; "let the mesh become arbitrarily fine" in numerical methods — all of it is the limit at work. In ML, the Neural Tangent Kernel (NTK) studies what neural networks become as their width tends to infinity, where, surprisingly, they become analytically tractable.

History

When Newton and Leibniz invented calculus in the 1670s, they used the vague notion of an infinitesimal — something smaller than any positive number, yet not zero. Bishop Berkeley scorned them as "the ghosts of departed quantities." It wasn't until Cauchy (in the 1820s) and Weierstrass (in the 1850s) introduced the $\varepsilon$–$\delta$ definition that calculus finally got rigorous foundations. A century and a half of "useful but not rigorous mathematics" came to an end.

Going deeper
  • 3Blue1Brown · Essence of Calculus, Ch.7: Limits (the visual intuition)
  • Strogatz · Infinite Powers (limits and Zeno)
English Insight
limit · ε–δ definition · infinitesimal · convergence. "In the limit" — the common phrasing for "as the parameter goes to its limiting value." "Approaches" / "tends to" — the verbs for convergence. "Rigorization" — the term for Weierstrass-style tightening of foundations.
Something to chew on
Does $0.999\ldots$ equal $1$? Most people instinctively feel "infinitely close, but a hair smaller." If you take the limit definition seriously, what's your answer? And what does the gap between those two intuitions say about the kind of mathematical universe you live in?

The Derivative — the Instantaneous Rate of Change

Differential Calculus
Differential Calculus
$$f'(x) = \lim_{h \to 0}\frac{f(x+h)-f(x)}{h}$$
Intuition

You're driving. The speedometer reads 60 km/h. What does the speed "right now" even mean? "Right now" is a single instant; in zero time you travel zero distance; 0/0 is meaningless. And yet the speedometer happily shows a number. The derivative is the resolution of that paradox: it's the value that the average speed converges to as the time window shrinks toward zero.

Geometrically, the derivative is the slope of the tangent line at a point on the graph. Zoom in, zoom in, zoom in — close enough in, any smooth curve starts to look like a straight line. The slope of that line is the derivative. It's the line 3Blue1Brown keeps repeating: "zoom in until curved becomes straight." The derivative isn't the magic of "dividing by zero" — it's the geometric fact that, under infinite magnification, curves become straight.

x secant, large h tangent = f'(x) x f(x)
Why it's beautiful

The derivative unifies "velocity," "slope," "marginal cost," "chemical reaction rate," "population growth rate," "the descent direction of a loss function" into one mathematical object. One concept, a thousand faces. That is the power of abstraction — discovering that things which look unrelated on the surface share the same underlying structure. Terry Tao has written: "The real magic of calculus isn't in the formulas — it is that it lets us speak systematically about change."

Applications

All of machine-learning training is: compute the derivative of the loss with respect to the weights, then step opposite to the gradient. Backpropagation is just the chain rule, $\frac{dL}{dw} = \frac{dL}{dy}\cdot\frac{dy}{dw}$, applied efficiently to billions of parameters. Every training step of GPT-5 is the derivative pointing the way. In physics, the $a$ in $F = ma$ is the second derivative of position with respect to time. In biology, growth is a first derivative; accelerating growth is a second.

History

In 1666 the plague closed Cambridge, and a 23-year-old Newton retreated to Woolsthorpe, where he independently invented his "method of fluxions." Almost simultaneously (the late 1670s), Leibniz developed the $dy/dx$ notation independently on the Continent. Their subsequent priority dispute is one of the most famous fights in mathematical history — but the notation we use today is almost entirely Leibniz's: cleaner, more structural, easier to manipulate.

Going deeper
  • 3Blue1Brown · Essence of Calculus, Ch.2: The paradox of the derivative
  • Spivak · Calculus (the rigorous classic)
English Insight
derivative · differentiable · tangent line · rate of change · chain rule. "Take the derivative with respect to $x$" — the verbal idiom for differentiation. "Smooth function" — differentiable to all orders. "Gradient" — the vector generalization of derivative to multiple variables.
Something to chew on
Why does $|x|$ (absolute value) fail to have a derivative at $x=0$? Sketch it — there's a corner at the origin. What does a "corner" mean mathematically? The ReLU activation used in neural networks has the same corner — how does gradient descent handle it?

The Integral — a Continuous Sum

Integral Calculus
Integral Calculus
$$\int_a^b f(x)\,dx = \lim_{n\to\infty}\sum_{i=1}^{n} f(x_i)\,\Delta x$$
Intuition

The integral sign $\int$ is an elongated S — for Sum. $dx$ is an "infinitesimally small width," $f(x)$ is "the height at that instant," and their product is "the area of a sliver-thin rectangle." Add infinitely many such slivers and you have the total area under the curve.

But "area" is only one face of the integral. What it really does is: accumulate a continuously varying quantity over an interval. Integrate velocity to get displacement. Integrate power to get energy. Integrate a probability density to get a probability. Integrate a growth rate to get a population. Integrate GPU utilization to get total compute consumed. Any "rate times time, summed up" is an integral. 3Blue1Brown's beautiful framing: integrals answer "how much, in total, happened?" while derivatives answer "how fast is it changing right now?"

Why it's beautiful

Integrals let us precisely compute the area of irregular shapes, the displacement under non-uniform acceleration, the mass of an object with non-uniform density — all the accumulations that elementary geometry and arithmetic cannot reach. This is the modern descendant of the Greek method of exhaustion: Archimedes used a similar idea to compute the area of a circle ($\pi r^2$) and the area inside a parabolic segment. Two thousand years later, Newton and Leibniz gave humanity a general-purpose machine for the trick. Hardy would call it a quiet miracle: the very fact that we can take infinitely many things which each "equal zero" and add them up to get a finite, non-zero number.

Applications

Total charge equals current integrated over time. The energy of a signal is $\int |x(t)|^2 dt$. The expectation in probability is $\int x\,p(x)\,dx$. In diffusion models, the training objective is an integral over noise level $t$. Maxwell's equations, the Schrödinger equation, the Black–Scholes equation, Navier–Stokes for fluids — essentially every equation of modern science is a differential or integral equation.

History

The idea of integration is older than differentiation. Archimedes (~250 BCE) had already used "exhaustion" to derive the volume of a sphere, $\frac{4}{3}\pi r^3$ — the result he was so proud of he asked to have it carved on his tombstone. Cavalieri proposed the principle of "indivisibles" in the early 17th century — a precursor to the integral. Leibniz wrote down the $\int$ symbol for the first time in his notebook on 29 October 1675 — one of the rare moments in mathematical history we can date to the day.

Going deeper
  • 3Blue1Brown · Essence of Calculus, Ch.1 & Ch.8 (the intuition of integrals and area)
  • Strogatz · Infinite Powers (the arc from Archimedes to Newton)
English Insight
integral · integrand · antiderivative · Riemann sum. Definite integral — over a fixed interval. Indefinite integral / antiderivative — a function whose derivative is the integrand. Integrand — the function being integrated. Riemann sum — the finite sum that approximates the integral.
Something to chew on
Consider $\int_0^\infty \frac{\sin x}{x}\,dx = \frac{\pi}{2}$. The integrand oscillates between positive and negative forever — and yet the answer is a clean $\pi/2$. Why does this kind of "infinite oscillation that nevertheless converges to $\pi$" make mathematicians feel that the universe has a strange, hidden order?

The Fundamental Theorem of Calculus

Differentiation and integration are two sides of the same coin
Fundamental Theorem
$$\int_a^b f(x)\,dx = F(b) - F(a),\quad \text{where } F'(x)=f(x)$$
Intuition

The derivative turns "accumulated quantity" into "rate of change" (position → velocity). The integral turns "rate of change" back into "accumulated quantity" (velocity → position). They are inverse operations. This seemingly modest fact is the reason the entire subject of calculus exists.

A deeper picture: let $F(x)$ denote "the accumulated area from the start up to $x$." Step forward by a tiny $dx$, and the additional area is approximately $f(x) \cdot dx$ — height times width. In other words, $dF = f\,dx$, i.e. $F'(x) = f(x)$. "The instantaneous rate of the accumulation function is exactly the quantity being accumulated." That is the geometric soul of the Fundamental Theorem. It turns "computing the area under a curve" — which looks like it should require infinitely many additions — into "find an antiderivative and subtract two of its values."

f(x) F(x) = ∫f dx dF = f·dx x
Why it's beautiful

Before the 17th century, "finding the area under a curve" (quadrature) and "finding the slope of a tangent" (the tangent problem) were two completely separate crafts: the former belonged to the Archimedean tradition, the latter to the analytic geometry of Fermat and Descartes. Newton and Leibniz independently discovered that they are inverses of each other. It's as if someone had suddenly proved that "going up the stairs" and "going down the stairs" are the same act in two directions — two thousand years of independent inquiry merging into a single unified discipline. It is one of the rare moments in mathematical history when a bridge is discovered and the two shores become one. Strogatz calls it "the miracle of calculus."

Applications

Every integral that ever shows up in physics, probability, or statistics is computed, fundamentally, by finding an antiderivative and subtracting at the endpoints. Numerically, this is the theoretical backbone of Simpson's rule, the Runge–Kutta methods, and friends. In automatic differentiation (PyTorch/JAX), the forward pass accumulates, the backward pass differentiates — the same duality, in algorithmic form. In physics, Stokes' theorem, Green's theorem, and the Gauss divergence theorem are higher-dimensional generalizations — the same melody played in different keys.

History

The germ of the theorem appears in the 1650s geometry lectures of Isaac Barrow (Newton's mentor at Cambridge) — but Barrow himself didn't realize how revolutionary it was. Newton and Leibniz systematized it from different angles. Curiously, neither used the modern notion of "function" — the concept of a function would have to wait for Euler in the 18th century. So the rigorous formulation we teach today is really Cauchy's, completed in the 19th century. Two hundred years from intuitive discovery to rigorous statement.

3Blue1Brown's framing
Grant Sanderson: "If you only remember one thing from a calculus course, let it be this: the derivative and the integral are inverses of each other. Every formula, every technique, every application grows out of that one sentence."
Going deeper
  • 3Blue1Brown · Essence of Calculus, Ch.8: Integration & the fundamental theorem
  • Tristan Needham · Visual Complex Analysis (calculus through a geometric lens)
English Insight
fundamental theorem of calculus (FTC) · antiderivative · evaluate at endpoints. "Inverse operations" — the relationship between differentiation and integration. "Accumulation function" — the $F(x) = \int_a^x f$. In higher dimensions, the generalization is Stokes' theorem: "the integral over a boundary equals the integral of the derivative over the interior."
Something to chew on
Suppose you measured only the hourly rainfall rate (in mm/hour) over a single day. How would you compute the total rainfall for that day? The entire spirit of the Fundamental Theorem hides inside this everyday question. Then ask the inverse: if all you have is the running total, how would you recover the rate? Standing between those two questions, can you feel the duality between derivative and integral?