Complete methodology guide — from research question to final submission
IB EE Mathematics 4000 words 34 marks totalThe Extended Essay (EE) is a 4000-word independent research essay submitted as part of the IB Diploma. It is an opportunity to investigate a topic of personal interest in depth, demonstrating academic writing, research, and analytical skills.
A Mathematics EE is an independent mathematical investigation. Unlike a science EE (which requires experiments), a Maths EE is built on mathematical reasoning, proof, or modelling. The investigation should go beyond IB syllabus content while remaining accessible.
Explore a theorem, proof technique, number-theoretic property, or algebraic structure in depth. Example: properties of Fibonacci numbers, number bases, proof methods in combinatorics.
Apply mathematics (calculus, statistics, linear algebra) to a real-world context. Fit models to data, evaluate model accuracy, discuss limitations. Example: modelling epidemic spread using the SIR model.
Investigate the development of a mathematical idea, the historical significance of a result, or contributions of a mathematician. Must include substantial mathematical content — not just biography.
The research question (RQ) is the most important decision in your EE. A good RQ is specific, answerable, and genuinely investigable within 4000 words.
| Category | Example Research Question |
|---|---|
| Pure / Number Theory | "How does the Euclidean algorithm's efficiency compare to Bézout's identity method for finding GCD of large integers?" |
| Pure / Number Theory | "What are the properties of Mersenne primes, and what patterns emerge from their distribution?" |
| Pure / Geometry | "Which geometric constructions are possible using origami, compared to compass-and-straightedge methods?" |
| Applied / Statistics | "To what extent does the birthday paradox hold in real populations of IB school cohorts?" |
| Applied / Modelling | "How accurately can a logistic regression model predict outcomes in a binary classification dataset?" |
| Applied / Physics | "How effectively does the SIR model describe the spread of a contagious disease in a confined school population?" |
| Game Theory | "To what extent does Nash equilibrium predict observed behaviour in repeated Prisoner's Dilemma games?" |
| Historical | "How did Euler's approach to the Königsberg bridge problem establish the foundational concepts of graph theory?" |
| Historical | "How did the development of non-Euclidean geometry challenge the philosophical assumptions of 18th-century mathematics?" |
| Combinatorics | "How many distinct colouring patterns exist for a $3 \times 3$ grid using $k$ colours, accounting for rotational symmetry?" |
| Probability | "How does the Gambler's Ruin problem illuminate the limitations of the martingale betting strategy?" |
| Calculus | "How accurately can numerical integration methods (Trapezoid Rule, Simpson's Rule) approximate integrals with known exact values?" |
Frame a specific, answerable question. State clearly what you will investigate and what you expect to find (your conjecture or hypothesis). Define all terms in the RQ mathematically.
Example: "To what extent does the logistic growth model accurately describe population growth in isolated environments? I will test this by comparing logistic curve fits to historical population data from island populations, evaluating fit using $R^2$ and residual analysis."
Present the necessary theory, definitions, and theorems needed to understand your investigation. Show understanding beyond the IB syllabus. Cite sources for theorems you use but do not prove from scratch.
Be selective — include only what is relevant. Do not copy large sections of textbooks. Demonstrate your own understanding by explaining concepts in your own words.
This is the core of your essay (approximately 60% of the total length). Present examples, data, computations, or proofs systematically. Look for patterns, make conjectures, test them with further examples.
Mathematical claims must be justified. For pure math: attempt rigorous proof (by induction, contradiction, or direct argument). For applied math: test your model against data, compute error measures, discuss when the model breaks down. For historical analysis: evaluate the significance and correctness of historical arguments.
Critically discuss: What are the limitations of your approach? What assumptions did you make? What could be improved? What further investigations are possible? What surprised you? Personal reflection on the mathematical journey is valued in Criterion E (Engagement).
A well-structured EE is easy for the examiner to follow. The standard structure is:
A key assessment criterion is the quality of mathematical communication. The examiner assesses whether your mathematical writing is clear, precise, and follows accepted conventions.
x^2)Poor: "The sum of the first $n$ odd numbers equals $n^2$. Proof: You can check this for small cases."
(No proof, no notation, no rigour.)
Better:
Theorem: For all positive integers $n$, $\displaystyle\sum_{k=1}^n (2k-1) = n^2$.
Proof by induction: Base case: $n=1$: $\sum_{k=1}^1 (2k-1) = 1 = 1^2$. ✓
Inductive step: Assume true for $n=m$, i.e., $\sum_{k=1}^m (2k-1) = m^2$. Then for $n=m+1$: $$\sum_{k=1}^{m+1}(2k-1) = m^2 + (2(m+1)-1) = m^2 + 2m + 1 = (m+1)^2. \quad \square$$
The EE is marked out of 34 points across five criteria. Understanding these criteria helps you target your effort effectively.
Top marks require: precise RQ, well-defined method that you actually follow, comprehensive investigation that addresses the RQ directly.
This is the highest-weighted criterion. Errors in mathematics, undefined notation, or staying entirely within the syllabus all cost marks here.
Also 12 marks. Don't just compute — interpret, connect, prove. Ask "so what?" after every result.
The RPPF has three entries (submitted at the start, midpoint, and end of the process). Be honest and specific about your intellectual journey.
Introduction: Motivation (COVID-19, historical epidemics), context for mathematical modelling in epidemiology, statement of RQ, overview of method.
Background Mathematics:
Investigation:
Conclusion: The logistic model provides a reasonable description of single-wave outbreak dynamics (e.g., $R^2 = 0.96$ in the example dataset), but systematically underestimates the timing of the inflection point. Limitations: the model assumes homogeneous mixing and constant transmission rate — real populations are heterogeneous. Extensions: SIR model with vaccination compartment.
The following table classifies EE topic areas by typical difficulty and suitability for different students.
| Difficulty | Topic Area | Example Research Question |
|---|---|---|
| Accessible | Statistics / Probability | Birthday paradox verification in real school populations |
| Accessible | Number Theory | Properties of palindromic numbers in different bases |
| Accessible | Game Theory (intro) | Nash equilibrium analysis in simple 2×2 games |
| Accessible | Sequences & Series | Patterns in generalised Fibonacci sequences |
| Moderate | Calculus / Optimisation | Optimisation in economic models (Cobb–Douglas) |
| Moderate | Probability | Gambler's Ruin: analysis and simulation comparison |
| Moderate | Applied Modelling | Logistic model for population growth in island environments |
| Moderate | Combinatorics | Burnside's lemma applied to colouring problems |
| Challenging | Analysis | Convergence analysis of iterative root-finding methods |
| Challenging | Graph Theory | Chromatic polynomial and its properties for planar graphs |
| Challenging | Topology | Introduction to Euler characteristic and its invariance |
| Challenging | Differential Equations | Behaviour of solutions to the predator–prey (Lotka–Volterra) system |
scipy, matplotlib, numpy — curve fitting, numerical methods, data visualisationThe IB recommends MLA or APA format, used consistently throughout. Use a citation manager (Zotero, Mendeley) or the IBO's online bibliography generator. Every source consulted — books, websites, journal articles — must be cited.
| Milestone | Approximate Timeline | Key Deliverable |
|---|---|---|
| Topic brainstorm | DP Year 1, Term 1 | 3–5 candidate research questions |
| Supervisor meeting 1 | DP Year 1, Term 2 | Confirmed RQ, initial reading list |
| RPPF Entry 1 | After first supervisor meeting | Submitted to IBO coordinator |
| First draft | DP Year 1, Term 3 | Complete draft (all sections present) |
| Supervisor meeting 2 | DP Year 2, Term 1 | Feedback on draft, RPPF Entry 2 |
| Final submission | DP Year 2, early Term 2 | Final essay + RPPF Entry 3 |