Big-O, Big-Theta, Big-Omega: Formal Asymptotic Analysis
Formal definitions of Big-O, Big-Theta, and Big-Omega notation with limit-based proofs, common growth rates, and why Big-O dominates interviews while Big-Theta is more precise.
Terminology
| Term | Definition |
|---|---|
| Asymptotic analysis | Studying the behavior of a function as its input size $n$ approaches infinity, ignoring constant factors and lower-order terms |
| Big-O ($O$) | An upper bound on growth rate: $f(n) = O(g(n))$ means $f$ grows no faster than $g$ for sufficiently large $n$ |
| Big-Omega ($\Omega$) | A lower bound on growth rate: $f(n) = \Omega(g(n))$ means $f$ grows at least as fast as $g$ for sufficiently large $n$ |
| Big-Theta ($\Theta$) | A tight bound: $f(n) = \Theta(g(n))$ means $f$ grows at the same rate as $g$, combining both $O$ and $\Omega$ |
| Witness pair | The specific constants $c$ and $n_0$ that satisfy an asymptotic definition, proving the bound holds |
| Growth rate | How quickly a function's output increases as its input increases, e.g. linear ($n$), quadratic ($n^2$), logarithmic ($\log n$) |
| Limit-based proof | Using $\lim_{n \to \infty} f(n)/g(n)$ to determine the asymptotic relationship between $f$ and $g$ |
| Dominant term | The fastest-growing term in a polynomial or sum, which determines the overall asymptotic behavior |
| Little-o ($o$) | A strict upper bound: $f(n) = o(g(n))$ means $f$ grows strictly slower than $g$, not just "no faster" |
What & Why
When you write an algorithm, you need a way to describe how its running time or memory usage scales with input size. Saying "my function takes 3.2 milliseconds" is useless because that number changes with hardware, language, and compiler optimizations. Asymptotic notation strips away those irrelevant details and captures the fundamental growth pattern.
Big-O, Big-Omega, and Big-Theta are the three core notations. Big-O gives an upper bound ("at most this fast"), Big-Omega gives a lower bound ("at least this fast"), and Big-Theta gives a tight bound ("exactly this fast, up to constants"). Together they form a complete toolkit for classifying algorithm performance.
In practice, interviews and casual conversation almost always use Big-O, even when Big-Theta would be more accurate. Saying "merge sort is $O(n \log n)$" is technically correct but imprecise: merge sort is actually $\Theta(n \log n)$ because it always does $n \log n$ work regardless of input. Understanding the distinction matters when you need to reason precisely about best-case, worst-case, and average-case behavior.
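This can be checked empirically. The sketch below (a hypothetical helper, not a library function) counts element comparisons in a textbook top-down merge sort for both sorted and reverse-sorted input; both counts land within a small constant factor of $n \log_2 n$, illustrating why the bound is tight:

```python
import math

def merge_sort(a, counter):
    # Hypothetical helper: returns a sorted copy of a, tallying
    # element comparisons in counter[0].
    if len(a) <= 1:
        return a
    mid = len(a) // 2
    left = merge_sort(a[:mid], counter)
    right = merge_sort(a[mid:], counter)
    merged, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        counter[0] += 1  # one element comparison
        if left[i] <= right[j]:
            merged.append(left[i]); i += 1
        else:
            merged.append(right[j]); j += 1
    merged.extend(left[i:])
    merged.extend(right[j:])
    return merged

n = 4096  # log2(n) = 12
for label, data in (("sorted", list(range(n))), ("reversed", list(range(n, 0, -1)))):
    counter = [0]
    assert merge_sort(data, counter) == sorted(data)
    print(f"{label:>8}: {counter[0]} comparisons vs n*log2(n) = {int(n * math.log2(n))}")
```

Both inputs produce comparison counts of the same order, which is exactly the $\Theta$ claim: the work is bounded above and below by constant multiples of $n \log n$.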
How It Works
Formal Definitions
The three notations are defined using existential quantifiers over constants and thresholds.
Big-O (upper bound): $f(n) = O(g(n))$ if there exist positive constants $c$ and $n_0$ such that $0 \le f(n) \le c \cdot g(n)$ for all $n \ge n_0$.
Big-Omega (lower bound): $f(n) = \Omega(g(n))$ if there exist positive constants $c$ and $n_0$ such that $0 \le c \cdot g(n) \le f(n)$ for all $n \ge n_0$.
Big-Theta (tight bound): $f(n) = \Theta(g(n))$ if there exist positive constants $c_1$, $c_2$, and $n_0$ such that $0 \le c_1 \cdot g(n) \le f(n) \le c_2 \cdot g(n)$ for all $n \ge n_0$.
Equivalently, $f(n) = \Theta(g(n))$ if and only if $f(n) = O(g(n))$ and $f(n) = \Omega(g(n))$.
The Limit Method
Instead of finding explicit witness constants, you can evaluate the limit $L = \lim_{n \to \infty} f(n)/g(n)$:
- If $0 < L < \infty$, then $f(n) = \Theta(g(n))$
- If $L = 0$, then $f(n) = O(g(n))$ but $f(n) \ne \Omega(g(n))$, so $f$ grows strictly slower than $g$
- If $L = \infty$, then $f(n) = \Omega(g(n))$ but $f(n) \ne O(g(n))$, so $f$ grows strictly faster than $g$
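The limit test can be approximated numerically by sampling the ratio $f(n)/g(n)$ at increasing $n$. The sketch below uses hypothetical helper names and crude heuristic thresholds; it illustrates the idea but is not a proof:

```python
def ratio_trend(f, g, ns=(10**2, 10**3, 10**4, 10**5)):
    # Sample f(n)/g(n) at increasing n to estimate the limiting behavior.
    return [f(n) / g(n) for n in ns]

def classify(f, g):
    # Heuristic classification based on whether the sampled ratio
    # shrinks toward 0, blows up, or settles near a positive constant.
    rs = ratio_trend(f, g)
    if rs[-1] < rs[0] / 10 and rs[-1] < 0.01:
        return "f grows strictly slower than g"
    if rs[-1] > rs[0] * 10 and rs[-1] > 100:
        return "f grows strictly faster than g"
    return "f = Theta(g) (ratio approaches a positive constant)"

print(classify(lambda n: n**2, lambda n: n**3))
print(classify(lambda n: 3*n**2 + 5*n + 2, lambda n: n**2))
print(classify(lambda n: n**3, lambda n: n**2))
```

For $n^2/n^3 = 1/n$ the sampled ratio shrinks toward 0; for $(3n^2 + 5n + 2)/n^2$ it settles near 3; for $n^3/n^2 = n$ it grows without bound, matching the three limit cases above.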
Worked Example: Proving $3n^2 + 5n + 2 = \Theta(n^2)$
Upper bound ($O$): For $n \ge 1$: $3n^2 + 5n + 2 \le 3n^2 + 5n^2 + 2n^2 = 10n^2$. So $c_2 = 10$, $n_0 = 1$.
Lower bound ($\Omega$): For all $n \ge 1$: $3n^2 + 5n + 2 \ge 3n^2$. So $c_1 = 3$, $n_0 = 1$.
Limit method: $\lim_{n \to \infty} \frac{3n^2 + 5n + 2}{n^2} = \lim_{n \to \infty} (3 + 5/n + 2/n^2) = 3$. Since $0 < 3 < \infty$, we confirm $\Theta(n^2)$.
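The witness pairs from this proof can be checked numerically. A minimal sketch (this verifies the inequalities for sampled $n$ only, not for all $n$; the algebra above is the actual proof):

```python
def f(n):
    return 3 * n**2 + 5 * n + 2

# Witness pairs from the worked example: c2 = 10 and c1 = 3, both with n0 = 1.
assert all(f(n) <= 10 * n**2 for n in range(1, 100_001))  # upper bound (O)
assert all(f(n) >= 3 * n**2 for n in range(1, 100_001))   # lower bound (Omega)
print("Both bounds hold for every n in [1, 100000]")
```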
Why Big-O Dominates Interviews
In interviews, people say "this algorithm is $O(n \log n)$" when they really mean $\Theta(n \log n)$. Big-O became the default because it answers the most common question: "What is the worst this can get?" An upper bound is a guarantee. If your algorithm is $O(n^2)$, you know it will never be worse than quadratic, even if it is sometimes faster.
Big-Theta is more informative because it pins down the exact growth rate. Saying merge sort is $\Theta(n \log n)$ tells you it always does $n \log n$ work, not just "at most $n \log n$." But in casual usage, the distinction is often glossed over.
Hierarchy of Growth Rates
From slowest- to fastest-growing, the common rates order as $O(1) \subset O(\log n) \subset O(n) \subset O(n \log n) \subset O(n^2) \subset O(2^n) \subset O(n!)$. Each step up the hierarchy eventually dwarfs everything below it, no matter the constant factors.
Complexity Analysis
| Growth Rate | Name | $n = 10$ | $n = 1000$ | $n = 10^6$ |
|---|---|---|---|---|
| $O(1)$ | Constant | 1 | 1 | 1 |
| $O(\log n)$ | Logarithmic | 3.3 | 10 | 20 |
| $O(n)$ | Linear | 10 | 1,000 | 1,000,000 |
| $O(n \log n)$ | Linearithmic | 33 | 10,000 | 20,000,000 |
| $O(n^2)$ | Quadratic | 100 | 1,000,000 | $10^{12}$ |
| $O(2^n)$ | Exponential | 1,024 | $\approx 10^{301}$ | Intractable |
| $O(n!)$ | Factorial | 3,628,800 | Intractable | Intractable |
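The table's entries follow directly from the formulas; a quick check in Python (values rounded, matching the table up to rounding):

```python
import math

# Reproduce the logarithmic, linear, and linearithmic columns of the table.
for n in (10, 1000, 10**6):
    log_n = math.log2(n)
    print(f"n={n:>7}: log2(n)={log_n:6.1f}  n*log2(n)={n * log_n:12.0f}  n^2={n**2:.0e}")
```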
Key relationships between the notations:
- Transpose symmetry: $f(n) = O(g(n))$ if and only if $g(n) = \Omega(f(n))$
- $f(n) = \Theta(g(n))$ if and only if $f(n) = O(g(n))$ and $f(n) = \Omega(g(n))$
- $f(n) = o(g(n))$ implies $f(n) = O(g(n))$, but the converse does not hold
Implementation
ALGORITHM ProveUpperBound(f, g, c, n0)
INPUT: f: function, g: function, c: positive constant, n0: threshold
OUTPUT: TRUE if f(n) <= c * g(n) for every tested n >= n0 (a numeric sanity check over a finite range, not a formal proof)
BEGIN
    FOR n <- n0 TO LARGE_VALUE DO
        IF f(n) > c * g(n) THEN
            RETURN FALSE
        END IF
    END FOR
    RETURN TRUE
END
ALGORITHM ClassifyGrowthByLimit(f, g)
INPUT: f: function, g: function
OUTPUT: relationship string
BEGIN
    L <- LIMIT(f(n) / g(n)) as n -> infinity
    IF L = 0 THEN
        RETURN "f(n) = o(g(n)), so f grows strictly slower than g"
    ELSE IF 0 < L < infinity THEN
        RETURN "f(n) = Theta(g(n)), tight bound confirmed"
    ELSE
        RETURN "f(n) = Omega(g(n)) but f(n) != O(g(n)), so f grows strictly faster than g"
    END IF
END
ALGORITHM FindDominantTerm(terms)
INPUT: terms: list of (coefficient, exponent) pairs representing a polynomial
OUTPUT: the dominant term's exponent
BEGIN
    maxExponent <- -infinity
    FOR EACH (coeff, exp) IN terms DO
        IF exp > maxExponent AND coeff != 0 THEN
            maxExponent <- exp
        END IF
    END FOR
    RETURN maxExponent
END
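The pseudocode above maps directly to Python. A hedged sketch (function names and the sampling bound `large_value` are choices made here, not part of the algorithms themselves):

```python
def prove_upper_bound(f, g, c, n0, large_value=100_000):
    # Numeric sanity check: does f(n) <= c * g(n) hold for every
    # sampled n in [n0, large_value]? Not a formal proof.
    return all(f(n) <= c * g(n) for n in range(n0, large_value + 1))

def find_dominant_term(terms):
    # terms: list of (coefficient, exponent) pairs for a polynomial.
    # Returns the exponent of the dominant (fastest-growing) term.
    exps = [e for coeff, e in terms if coeff != 0]
    return max(exps) if exps else float("-inf")

# Witness pair from the worked example: c = 10, n0 = 1.
print(prove_upper_bound(lambda n: 3*n**2 + 5*n + 2, lambda n: n**2, 10, 1))  # -> True
# 3n^2 + 5n + 2: the dominant term is n^2.
print(find_dominant_term([(3, 2), (5, 1), (2, 0)]))  # -> 2
```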
Real-World Applications
- Algorithm selection: choosing between $O(n^2)$ insertion sort and $O(n \log n)$ merge sort depends on input size; for small $n$, the constant factors in insertion sort make it faster despite worse asymptotic complexity
- Database query planning: query optimizers estimate the asymptotic cost of different execution plans (nested loop join at $O(n \cdot m)$ vs hash join at $O(n + m)$) to pick the fastest strategy
- API rate limiting: understanding that a quadratic algorithm in a request handler will cause timeouts at scale, while a linear one will not
- Competitive programming: given a time limit and $n \le 10^5$, you know an $O(n^2)$ solution will not pass but $O(n \log n)$ will
- System capacity planning: predicting how server costs scale with user growth requires knowing whether core operations are logarithmic, linear, or worse
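The competitive-programming rule of thumb can be quantified. Assuming a rough budget of $10^8$ simple operations per second (a common heuristic, not a measured figure), compare the operation counts at $n = 10^5$:

```python
import math

BUDGET = 10**8  # assumed operations per second; a rough heuristic, not measured
n = 10**5

quadratic = n**2                      # operation count for an O(n^2) solution
linearithmic = int(n * math.log2(n))  # operation count for an O(n log n) solution

print(f"O(n^2):     {quadratic:.1e} ops -> roughly {quadratic // BUDGET} seconds")
print(f"O(n log n): {linearithmic:.1e} ops -> well under one second")
```

At $n = 10^5$, the quadratic count ($10^{10}$) exceeds the budget by two orders of magnitude, while the linearithmic count (about $1.7 \times 10^6$) fits easily, which is exactly why the rule of thumb works.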
Key Takeaways
- Big-O is an upper bound, Big-Omega is a lower bound, and Big-Theta is a tight bound that combines both
- The limit method ($\lim f/g$) is often the fastest way to classify the relationship between two functions
- Big-O dominates everyday usage because it answers "what is the worst case?", but Big-Theta is more precise when the best and worst cases match
- Constant factors and lower-order terms are stripped away, which is the whole point: asymptotic analysis captures the shape of growth, not the exact count
- The growth rate hierarchy from $O(1)$ to $O(n!)$ determines what is tractable at scale and what is not