5 Complex Numbers
In Section 4.4, we have proven the existence of \(\sqrt{x}\) for all for \(x \geq 0\). The number \(\sqrt{x}\) was defined as \[ \sqrt{x}:= \alpha \,, \quad \alpha := \sup \{ t \in \mathbb{R}\, \colon \,t^2 < x \}\,. \] We then proved that \[ \alpha^2 = x \,. \] This procedure is possible for any \(x \geq 0\).
Question 1
The answer to the above question is no. This is because \(\mathbb{R}\) is an ordered field, and from axiom (MO) it follows that: \[ x^2 \geq 0 \,, \quad \forall \, x \in \mathbb{R}\,. \] However we would still like to solve equation (5.1) somehow. To do this, we introduce the imaginary numbers or complex numbers. We define \(i\) to be that number such that \[ i^{2}=-1 \,. \] Formally, we can also think of \(i\) as \[ i =\sqrt{-1} \,. \] We can use this speacial number to define the square root of a negative real number \(x<0\): \[ \sqrt{x}: = i \sqrt{-x} \,. \] Note that \(\sqrt{-x}\) is properly defined in \(\mathbb{R}\), because \(-x\) is positive when \(x\) is negative.
5.1 The field \(\mathbb{C}\)
We would like to be able to do calculations with the newly introduced complex numbers, and investigate their properties. We can introduce them rigorously as a field, as we did for \(\mathbb{R}\).
Definition 2: Complex Numbers
\[ \mathbb{C}:= \mathbb{R}\oplus i \mathbb{R}:= \{x \oplus i y \, \colon \,x, y \in \mathbb{R}\} \,. \]
In the above the symbol \(\oplus\) is used to denote the pair \[ x \oplus i y = (x,y) \] with \(x,y \in \mathbb{R}\). This means that \(x\) and \(y\) play different roles.
Definition 3
For a complex number \[ z = x \oplus i y \in \mathbb{C} \] we say that
- \(x\) is the real part of \(z\), and denote it by \[ x = \operatorname{Re}(z) \]
- \(y\) is the imaginary part of \(z\), and denote it by \[ y = \operatorname{Im}(z) \]
We say that
- If \(\operatorname{Re}z = 0\) then \(z\) is a purely imaginary number.
- If \(\operatorname{Im}z = 0\) then \(z\) is a real number.
In order to make the set \(\mathbb{C}\) into a field, we first have to define the two binary operations of addition \(+\) and multiplication \(\cdot\), \[ + , \cdot \,\, \colon \mathbb{C}\times \mathbb{C}\to \mathbb{C}\,. \] Then we need to prove that these operations satisfy all the field axioms.
Definition 4: Addition in \(\mathbb{C}\)
Clearly, \(z_1 + z_2\) as defined above is an element of \(\mathbb{C}\). Therefore \(+\) defines a binary operation over \(\mathbb{C}\).
Notation 5
\[ (x \oplus i 0 )+(0 \oplus i y)= x \oplus i y . \] To simplify notation, we will write \[ x \oplus i 0 = x \,, \quad \, 0 \oplus i y = i y \] and \[ x \oplus i y = x + i y \,. \] We will also often swap \(i\) and \(y\), writing equivalently \[ x + i y = x + y i \,. \]
We now want to define multiplication between complex numbers.
Remark 6: Formal calculation for multiplication in \(\mathbb{C}\)
\[\begin{align*} z_1 \cdot z_2 & = \left(x_{1}+i y_{1} \right) \cdot \left(x_{2}+ i y_{2} \right) \\ & = x_{1} \cdot x_{2} + x_{1} \cdot i y_{2} + x_{2} \cdot i y_{1} + y_{1} \cdot i^2 y_{2} \\ & = \left(x_{1} \cdot x_{2} - y_{1} \cdot y_{2} \right) + i \left(x_{1} \cdot y_{2} + x_{2} \cdot y_{1} \right) \end{align*}\]
Remark 6 motivates the following definition of multiplication.
Definition 7: Multiplication in \(\mathbb{C}\)
Clearly, \(z_1 \cdot z_2\) as defined above is an element of \(\mathbb{C}\). Therefore \(\cdot\) defines a binary operation over \(\mathbb{C}\).
Remark 8
\[\begin{align*} i^{2} & = (0+ 1 i) \cdot (0+ 1 i) \\ & = (0 \cdot 0 - 1 \cdot 1) + (0 \cdot 1 + 0 \cdot 1) i = - 1 \,. \end{align*}\]
Important
Example 9
Solution. Using the definition we compute \[\begin{align*} z \cdot w & = (-2+3 i) \cdot (1 - i) \\ & = (-2-(-3))+(2+3) i \\ & = 1 + 5 i \, . \end{align*}\] Alternatively, we can proceed formally as in Remark 6. We just need to recall that \(i^2\) has to be replaced with \(-1\): \[\begin{align*} z \cdot w & = (-2+3 i) \cdot (1 - i) \\ & = - 2 + 2i + 3i - 3 i^2 \\ & = (-2 + 3 ) + ( 2 + 3 ) i \\ & = 1 + 5 i \, . \end{align*}\]
We now want to check that \[ (\mathbb{C}, + , \cdot) \] is a field. All the field axioms are trivial to check, except for the existence of additive and multiplicative inverses.
Proposition 10: Additive inverse in \(\mathbb{C}\)
The proof is immediate and is left as an exercise. The multiplication requires more care.
Remark 11: Formal calculation for multiplicative inverse
Motivated by the above remark, we define inverses in \(\mathbb{C}\) in the following way.
Proposition 12: Multiplicative inverse in \(\mathbb{C}\)
Proof
Important
Example 13
Solution. By the formula in Propostion 12 we immediately get \[ z^{-1} = \frac{3}{3^{2}+2^{2}} + \, \frac{-2}{3^{2}+2^{2}} \, i = \frac{3}{13}-\frac{2}{13} \, i \,. \] Alternatively, we can proceed formally as in Remark 11 \[\begin{align*} (3+2 i)^{-1} & = \frac{1}{3+2 i} \\ & = \frac{1}{3+2 i} \, \frac{3-2 i}{3-2 i} \\ & = \frac{3-2 i}{3^2+2^2} \\ & = \frac{3}{13}-\frac{2}{13} i \,, \end{align*}\] and obtain the same result.
We can now prove that \(\mathbb{C}\) is a field.
Theorem 14
Proof
(A1) To show that \(+\) is commutative, note that \[\begin{align*} (x+iy)+(a+ ib) & = (x+a)+ i (y+b) \\ & = (a + x)+ i (b+y) \\ & = (a + i b ) + (x + i y) \,, \end{align*}\] where we used Definition 64 in the first and last equality, and the commutative property of the real numbers (which holds since by definition \(\mathbb{R}\) is a field) in the second equality. Associativity can be checked in the same way.
(A2) The neutral element of addition is \(0\), as stated in Proposition 79.
(A3) Existence of additive inverses is given by Proposition 79.
For multiplication we have:
(M1) Commutativity and associativity of product in \(\mathbb{C}\) can be checked using Definition 7 and commutativity and associativity of sum and multiplication in \(\mathbb{R}\).
(M2) The neutral element of multiplication is \(1\), as stated in Propostion 12.
(M3) Existence of multiplicative inverses is guaranteed by Proposition 12.
Finally one should check the associative property (AM). This is left as an exercise.
5.1.1 Division in \(\mathbb{C}\)
Suppose we want to divide two complex numbers \(w,z \in \mathbb{C}, z \neq 0\), with \[ z = x + i y \,, \quad w = a + ib \,. \] We have two options:
Use the formula for the inverse from Proposition 12 and compute \[ z^{-1} := \frac{x}{x^{2}+y^{2}}+ i \, \frac{-y}{x^{2}+y^{2}} \,. \] Then we use the multiplication formula of Definition 7 to compute \[\begin{align*} \frac{w}{z} & = w \cdot z^{-1} \\ & = (a + i b) \, \cdot \left( \frac{x}{x^{2}+y^{2}}+ i \, \frac{-y}{x^{2}+y^{2}} \right) \\ & = \frac{ (ax + by) + i (bx - ay) }{ x^2 + y^2 } \end{align*}\]
Proceed formally as in Remark 11, using the multiplication by \(1\) trick. We would have \[\begin{align*} \frac{w}{z} & = \frac{ a + i b }{ x + i y } \\ & = \frac{ a + i b }{ x + i y } \, \frac{ x - i y }{ x - i y } \\ & = \frac{ (ax + by) + i (bx - ay) }{ x^2 + y^2 } \end{align*}\]
Example 15
Question. Let \(w=1+i\) and \(z=3-i\). Compute \(\frac{w}{z}\).
Solution. We compute \(w/z\) using the two options we have:
Using the formula for the inverse from Proposition 12 we compute \[\begin{align*} z^{-1} & = \frac{x}{x^{2}+y^{2}} + i \, \frac{-y}{x^{2}+y^{2}} \\ & = \frac{3}{3^2 + 1^2} - i \, \frac{-1}{3^2 + 1^2} \\ & = \frac{3}{10} + \frac{1}{10} \, i \end{align*}\] and therefore \[\begin{align*} \frac{w}{z} & = w \cdot z^{-1} \\ & = (1 + i) \, \left( \frac{3}{10} + \frac{1}{10} \, i \right) \\ & = \left(\frac{3}{10}-\frac{1}{10}\right)+\left(\frac{1}{10}+\frac{3}{10}\right) i \\ & = \frac{2}{10}+\frac{4}{10} i \\ & = \frac{1}{5}+\frac{2}{5} i \end{align*}\]
We proceed formally, using the multiplication by \(1\) trick. We have \[\begin{align*} \frac{w}{z} & = \frac{1+i}{3-i} \\ & = \frac{1+i}{3-i} \frac{3+i}{3+i} \\ & = \frac{3-1+(3+1) i}{3^2+1^2} \\ & =\frac{2}{10}+\frac{4}{10} i \\ & = \frac{1}{5}+\frac{2}{5} i \end{align*}\]
5.1.2 \(\mathbb{C}\) is not ordered
We have seen that \((\mathbb{C},+,\cdot)\) is a field. One might wonder whether \(\mathbb{C}\) is also an ordered field. It turns out that this is not the case.
Theorem 16
Proof
Hence, it is not possible to compare two complex numbers.
5.1.3 Completeness of \(\mathbb{C}\)
One might also wonder whether \(\mathbb{C}\) is complete. Our definition of completeness uses the notions of supremum and infimum, which require an order relation. However, \(\mathbb{C}\) is not ordered, as seen in Theorem 16. Therefore we cannot define infimum and supremum of subsets of \(\mathbb{C}\), and we have to find an alternative way to discuss completeness.
The right way to introduce completeness in \(\mathbb{C}\) is by using the notion of Cauchy sequence. In ordered fields, this new definition of completeness is equivalent to the definition which uses the supremum.
5.2 Complex conjugates
When computing inverses, we used the trick to multiply by \(1\): \[ z^{-1} = \frac{1}{z} \cdot 1 = \frac{1}{x + i y} \cdot \frac{x-iy}{x-iy} \,. \] The complex number \(x-iy\) is obtained by changing the sign to the imaginary part of \(z = x+iy\). We give a name to this operation.
Definition 17: Complex conjugate
\[ \bar{z}=x- i y \, . \]
Example 18
Complex conjugates have the following properties:
Theorem 19
For all \(z_1, z_2 \in \mathbb{C}\) it holds:
\(\overline{z_1 + z_2 }=\overline{z_1}+\overline{z_2}\)
\(\overline{z_1 \cdot z_2}=\overline{z_1} \cdot \overline{z_2}\)
Proof
Using the definition of addition in \(\mathbb{C}\) and of conjugate, \[\begin{align*} \overline{z_{1}+z_{2}} & =\overline{\left(x_{1}+i y_{1} \right)+\left(x_{2}+ i y_{2} \right)} \\ & =\overline{\left(x_{1}+x_{2}\right)+ i \left(y_{1}+y_{2}\right)} \\ & =\left(x_{1}+x_{2}\right)- i \left(y_{1}+y_{2}\right) \\ & =\left(x_{1}- i y_{1} \right) + \left(x_{2}-i y_{2} \right) \\ & =\overline{x_{1}+ i y_{1} } + \overline{x_{2}+i y_{2} } \\ & =\overline{z_{1}}+\overline{z_{2}} \,. \end{align*}\]
Using the definition of multiplication in \(\mathbb{C}\) and of conjugate,
\[\begin{align*} \overline{z_{1} \cdot z_{2}} & = \overline{\left(x_{1}+i y_{1} \right) \cdot\left(x_{2}+ i y_{2} \right)} \\ & = \overline{\left(x_{1} x_{2}-y_{1} y_{2}\right)+ i \left(x_{1} y_{2}+x_{2} y_{1}\right)} \\ & = \left(x_{1} x_{2}-y_{1} y_{2}\right)- i \left(x_{1} y_{2}+x_{2} y_{1}\right) \\ & =\left(x_{1}-i y_{1} \right) \cdot\left(x_{2}- i y_{2} \right) \\ & =\overline{z_{1}} \cdot \overline{z_{2}} \end{align*}\]
Example 20
Let \(z_{1}=3-4 i\) and \(z_{2}=-2+5 i\). Then
Let us check that \[ \overline{z_{1}+z_{2}}=\overline{z_{1}}+\overline{z_{2}} \] Indeed, we have \[ z_1 + z_2 = 1 + i \quad \implies \quad \overline{z_1 + z_2} = 1 - i \, . \] On the other hand \[ \overline{z_{1}} = 3 + 4i \,, \quad \overline{z_{2}} = -2 -5i \quad \implies \quad \overline{z_{1}} + \overline{z_{2}}= 1-i \,. \]
Let us check that \[ \overline{z_{1} \cdot z_{2}}=\overline{z_{1}} \cdot \overline{z_{2}} \] Indeed, \[\begin{align*} z_{1} \cdot z_{2} & = (3+4i) \cdot (-2 + 5i) \\ & = (- 6 + 20)+ (8 + 15 )i \\ & = 14 + 23 i \end{align*}\] so that \[ \overline{z_{1} \cdot z_{2}} = 14 - 23 i \] On the other hand: \[\begin{align*} \overline{z_{1}} \cdot \overline{z_{2}} & =(3+4 i) \cdot(-2-5 i) \\ & = (-6+20)+(-15-8) i \\ & = 14 - 23 i \end{align*}\]
5.3 The complex plane
We can represent a real number \(x\) as a point on the one-dimensional real line \(\mathbb{R}\). The distance between two real numbers \(x, y \in \mathbb{R}\) on the real line is given by \(|x-y|\), see Figure 5.1.
We would like to do something similar for the complex numbers, i.e. points \[ z = x + iy \,, \quad x , \, y \in \mathbb{R}\,. \] We represent \(z= x + i y\) in the two-dimensional plane at the point with (Cartesian) coordinates \((x, y)\). This two-dimensional plane in which we can represent all complex numbers is called the complex plane. The origin of such plane, with coordinates \((0,0)\), corresponds to the complex number \[ 0+0 i = 0 \,, \] see Figure 5.2.
5.3.1 Distance on \(\mathbb{C}\)
The Cartesian representation allows us to introduce a distance between two complex numbers. Let us start with the distance between a complex number \(z=x+i y\) and \(0\). By Pythagoras Theorem this distance is given by \[ \sqrt{x^{2}+y^{2}} \,, \] see Figure 5.2. We give a name to this quantity.
Definition 21: Modulus
Note that the distance between \(z\) and \(0\) is always a non-negative number.
Remark 22: Modulus of Real numbers
We can now define the distance between two complex numbers.
Definition 23: Distance in \(\mathbb{C}\)
The geometric intuition of why the quantity \(|z_1 - z_2|\) is defined as the distance between \(z_1\) and \(z_2\) is given in Figure 5.3.
Theorem 24
Proof
Example 25
Solution. The distance is \[\begin{align*} |z- w| & = |(2-4 i)-(-5+i)| \\ & = |7-5 i| \\ & =\sqrt{7^{2}+(-5)^{2}} \\ & =\sqrt{74} \end{align*}\]
5.3.2 Properties of modulus
The modulus has the following properties.
Theorem 26
Let \(z, z_{1}, z_{2} \in \mathbb{C}\). Then
\(\left|z_1 \cdot z_2\right|=\left|z_{1}\right|\left|z_{2}\right|\)
\(\left|z^{n}\right|=|z|^{n}\) for all \(n \in \mathbb{N}\)
\(z \cdot \bar{z}=|z|^{2}\)
Proof
Part 2. Exercise. It easily follows from Point 1 and induction.
Part 3. Let \(z=x+ i y\) for some \(x, y \in \mathbb{R}\). Then,
\[\begin{align*} z \cdot \bar{z} & = (x+i y)(x- i y) \\ & = x^{2}-(i y )^{2} \\ & = x^{2}+y^{2} \\ & =|z|^{2} \end{align*}\]
The modulus in \(\mathbb{C}\) satisfies the triangle inequality.
Theorem 27: Triangle inequality in \(\mathbb{C}\)
For all \(x, y, z \in \mathbb{C}\),
\(|x+y| \leq|x|+|y|\)
\(|x-z| \leq|x-y|+|y-z|\)
Proof
This last statement is clearly true, since \(a d-b c \in \mathbb{R}\). Therefore (5.3) holds, and so (5.2) follows.
Part 2. Using (5.2) we estimate \[ |x-z|=|x-y+y-z| \leq|x-y|+|y-z| . \]
Remark 28: Geometric interpretation of triangle inequality
5.4 Polar coordinates
We have seen that we can identify a complex number \(z=x+iy\) by a point in the complex plane with Cartesian coordinates \((x, y)\). We can also specify the point \((x,y)\) by using the so-called polar coordinates \((\rho, \theta)\), where
\(\rho\) is the distance between \(z\) and the origin \[ \rho = |z| = \sqrt{x^2 + y^2} \]
\(\theta\) is the angle between the line connecting the origin and \(z\) and the positive real axis, see Figure 5.5.
We give such angle a name.
Definition 29: Argument
Warning
Remark 30: Principal Value
Example 31
We can represent any non-zero complex number in polar coordinates.
Theorem 32: Polar coordinates
The proof of Theorem 77 is trivial, and is based on basic trigonometry and definition of \(\arg(z)\). Complex numbers in polar form can be useful. We give a name to such polar form.
Definition 33: Trigonometric form
Example 34
Solution. We have \[\begin{gather*} x = \rho \cos (\theta) = \sqrt{8} \cos \left( \frac{3}{4} \pi \right) = - \sqrt{8} \cdot \frac{\sqrt{2}}{2} = -2 \\ y = \rho \sin (\theta) = \sqrt{8} \sin \left( \frac{3}{4} \pi \right) = \sqrt{8} \cdot \frac{\sqrt{2}}{2} = 2 \,. \end{gather*}\] Therefore, the cartesian form of \(z\) is \[ z = x + i y = - 2 + 2 i \,. \]
As a consequence of Theorem 77 we obtain a formula for computing the argument.
Corollary 35: Computing \(\arg(z)\)
Proof
When \(x>0\): Then \(z = x + iy\) belongs to the 1st or 4th quadrant. Therefore the argument of \(z\) is \(\theta \in (-\pi/2,\pi /2)\). Such interval coincides with the domain of \(\tan\), and therefore \[ \theta =\arctan \left( \dfrac{y}{x} \right) \,. \]
When \(x<0\) and \(y \geq 0\): Then \(z = x + iy\) lies in the 2nd quadrant. Thus, the argument of \(z\) is \(\theta \in (\pi/2,\pi)\). Hence \[ \theta =\arctan \left( \dfrac{y}{x} \right) + \pi \,. \]
When \(x<0\) and \(y < 0\): Then \(z = x + iy\) lies in the 3rd quadrant. Thus, the argument of \(z\) is \(\theta \in (-\pi,\pi/2)\). Hence \[ \theta =\arctan \left( \dfrac{y}{x} \right) - \pi \,. \]
When \(x=0\) and \(y > 0\): Then \(z = iy\) is imaginary with \(y>0\), meaning that \(\theta = \pi/2\).
When \(x=0\) and \(y < 0\): Then \(z = iy\) is imaginary with \(y<0\), meaning that \(\theta = -\pi/2\).
This exhausts all the cases, and the proof is concluded.
Example 36
Solution. Using the formula for \(\arg\) in Corollary 98 we have \[\begin{align*} \arg (3+4 i) & =\arctan \left(\frac{4}{3}\right) \\ \arg (3-4 i) & =\arctan \left(-\frac{4}{3}\right) = -\arctan \left(\frac{4}{3}\right) \\ \arg (-3+4 i) & = \arctan \left(-\frac{4}{3}\right) + \pi = - \arctan \left(\frac{4}{3}\right) + \pi \\ \arg (-3-4 i) & = \arctan \left(\frac{4}{3}\right) - \pi \end{align*}\]
5.5 Exponential form
We have seen that we can represent complex numbers in
- Cartesian form
- Trigonometric form
We now introduce a third way of representing complex numbers: the exponential form. For this, we need Euler’s identity:
Theorem 37: Euler’s identity
Proof
Theorem 38
Proof
Theorem 39
Definition 40: Exponential form
Example 41
Solution. From Example 97 we know that \(z = -2+2i\) can be written in trigonometric form as \[ z = \sqrt{8} \left[ \cos \left( \frac{3}{4} \pi \right) + i \sin \left( \frac{3}{4} \pi \right) \right] \,. \] By Euler’s identity we hence obtain the exponential form \[ z=\sqrt{8} e^{i \frac{3}{4} \pi} \,. \]
Remark 42: Periodicity of exponential
For all \(k \in \mathbb{Z}\) we have \[ e^{i \theta}=e^{i(\theta+2 \pi k)} \,, \tag{5.5}\] meaning that the complex exponential is \(2\pi\)-periodic. As we did for the principal value of the argument, we select \(\theta\) in (A.18) so that \(\theta \in(-\pi, \pi]\).
Equation (A.18) follows immediately by Euler’s identity and periodicity of \(\cos\) and \(\sin\), since \[\begin{align*} e^{i(\theta+2 \pi k)} & = \cos(\theta+2 \pi k) + i \sin (\theta+2 \pi k) \\ & = \cos(\theta) + i \sin (\theta) = e^{i \theta} \,. \end{align*}\]
The exponential form is very useful for computing products and powers of complex numbers.
Proposition 43
The proof follows immediately by the properties of the exponential. Let us see some applications of Propostion 106.
Example 44
Question. Compute \((-2+2 i)^{4}\).
Solution. We have two possibilities:
Use the binomial theorem: \[\begin{align*} (-2+2 i)^{4} & =(-2)^{4}+\left(\begin{array}{l} 4 \\ 1 \end{array}\right)(-2)^{3} \cdot 2 i+\left(\begin{array}{l} 4 \\ 2 \end{array}\right)(-2)^{2} \cdot(2 i)^{2} \\ & \quad +\left(\begin{array}{l} 4 \\ 3 \end{array}\right)(-2) \cdot(2 i)^{3}+(2 i)^{4} \\ & =16-4 \cdot 8 \cdot 2 i-6 \cdot 4 \cdot 4+4 \cdot 2 \cdot 8 i+16 \\ & =16-64 i-96+64 i+16=-64 \,. \end{align*}\]
A much simpler calculation is possible by using the exponential form: We know that \[ -2+2 i = \sqrt{8} e^{i \frac{3}{4} \pi} \] by Example 104. Hence \[ (-2+2 i)^{4}=\left(\sqrt{8} e^{i \frac{3}{4} \pi}\right)^{4}=8^{2} e^{i 3 \pi}=-64 \,, \] where we used that \[ e^{i 3 \pi} = e^{i \pi} = \cos(\pi) + i \sin(\pi) = - 1 \] by \(2\pi\) periodicity of \(e^{i\theta}\) and Euler’s identity.
Definition 45: Complex exponential
The complex exponential behaves exactly as exponentials should.
Theorem 46
We still do not have the technical means to prove this Theorem. The idea is to express \(e^z\) as \[ e^z = \sum_{k=0}^\infty \frac{z^n}{n!} \,, \tag{5.7}\] where the convergence is intendend in the sense of complex series. The properties at (A.19) then follow from manipulating the series on the RHS of (5.7).
Example 47
Solution. We know that \[ |i| = 1 \,, \quad \arg(i) = \frac{\pi}{2} \,. \] Hence we can write \(i\) in exponential form \[ i= |i| e^{i\arg(i)} = e^{i \frac{\pi}{2}} \,. \] Therefore \[ i^{i}=\left(e^{i \frac{\pi}{2}}\right)^{i}=e^{i^2\frac{\pi}{2}}=e^{-\frac{\pi}{2}} \,. \]
5.6 Fundamental Theorem of Algebra
We started the introduction to complex numbers with the following question:
Question 48
The answer is no. For this reason we introduced the complex number \(i\), which satisfies \[ i^{2}=-1 \,. \] Therefore (5.8) has solution in \(\mathbb{C}\), with \(x = i\). Moreover, note that \[ (-i)^{2}=(-1)^{2} i^{2}=-1 \,. \] Hence (5.8) has two solutions in \(\mathbb{C}\), given by \[ x_1 = i \,, \quad x_2 = - i \,. \]
It turns out that the set \(\mathbb{C}\) is so large, that not only we are able to solve (5.8), but in fact any polynomial equation.
Theorem 49: Fundamental theorem of algebra
Theorem 111 says that equation (A.20) admits \(n\) complex solutions:
- We call these solutions zeros, or also roots.
- We call the expression (A.21) a factorization of the polynomial \(p_n\).
Several proofs of Theorem 111 exist in the literature, but they all use mathematical tools which are out of reach for now. Therefore we will not show a proof. For example one can prove Theorem 111 by
- Liouville’s theorem (complex analysis)
- Homotopy arguments (general topology)
- Fundamental Theorem of Galois Theory (algebra)
Example 50
Solution. The equation \(z^2 = -1\) is equivalent to \[ p(z) = 0 \,, \quad p(z):=z^{2}+1 \,. \] Since \(p\) has degree \(n=2\), the Fundamental Theorem of Algebra tells us that there are two solutions to (A.22). We have already seen that these two solutions are \(z=i\) and \(z=-i\). Then \(p\) factorizes as \[ p(z) = z^{2} + 1 = (z-i)(z+i) \,. \]
Example 51
Solution The associated polynomial equation is \[
p(z) = 0 \,, \quad p(z) := z^4 - 1 \,.
\] Since \(p\) has degree \(n=4\), the Fundamental Theorem of Algebra tells us that there are \(4\) solutions to (A.23). Let us find such solutions. We use the well known identity \[
a^2-b^2 = (a+b)(a-b) \,, \quad \forall \, a,b \in \mathbb{R}\,,
\] to factorize \(p\). We get: \[
p(z) = (z^4-1) = (z^2+1)(z^2-1) \,.
\] We know that \[
z^2 + 1 = 0
\] has solutions \(z = \pm i\). Instead
\[
z^2 - 1 = 0
\] has solutions \(x = \pm 1\). Hence, the four solutions of (A.23) are given by \[
z=1,-1, i,-i \,,
\] and \(p\) factorizes as \[
p(z) = z^4 - 1 = (z-1)(z+1)(z-i)(z+i) \,.
\]
Definition 52
Example 53
The equation \[ (z-1)(z-2)^{2}(z+i)^{3}=0 \] has 6 solutions:
- \(z=1\) with multiplicity \(1\)
- \(z=2\) with multiplicity \(2\)
- \(z=-i\) with multiplicity \(3\)
5.7 Solving polynomial equations
The non-factorized version of the polynomial \[ p(z) = (z-1)(z-2)^2(z+i)^3 \tag{5.13}\] from Example 115 is \[ \begin{aligned} p(z) = & z^{6}-(5-3 i) z^{5}+(5-15 i) z^{4} \\ & + (11+23 i) z^{3}-(24+7 i) z^{2}+(12-8 i) z+4 i \end{aligned} \tag{5.14}\] Question: How do we figure out the factorization at (5.13) if we are given \(p\) in the form at (5.13)? More in general, consider the following question:
Question 54
Answer: There is no general formula to solve (5.15) when \(n \geq 5\). This is the content of the Abel-Ruffini Theorem.
Theorem 55: Abel-Ruffini
Similarly to the Fundamental Theorem of Algebra, the proof of the Abel-Ruffini Theorem is out of reach for now. A proof can be carried out, for example, using Galois Theory.
There are however explicit formulas for solving (5.15) when \(p_n\) has degree \(n=2,3,4\).
5.7.1 Quadratic polynomials
Consider polynomial equations of order \(n=2\), that is, equations of the form \[ a z^{2}+b z+c=0 \,. \tag{5.16}\] When the coefficients \(a,b,c\) are real, the solutions are given by the well-known quadratic formula.
Proposition 56: Quadratic formula
- If \(\Delta > 0\) then (A.24) has two distinct real solutions \(z_1,z_2 \in \mathbb{R}\) given by \[ z_1 = \frac{-b - \sqrt{\Delta}}{2 a} \,, \quad z_2 = \frac{-b + \sqrt{\Delta}}{2 a} \,. \]
- If \(\Delta = 0\) then (A.24) has one real solution \(z \in \mathbb{R}\) with multiplicity \(2\). Such solution is given by \[ z = z_1 = z_2 = \frac{-b}{2 a} \,. \]
- If \(\Delta < 0\) then (A.24) has two distinct complex solutions \(z_1,z_2 \in \mathbb{C}\) given by \[ z_1 = \frac{-b - i\sqrt{-\Delta}}{2 a} \,, \quad z_2 = \frac{-b + i\sqrt{-\Delta}}{2 a} \,, \] where \(\sqrt{-\Delta} \in \mathbb{R}\), since \(-\Delta>0\).
In all cases, the polynomial at (A.24) factorizes as \[ a z^{2}+b z+c = a (z-z_1)(z-z_2) \,. \]
Example 57
Question. Solve the following equations:
- \(3 z^{2}-6 z+2 = 0\)
- \(4 z^{2}-8 z+4=0\)
- \(z^{2}+2 z+3=0\)
Solution.
We have that \[ \Delta = (-6)^{2}-4 \cdot 3 \cdot 2 = 12 > 0 \] Therefore the equation has two distinct real solutions, given by \[ z=\frac{-(-6) \pm \sqrt{12}}{2 \cdot 3}=\frac{6 \pm \sqrt{12}}{6}=1 \pm \frac{\sqrt{3}}{3} \] In particular we have the factorization \[ 3 z^{2}-6 z+2 = 3 \left( z - 1 - \frac{\sqrt{3}}{3} \right) \left( z - 1 + \frac{\sqrt{3}}{3} \right) \,. \]
We have that \[ \Delta = (-8)^{2}-4 \cdot 4 \cdot 4 = 0 \,. \] Therefore there exists one solution with multiplicity \(2\). This is given by \[ z=\frac{-(-8)}{2 \cdot 4} = 1 \,. \] In particular we have the factorization \[ 4 z^{2}-8 x+4 = 4 (z-1)^2 \,. \]
We have \[ \Delta = 2^{2}-4 \cdot 1 \cdot 3 = - 8 < 0 \,. \] Therefore there are two complex solutions given by \[ z=\frac{-2 \pm i \sqrt{8}}{2 \cdot 1} = -1 \pm i \sqrt{2} \,. \] In particular we have the factorization \[ z^{2}+2 z+3 = (z + 1 - i \sqrt{2}) (z + 1 + i\sqrt{2}) \,. \]
So far we have considered the polynomial equation \[ az^2 + bz + c = 0 \,, \tag{5.17}\] for \(a,b,c \in \mathbb{R}\) and \(a \neq 0\).
Question 58
If \(a, b, c \in \mathbb{C}\) then we might have \[ \Delta := b^{2}-4 a c \in \mathbb{C}\,. \] Therefore it is not clear how to compute \[ \sqrt{\Delta} \,. \] However, we can still use the quadratic equation to solve (5.17), as outlined in the next Proposition.
Proposition 59: Quadratic formula with complex coefficients
Remark 60
To this end, note that \[ \Delta \in \mathbb{R}\,, \] when \(a,b,c \in \mathbb{R}\). In this case the equation \[ z^2 = \Delta \] has the following solutions:
- If \(\Delta>0\) there are two real solutions \[ S_1 = -\sqrt{\Delta} \,, \quad S_2 = \sqrt{\Delta} \,. \]
- If \(\Delta = 0\) then \(0\) is the only solution with multiplicity \(2\). Hence \[ S_1 = S_2 = 0 \,. \]
- If \(\Delta<0\), there are two complex solutions \[ S_1 = - i \sqrt{-\Delta} \,, \quad S_2 = i \sqrt{-\Delta} \,. \]
Therefore the solutions \[ z_1 = \frac{-b + S_1}{2 a} \,, \quad z_2 = \frac{-b + S_2}{2 a} \,, \] given in Proposition 116 coincide with the ones given in Proposition 118.
Example 61
Solution. We have \[\begin{align*} \Delta & = (-(3+i))^{2}-4 \cdot \frac{1}{2} \cdot(4-i) \\ & = 8+6 i-8+2 i \\ & =8 i \,. \end{align*}\] Therefore \(\Delta \in \mathbb{C}\). We have to find solutions \(S_1\) and \(S_2\) to the equation \[ z^2 = \Delta = 8i \,. \tag{5.19}\] We look for solutions of the form \(z=a+ i b\). Then we must have that \[ z^{2}=(a+ ib)^{2}=a^{2}-b^{2}+2 a b i = 8 i \,. \] Thus \[ a^{2}-b^{2}=0 \,, \quad 2 a b = 8 \,. \] From the first equation we conclude that \(|a|=|b|\). From the second equation we have that \(ab=4\), and therefore \(a\) and \(b\) must have the same sign. Hence \(a=b\), and \[ 2 a b = 8 \quad \implies \quad a = b = \pm 2 \,. \] From this we conclude that the solutions to (A.26) are \[ S_{1} = 2+2 i \,, \quad S_{2}=-2-2 i \,. \] Hence the solutions to (A.25) are \[\begin{align*} z_1 & = \frac{3+i+S_{1}}{2 \cdot \frac{1}{2}} = 3 + i + S_{1} \\ & = 3 + i + 2 + 2i = 5 + 3i \,, \end{align*}\] and \[\begin{align*} z_2 & = \frac{3+i+S_{2}}{2 \cdot \frac{1}{2}} = 3+i+S_{2} \\ & = 3+i -2 - 2i = 1 - i \,. \end{align*}\] In particular, the given polynomial factorizes as \[\begin{align*} \frac{1}{2} z^{2}-(3+i) z+(4-i) & = \frac12 (z - z_1)(z-z_2) \\ & = \frac12 (z - 5 - 3i)(z - 1 + i) \,. \end{align*}\]
In the above example it was a bit laborious to compute \(S_{1}\) and \(S_{2}\). In Section 5.8 and Section A.6.7, we will explore a more general method to solve problems of the form \[ z^n = \Delta \,. \]
5.7.2 Higher order polynomials
Consider now polynomial equations \[ p_n(z) = 0 \,, \] with \(p_n\) of degree \(n=3,4\). Although general solution formulas exist for these cases, they are exceedingly lengthy, making them impractical for manual calculations. For example, Figure 5.8 shows the solution formula to the quartic equation \[ z^4 + az^3 + bz^2 + cz + d = 0 \,, \] when \(a,b,c,d \in \mathbb{R}\).
A more productive use of time is learning how to perform long polynomial division. A quick tutorial is available here: Polynomial_division.pdf.
Example 62
Solution. Using polynomial division, see Figure 5.9, we obtain \[ 6z^3+5z^2-7 = (3z^2-2z-1)(2z+3) + (8z-4) \,. \]
Sometimes, it is possible to solve equations of degree higher than 2, in case it is obvious from inspection that a certain number is a solution, e.g., when \(z=-1,0,1\) is a solution.
Example 63
Question. Consider the equation \[ z^{3}-7 z^{2}+6 z=0 \,. \]
- Check whether \(z=0,1,-1\) are solutions.
- Using your answer from Point 1, and polynomial division, find all the solutions.
Solution.
By direct inspection we see that \(z=0\) and \(z=1\) are solutions.
Since \(z = 0\) is a solution, we can factorize \[ z^{3}-7 z^{2}+6 z=z\left(z^{2}-7 z+6\right) \,. \] We could now use the quadratic formula on the term \(z^{2}-7 z+6\) to find the remaining two roots. However, we have already observed that \(z=1\) is a solution. Therefore \(z-1\) divides \(z^{2}-7 z+6\). Using polynomial long division, see Figure 5.10, we find that \[ \frac{z^{2}-7 z+6}{z-1}=z-6 \,. \] Therefore the last solution is \(z=6\), and \[ z^{3}-7 z^{2}+6 z=z(z-1)(z-6) \,. \]
Example 64
Solution. It is easy to see \(z=1\) is a solution. This means that \(z-1\) divides \(z^{3}-7 z+6\). By using polynomial long division, see Figure 5.11, we compute that \[ \frac{z^{3}-7 z+6}{z-1}=z^{2}+z-6 \,. \] We are now left to solve \[ z^{2}+z-6 = 0\,. \] Using the quadratic formula, we see that the above is solved by \(z=2\) and \(z=-3\). Therefore the given polynomial factorizes as \[ z^{3}-7 z+6 = (z-1)(z-2)(z+3) \,. \]
Example 65
Question. Consider the equation \[ z^3 + 5 z^2 + (5 - i) z + (4 - 4 i) = 0 \,. \]
- Check whether \(z = \pm i\) and \(z = 0, \pm 1\) are solutions.
- By using polynomial division with complex coefficients, find all the solutions.
Solution.
By direct inspection, we see that \(z = i\) is a solution.
Since \(z = i\) is a solution, we know that \(z-i\) divides \(z^3 + 5 z^2 + (5 - i) z + (4 - 4 i)\). We now perform polynomial division by using the same method employed to divide polynomials with real coefficients: We just need to be mindful of the fact that coefficients are now complex, and thus, addition and multiplication have to be carried out in \(\mathbb{C}\). Using polynomial long division, see Figure 5.12, we compute that \[ z^3 + 5 z^2 + (5 - i) z + (4 - 4 i) = (z-i)(z^2 + (5+i) z + (4-4i)) \,. \] We are now left to solve the second order equation \[ z^2 + (5+i) z + (4-4i) = 0 \,. \] We compute that \[ \Delta = b^2 - 4ac = (5+1)^2 - 4 (4-4i) = 4(5+4i) \,. \] We now need to solve \[ z^2 = \Delta \,. \] We look for a solution in the form of \(z=a+ib\). \[ z^{2}=(a+ ib)^{2}=a^{2}-b^{2}+2 a b i = 8 i \,. \] Thus \[ a^{2}-b^{2}=0 \,, \quad 2 a b = 8 \,. \] From the first equation we conclude that \(|a|=|b|\). From the second equation we have that \(ab=4\), and therefore \(a\) and \(b\) must have the same sign. Hence \(a=b\), and \[ 2 a b = 8 \quad \implies \quad a = b = \pm 2 \,. \] From this we conclude that the solutions to (A.26) are \[ S_{1} = 2+2 i \,, \quad S_{2}=-2-2 i \,. \] Hence the solutions to (A.25) are \[\begin{align*} z_1 & = \frac{3+i+S_{1}}{2 \cdot \frac{1}{2}} = 3 + i + S_{1} \\ & = 3 + i + 2 + 2i = 5 + 3i \,, \end{align*}\] and \[\begin{align*} z_2 & = \frac{3+i+S_{2}}{2 \cdot \frac{1}{2}} = 3+i+S_{2} \\ & = 3+i -2 - 2i = 1 - i \,. \end{align*}\] In particular, the given polynomial factorizes as \[\begin{align*} \frac{1}{2} z^{2}-(3+i) z+(4-i) & = \frac12 (z - z_1)(z-z_2) \\ & = \frac12 (z - 5 - 3i)(z - 1 + i) \,. \end{align*}\]
5.8 Roots of unity
Problem
Note that \(z=1\) is always a solution to (5.20) if \(n\) is even. In such case also \(z=-1\) is a solution. If we were only looking for solutions in \(\mathbb{R}\), these two would be the only solutions.
However, the Fundamental Theorem of Algebra, see Theorem 111, tells us that there are \(n\) complex solutions to (5.20).
Question 66
Example 67
The trick to find all \(n\) solutions to (5.20) is to use the exponential form.
Theorem 68
Proof
Definition 69
Example 70
Solution. The \(4\) solutions are given by \[ z_k = \exp \left(i \frac{2 \pi k}{4} \right) = \exp \left(i \frac{\pi k}{2} \right) \,, \] for \(k=0,1,2,3\). We compute: \[\begin{align*} z_0 & = e^{i 0} = 1 \,, & \quad & z_1 = e^{i \frac{\pi}{2}}=i \,, \\ z_2 & = e^{i \pi}=-1 \,, & \quad & z_3 = e^{i \frac{3 \pi}{2}}=-i \, . \end{align*}\] Note that for \(k=4\) we would again get the solution \(z=e^{i 2 \pi}=1\).
Example 71
Solution. The \(3\) solutions are given by \[ z_k = \exp \left( i \frac{2 \pi k}{3} \right) \,, \] for \(k=0,1,2\). We compute: \[ z_0=e^{i 0}=1, \quad z_1=e^{i \frac{2 \pi}{3}}, \quad z_2=e^{i \frac{4 \pi}{3}} . \] We can write \(z_1\) and \(z_2\) in cartesian form: \[ z_1 = e^{i \frac{2 \pi}{3}}=\cos \left(\frac{2 \pi}{3}\right)+i \sin \left(\frac{2 \pi}{3}\right)=-\frac{1}{2}+\frac{\sqrt{3}}{2} i \] and \[ z_2 = e^{i \frac{4 \pi}{3}}=\cos \left(\frac{4 \pi}{3}\right)+i \sin \left(\frac{4 \pi}{3}\right)=-\frac{1}{2}-\frac{\sqrt{3}}{2} i \,. \]
5.9 Roots in \(\mathbb{C}\)
Let \(n \in \mathbb{N}\) and \(c \in \mathbb{C}\). We want to give a meaning to \[ \sqrt[n]{c} \,. \] This means, we want to find all complex solutions to \[ z^{n}=c \,. \] The Fundamental Theorem of Algebra ensures that the above equation has \(n\) complex solutions. To find these solutions, we pass to the exponential form.
Theorem 72
Proof
Warning: The n-th root function is multi-valued
Example 73
Solution. Let \(c = -32\). We have \[ |c| = |-32|=32=2^{5}\,, \quad \theta = \arg (-32)=\pi \,. \] The \(5\) solutions are given by \[ z_k = \left(2^{5}\right)^{\frac{1}{5}} \exp \left(i \pi \, \frac{1+2 k}{5} \right) \,, \quad k \in \mathbb{Z}\,, \] for \(k=0,1,2,3,4\). We get \[\begin{align*} z_0 & = 2 e^{i \frac{\pi}{5}} \, & \quad & z_1 = 2 e^{i \frac{3 \pi}{5}} \\ z_2 & = 2 e^{i \pi}=-2 \, & \quad & z_3=2 e^{i \frac{7 \pi}{5}} \\ z_4 &= 2 e^{i \frac{9 \pi}{5}} & \quad & \end{align*}\]
Example 74
Solution. Set \[ c:=9\left(\cos \left(\frac{\pi}{3}\right)+i \sin \left(\frac{\pi}{3}\right)\right) \,. \] The complex number \(c\) is already in the trigonometric form, so that we can immediately obtain \[ |c| = 9 \,, \quad \theta = \arg(c) = \frac{\pi}{3} \,. \] The \(4\) solutions are given by \[\begin{align*} z_k & = \sqrt[4]{9} \, \exp \left( i \, \frac{ \pi/3 + 2 \pi k}{4} \right) \\ & = \sqrt{3} \exp \left( i \pi \, \frac{1+6 k}{12} \right) \end{align*}\] for \(k=0,1,2,3\). We compute \[\begin{align*} z_0 & = \sqrt{3} e^{i \pi \frac{1}{12}} & \quad & z_1 = \sqrt{3} e^{i \pi \frac{7}{12}} \\ z_2 & = \sqrt{3} e^{i \pi \frac{13}{12}} & \quad & z_3 = \sqrt{3} e^{i \pi \frac{19}{12}} \end{align*}\]