5  Complex Numbers

In , we have proven the existence of x for all for x0. The number x was defined as x:=α,α:=sup{tR:t2<x}. We then proved that α2=x. This procedure is possible for any x0.

Question 1
Is there a number αR such that (5.1)α2=1?

The answer to the above question is no. This is because R is an ordered field, and from axiom (MO) it follows that: x20,xR. However we would still like to solve equation () somehow. To do this, we introduce the imaginary numbers or complex numbers. We define i to be that number such that i2=1. Formally, we can also think of i as i=1. We can use this speacial number to define the square root of a negative real number x<0: x:=ix. Note that x is properly defined in R, because x is positive when x is negative.

5.1 The field C

We would like to be able to do calculations with the newly introduced complex numbers, and investigate their properties. We can introduce them rigorously as a field, as we did for R.

Definition 2: Complex Numbers
The set of complex numbers C is defined as

C:=RiR:={xiy:x,yR}.

In the above the symbol is used to denote the pair xiy=(x,y) with x,yR. This means that x and y play different roles.

Definition 3

For a complex number z=xiyC we say that

  • x is the real part of z, and denote it by x=Re(z)
  • y is the imaginary part of z, and denote it by y=Im(z)

We say that

  • If Rez=0 then z is a purely imaginary number.
  • If Imz=0 then z is a real number.

In order to make the set C into a field, we first have to define the two binary operations of addition + and multiplication , +,:C×CC. Then we need to prove that these operations satisfy all the field axioms.

Definition 4: Addition in C
Let z1,z2C, so that z1=x1iy1,z2=x2iy2, for some x1,x2,y1,y2R. We define the sum of z1 and z2 as z1+z2=(x1iy1)+(x2iy2):=(x1+x2)i(y1+y2) where the + symbol on the right hand side is the addition operator in R.

Clearly, z1+z2 as defined above is an element of C. Therefore + defines a binary operation over C.

Notation 5
From the above definition, we have that, for all x,yR,

(xi0)+(0iy)=xiy. To simplify notation, we will write xi0=x,0iy=iy and xiy=x+iy. We will also often swap i and y, writing equivalently x+iy=x+yi.

We now want to define multiplication between complex numbers.

Remark 6: Formal calculation for multiplication in C
How to define multiplication in C? Whatever the definition may be, at least it has to give that that i2=ii=1. Keeping the above in mind, let us do some formal calculations: For z1=x1+iy1,z2=x2+iy2 we have

z1z2=(x1+iy1)(x2+iy2)=x1x2+x1iy2+x2iy1+y1i2y2=(x1x2y1y2)+i(x1y2+x2y1)

Remark 6 motivates the following definition of multiplication.

Definition 7: Multiplication in C
Let z1,z2C, so that z1=x1iy1,z2=x2iy2, for some x1,x2,y1,y2R. Define the multiplication of z1 and z2 as z1z2=(x1+iy1)(x2+iy2):=(x1x2y1y2)+i(x1y2+x2y1), where the operations + and on the right hand side are the operations in R.

Clearly, z1z2 as defined above is an element of C. Therefore defines a binary operation over C.

Remark 8
To check that we have given a good definition of product, we should have that i2=1, as expected. Indeed:

i2=(0+1i)(0+1i)=(0011)+(01+01)i=1.

Important
In view of Remark 8, we see that he formal calculations in Remark 6 are compatible with the definition of multiplication of complex numbers. Therefore, it is not necessary to memorize the multiplication formula, but it suffices to carry out calculations as usual, and replace i2 by 1.

Example 9
Question. Compute zw, where z=2+3i,w=1i.

Solution. Using the definition we compute zw=(2+3i)(1i)=(2(3))+(2+3)i=1+5i. Alternatively, we can proceed formally as in Remark 6. We just need to recall that i2 has to be replaced with 1: zw=(2+3i)(1i)=2+2i+3i3i2=(2+3)+(2+3)i=1+5i.

We now want to check that (C,+,) is a field. All the field axioms are trivial to check, except for the existence of additive and multiplicative inverses.

Proposition 10: Additive inverse in C
The neutral element of addition in C is the number 0:=0+0i. For any z=x+iyC, the unique additive inverse is given by z:=xiy.

The proof is immediate and is left as an exercise. The multiplication requires more care.

Remark 11: Formal calculation for multiplicative inverse
Let us first carry our some formal calculations. Let z=x+iyC,z0. First, note that z1=(x+iy)(1+0i)=x+iy=z, and therefore 1 is the neutral element of multiplication. Thus, the inverse of z should be a complex number z1C such that zz1=1. We would like to define z1=1x+iy. Such number does not belong to C, as it is not of the form a+ib for some a,bR. However it is what the inverse should look like. Proceeding formally: 1x+iy=1x+iy1=1x+iyxiyxiy=xiyx2(iy)2=xiyx2+y2=xx2+y2+iyx2+y2. The right hand side is an element of C, and looks like a good candidate for z1.

Motivated by the above remark, we define inverses in C in the following way.

Proposition 12: Multiplicative inverse in C
The neutral element of multiplication in C is the number 1:=1+0i. For any z=x+iyC, the unique multiplicative inverse is given by z1:=xx2+y2+iyx2+y2.

Proof
It is immediate to check that 1 is the neutral element of multiplication in C. For the remaining part of the statement, set w:=xx2+y2+iyx2+y2. We need to check that zw=1 zw=(x+iy)(xx2+y2+iyx2+y2)=(x2x2+y2y(y)x2+y2)+i(x(y)x2+y2+xyx2+y2)=1, so indeed z1=w.

Important
It is not necessary to memorize the formula for z1. Indeed one can just remember the trick of multiplying by 1=xiyxiy, and proceed formally, as done in Remark 11.

Example 13
Question. Let z=3+2i. Compute z1.

Solution. By the formula in Propostion 12 we immediately get z1=332+22+232+22i=313213i. Alternatively, we can proceed formally as in Remark 11 (3+2i)1=13+2i=13+2i32i32i=32i32+22=313213i, and obtain the same result.

We can now prove that C is a field.

Theorem 14
(C,+,) is a field.

Proof
We need to check that all field axioms hold. For the addition we have

  • (A1) To show that + is commutative, note that (x+iy)+(a+ib)=(x+a)+i(y+b)=(a+x)+i(b+y)=(a+ib)+(x+iy), where we used Definition 64 in the first and last equality, and the commutative property of the real numbers (which holds since by definition R is a field) in the second equality. Associativity can be checked in the same way.

  • (A2) The neutral element of addition is 0, as stated in Proposition 79.

  • (A3) Existence of additive inverses is given by Proposition 79.

For multiplication we have:

  • (M1) Commutativity and associativity of product in C can be checked using Definition 7 and commutativity and associativity of sum and multiplication in R.

  • (M2) The neutral element of multiplication is 1, as stated in Propostion 12.

  • (M3) Existence of multiplicative inverses is guaranteed by Proposition 12.

Finally one should check the associative property (AM). This is left as an exercise.

5.1.1 Division in C

Suppose we want to divide two complex numbers w,zC,z0, with z=x+iy,w=a+ib. We have two options:

  1. Use the formula for the inverse from Proposition 12 and compute z1:=xx2+y2+iyx2+y2. Then we use the multiplication formula of Definition 7 to compute wz=wz1=(a+ib)(xx2+y2+iyx2+y2)=(ax+by)+i(bxay)x2+y2

  2. Proceed formally as in Remark 11, using the multiplication by 1 trick. We would have wz=a+ibx+iy=a+ibx+iyxiyxiy=(ax+by)+i(bxay)x2+y2

Example 15

Question. Let w=1+i and z=3i. Compute wz.

Solution. We compute w/z using the two options we have:

  1. Using the formula for the inverse from Proposition 12 we compute z1=xx2+y2+iyx2+y2=332+12i132+12=310+110i and therefore wz=wz1=(1+i)(310+110i)=(310110)+(110+310)i=210+410i=15+25i

  2. We proceed formally, using the multiplication by 1 trick. We have wz=1+i3i=1+i3i3+i3+i=31+(3+1)i32+12=210+410i=15+25i

5.1.2 C is not ordered

We have seen that (C,+,) is a field. One might wonder whether C is also an ordered field. It turns out that this is not the case.

Theorem 16
The field (C,+,) is not ordered.

Proof
Suppose that C is an ordered field, that is, there exists an order relation on C compatible with the operations + and . By axiom (MO) it follows that for all elements zC,z0, we have that z2>0. But since i2=1<0, we get a contradiction.

Hence, it is not possible to compare two complex numbers.

5.1.3 Completeness of C

One might also wonder whether C is complete. Our definition of completeness uses the notions of supremum and infimum, which require an order relation. However, C is not ordered, as seen in Theorem 16. Therefore we cannot define infimum and supremum of subsets of C, and we have to find an alternative way to discuss completeness.

The right way to introduce completeness in C is by using the notion of Cauchy sequence. In ordered fields, this new definition of completeness is equivalent to the definition which uses the supremum.

5.2 Complex conjugates

When computing inverses, we used the trick to multiply by 1: z1=1z1=1x+iyxiyxiy. The complex number xiy is obtained by changing the sign to the imaginary part of z=x+iy. We give a name to this operation.

Definition 17: Complex conjugate
Let z=x+iy. We call the complex conjugate of z, denoted by z¯, the complex number

z¯=xiy.

Example 18
We have the following conjugates: 3+4i=34i,34i=3+4i,3+4i=34i,34i=3+4i,3=3,4i=4i.

Complex conjugates have the following properties:

Theorem 19

For all z1,z2C it holds:

  • z1+z2=z1+z2

  • z1z2=z1z2

Proof
Let z1,z2C. Then z1=x1+iy1,z2=x2+iy2, for some x1,y1,x2,y2R.

  • Using the definition of addition in C and of conjugate, z1+z2=(x1+iy1)+(x2+iy2)=(x1+x2)+i(y1+y2)=(x1+x2)i(y1+y2)=(x1iy1)+(x2iy2)=x1+iy1+x2+iy2=z1+z2.

  • Using the definition of multiplication in C and of conjugate,

z1z2=(x1+iy1)(x2+iy2)=(x1x2y1y2)+i(x1y2+x2y1)=(x1x2y1y2)i(x1y2+x2y1)=(x1iy1)(x2iy2)=z1z2

Example 20

Let z1=34i and z2=2+5i. Then

  • Let us check that z1+z2=z1+z2 Indeed, we have z1+z2=1+iz1+z2=1i. On the other hand z1=3+4i,z2=25iz1+z2=1i.

  • Let us check that z1z2=z1z2 Indeed, z1z2=(3+4i)(2+5i)=(6+20)+(8+15)i=14+23i so that z1z2=1423i On the other hand: z1z2=(3+4i)(25i)=(6+20)+(158)i=1423i

5.3 The complex plane

We can represent a real number x as a point on the one-dimensional real line R. The distance between two real numbers x,yR on the real line is given by |xy|, see .

Figure 5.1: Two points x and y on the real line R. Their distance is |xy|.

We would like to do something similar for the complex numbers, i.e. points z=x+iy,x,yR. We represent z=x+iy in the two-dimensional plane at the point with (Cartesian) coordinates (x,y). This two-dimensional plane in which we can represent all complex numbers is called the complex plane. The origin of such plane, with coordinates (0,0), corresponds to the complex number 0+0i=0, see .

Figure 5.2: A point z=x+iyC can be represented on the complex plane by the point of coordinates (x,y). The distance between z and 0 is given by |z|=z2+y2.

5.3.1 Distance on C

The Cartesian representation allows us to introduce a distance between two complex numbers. Let us start with the distance between a complex number z=x+iy and 0. By Pythagoras Theorem this distance is given by x2+y2, see . We give a name to this quantity.

Definition 21: Modulus
The modulus of a complex number z=x+iy is defined by |z|:=x2+y2.

Note that the distance between z and 0 is always a non-negative number.

Remark 22: Modulus of Real numbers
A real number xR can be written as x=x+0iC. Hence the modulus of x is given by |x|=x2+02=x2. The above coincides with the absolute value of x. This explains why the notation for modulus in C is the same as the one for absolute value in R.

We can now define the distance between two complex numbers.

Definition 23: Distance in C
Given z1,z2C, we define the distance between z1 and z2 as the quantity |z1z2|.

The geometric intuition of why the quantity |z1z2| is defined as the distance between z1 and z2 is given in .

Figure 5.3: The difference z1z2 of the two points z1,z2C is given by the magenta vector. We define |z1z2| as the distance between z1 and z2.

Theorem 24
Given z1,z2C, we have |z1z2|=(x1x2)2+(y1y2)2.

Proof
We have z1z2=(x1x2)+i(y1y2). Therefore, by definition of modulus, |z1z2|=(x1x2)2+(y1y2)2.

Example 25
Question. Compute the distance between z=24i,w=5+i.

Solution. The distance is |zw|=|(24i)(5+i)|=|75i|=72+(5)2=74

5.3.2 Properties of modulus

The modulus has the following properties.

Theorem 26

Let z,z1,z2C. Then

  1. |z1z2|=|z1||z2|

  2. |zn|=|z|n for all nN

  3. zz¯=|z|2

Proof
Part 1. We have z1z2=(x1+iy1)(x2+iy2)=(x1x2y1y2)+i(x2y1+x1y2) and therefore |z1z2|=(x1x2y1y2)2+(x2y1+x1y2)2=x12x22+y12y22+x22y12+x12y22. On the other hand, |z1|=x12+y12,|z2|=x22+y22 so that |z1||z2|=x12+y12x22+y22=x12x22+y12y22+x22y12+x12y22 proving that |z1z2|=|z1||z2|.

Part 2. Exercise. It easily follows from Point 1 and induction.

Part 3. Let z=x+iy for some x,yR. Then,

zz¯=(x+iy)(xiy)=x2(iy)2=x2+y2=|z|2

The modulus in C satisfies the triangle inequality.

Theorem 27: Triangle inequality in C

For all x,y,zC,

  1. |x+y||x|+|y|

  2. |xz||xy|+|yz|

Proof
Part 1. Suppose that x=a+ib and y=c+id for a,b,c,dR. Then, |x+y|=|(a+c)+i(b+d)|=(a+c)2+(b+d)2. Therefore the inequality (5.2)|x+y||x|+|y| is equivalent to (5.3)(a+c)2+(b+d)2a2+b2+c2+d2. Now note that, for A,BR, we have that (5.4)A2B2|A||B|. In the two reverse implications below we will use (): (a+c)2+(b+d)2a2+b2+c2+d2(a+c)2+(b+d)2(a2+b2+c2+d2)2a2+2ac+c2+b2+2bd+d2a2+b2+2a2+b2c2+d2+c2+d2ac+bda2+b2c2+d2(ac+bd)2(a2+b2)(c2+d2)a2c2+2abcd+b2d2a2c2+a2d2+b2c2+b2d2a2d2+b2c22abcd0(adbc)20.

This last statement is clearly true, since adbcR. Therefore () holds, and so () follows.

Part 2. Using () we estimate |xz|=|xy+yz||xy|+|yz|.

Remark 28: Geometric interpretation of triangle inequality
We finally have a justification of why the inequality |xz||xy|+|yz| is called triangle inequality: By drawing three points x,y,zC in the complex plane, the distance between x and z is shorter than the distance to go from x to z via the point y, see .

Figure 5.4: Let x,y,zC. The distance between x and z is shorter than the distance to go from x to z via the point y.

5.4 Polar coordinates

We have seen that we can identify a complex number z=x+iy by a point in the complex plane with Cartesian coordinates (x,y). We can also specify the point (x,y) by using the so-called polar coordinates (ρ,θ), where

  • ρ is the distance between z and the origin ρ=|z|=x2+y2

  • θ is the angle between the line connecting the origin and z and the positive real axis, see .

Figure 5.5: Polar coordinates (ρ,θ) for the complex number zC.

We give such angle a name.

Definition 29: Argument
Let zC. The angle θ between the line connecting the origin and z and the positive real axis is called the argument of z, and is denoted by θ:=arg(z).

Warning
We always use angles in radians, not degrees. Make sure your calculator is set to radians if you want to use it to compute angles.

Remark 30: Principal Value
The argument of a complex number is not uniquely defined. We can always add an integer number of times 2π to the argument to specify the same point. We usually use the convention to choose the argument in the interval (π,π]. This is called the principal value of the argument function. Therefore the complex numbers in the upper half plane have a positive argument, and the ones in the lower half plane have a negative argument.

Example 31
We have the following arguments: arg(1)=0arg(i)=π2arg(1)=πarg(i)=π2arg(1+i)=14πarg(1i)=34π

We can represent any non-zero complex number in polar coordinates.

Theorem 32: Polar coordinates
Let zC with z=x+iy and z0. Then x=ρcos(θ),y=ρsin(θ), where ρ:=|z|=x2+y2,θ:=arg(z).

The proof of Theorem 77 is trivial, and is based on basic trigonometry and definition of arg(z). Complex numbers in polar form can be useful. We give a name to such polar form.

Definition 33: Trigonometric form
Let zC. The trigonometric form of z is z=|z|[cos(θ)+isin(θ)], where θ=arg(z).

Example 34
Question. Suppose that zC has polar coordinates ρ=8,θ=34π. Therefore, the trigonometric form of z is z=8[cos(34π)+isin(34π)]. Write z in cartesian form.

Solution. We have x=ρcos(θ)=8cos(34π)=822=2y=ρsin(θ)=8sin(34π)=822=2. Therefore, the cartesian form of z is z=x+iy=2+2i.

As a consequence of Theorem 77 we obtain a formula for computing the argument.

Corollary 35: Computing arg(z)
Let zC with z=x+iy and z0. Then arg(z)={arctan(yx) if x>0arctan(yx)+π if x<0andy0arctan(yx)π if x<0andy<0π2 if x=0andy>0π2 if x=0andy<0 where arctan is the inverse of tan.

Proof
Using the polar coordinates formulas from Theorem 77, we get yx=ρsin(θ)ρcos(θ)=tan(θ). The thesis can be obtained by carefully inverting the tangent: With reference to and , we have:

  1. When x>0: Then z=x+iy belongs to the 1st or 4th quadrant. Therefore the argument of z is θ(π/2,π/2). Such interval coincides with the domain of tan, and therefore θ=arctan(yx).

  2. When x<0 and y0: Then z=x+iy lies in the 2nd quadrant. Thus, the argument of z is θ(π/2,π). Hence θ=arctan(yx)+π.

  3. When x<0 and y<0: Then z=x+iy lies in the 3rd quadrant. Thus, the argument of z is θ(π,π/2). Hence θ=arctan(yx)π.

  4. When x=0 and y>0: Then z=iy is imaginary with y>0, meaning that θ=π/2.

  5. When x=0 and y<0: Then z=iy is imaginary with y<0, meaning that θ=π/2.

This exhausts all the cases, and the proof is concluded.

Figure 5.6: The definition of arg(z) depends on the position of z in the complex plane.
Figure 5.7: Plot of the tangent function for θ between 3π2 and 3π2.

Example 36
Question. Compute the arguments of the complex numbers z=3+4i,z¯=34i,z¯=3+4i,z=34i.

Solution. Using the formula for arg in Corollary 98 we have arg(3+4i)=arctan(43)arg(34i)=arctan(43)=arctan(43)arg(3+4i)=arctan(43)+π=arctan(43)+πarg(34i)=arctan(43)π

5.5 Exponential form

We have seen that we can represent complex numbers in

  • Cartesian form
  • Trigonometric form

We now introduce a third way of representing complex numbers: the exponential form. For this, we need Euler’s identity:

Theorem 37: Euler’s identity
For all θR it holds eiθ=cos(θ)+isin(θ).

Proof
The proof of this theorem uses complex power series. We have not yet introduced series. However, we carry out the proof formally, assuming that all the quantities below converge. We have the following Taylor series at x0=0 (you might know them from calculus): ex=1+x+x22!+x33!+x44!+x55!+x66!+x77!+sin(x)=x1!x33!+x55!x77!+cos(x)=1x22!+x44!x66!+ The above identities also hold for xC. Hence we can substitute x=iθ in the series for ex to obtain eiθ=1+iθ+(iθ)22!+(iθ)33!+(iθ)44!+(iθ)55!+(iθ)66!+(iθ)77!=1+iθθ22!iθ33!+θ44!+iθ55!θ66!iθ77!+=cos(θ)+isin(θ), where in the second equality we used that i2=1, and where the third equality follows by observing that all terms with an even power of θ are exactly the terms in the expansion of cos(θ), and all terms with an odd power of θ are exactly the terms in the expansion of sin(θ) multiplied by i.

Theorem 38
For all θR it holds |eiθ|=1.

Proof
From Euler’s identity in Theorem 100 we get |eiθ|=|cos(θ)+isin(θ)|=cos2(θ)+sin2(θ)=1.

Theorem 39
Let zC with z=x+iy and z0. Then z=ρeiθ, where ρ:=|z|=x2+y2,θ:=arg(z).

Proof
By Theorem 77 we have x=ρcos(θ),y=ρsin(θ). Hence z=x+iy=ρcos(θ)+iρsin(θ)=ρeiθ, where in the last line we used Euler’s identity in Theorem 100.

Definition 40: Exponential form
The exponential form of a complex number zC is z=ρeiθ=|z|eiarg(z).

Example 41
Question. Write the number z=2+2i in exponential form.

Solution. From Example 97 we know that z=2+2i can be written in trigonometric form as z=8[cos(34π)+isin(34π)]. By Euler’s identity we hence obtain the exponential form z=8ei34π.

Remark 42: Periodicity of exponential
For all kZ we have (5.5)eiθ=ei(θ+2πk), meaning that the complex exponential is 2π-periodic. As we did for the principal value of the argument, we select θ in () so that θ(π,π].

Proof
Equation () follows immediately by Euler’s identity and periodicity of cos and sin, since ei(θ+2πk)=cos(θ+2πk)+isin(θ+2πk)=cos(θ)+isin(θ)=eiθ.

The exponential form is very useful for computing products and powers of complex numbers.

Proposition 43
Let z,z1,z2C and suppose that z=ρeiθ,z1=ρ1eiθ1,z2=ρ2eiθ2. We have z1z2=ρ1ρ2ei(θ1+θ2),zn=ρneinθ, for all nN.

The proof follows immediately by the properties of the exponential. Let us see some applications of Propostion 106.

Example 44

Question. Compute (2+2i)4.

Solution. We have two possibilities:

  1. Use the binomial theorem: (2+2i)4=(2)4+(41)(2)32i+(42)(2)2(2i)2+(43)(2)(2i)3+(2i)4=16482i644+428i+16=1664i96+64i+16=64.

  2. A much simpler calculation is possible by using the exponential form: We know that 2+2i=8ei34π by Example 104. Hence (2+2i)4=(8ei34π)4=82ei3π=64, where we used that ei3π=eiπ=cos(π)+isin(π)=1 by 2π periodicity of eiθ and Euler’s identity.

Definition 45: Complex exponential
The complex exponential of z=a+ibC is defined as ez=eaeib.

The complex exponential behaves exactly as exponentials should.

Theorem 46
Let z,wC. Then (5.6)ez+w=ezew,(ez)w=ezw.

We still do not have the technical means to prove this Theorem. The idea is to express ez as (5.7)ez=k=0znn!, where the convergence is intendend in the sense of complex series. The properties at () then follow from manipulating the series on the RHS of ().

Example 47
Question. Compute ii.

Solution. We know that |i|=1,arg(i)=π2. Hence we can write i in exponential form i=|i|eiarg(i)=eiπ2. Therefore ii=(eiπ2)i=ei2π2=eπ2.

5.6 Fundamental Theorem of Algebra

We started the introduction to complex numbers with the following question:

Question 48
Is there a number xR such that (5.8)x2=1?

The answer is no. For this reason we introduced the complex number i, which satisfies i2=1. Therefore () has solution in C, with x=i. Moreover, note that (i)2=(1)2i2=1. Hence () has two solutions in C, given by x1=i,x2=i.

It turns out that the set C is so large, that not only we are able to solve (), but in fact any polynomial equation.

Theorem 49: Fundamental theorem of algebra
Let pn(z) be a polynomial of degree n with complex coefficients, i.e., pn(z)=anzn+an1zn1++a1z+a0, for some coefficients an,,a0C with an0. There exist z1,,znC solutions to the polynomial equation (5.9)pn(z)=anzn+an1zn1++a1z+a0=0. In particular, pn factorizes as (5.10)pn(z)=an(zz1)(zz2)(zzn).

Theorem 111 says that equation () admits n complex solutions:

  • We call these solutions zeros, or also roots.
  • We call the expression () a factorization of the polynomial pn.

Several proofs of Theorem 111 exist in the literature, but they all use mathematical tools which are out of reach for now. Therefore we will not show a proof. For example one can prove Theorem 111 by

  • Liouville’s theorem (complex analysis)
  • Homotopy arguments (general topology)
  • Fundamental Theorem of Galois Theory (algebra)

Example 50
Question. Find all the complex solutions to (5.11)z2=1

Solution. The equation z2=1 is equivalent to p(z)=0,p(z):=z2+1. Since p has degree n=2, the Fundamental Theorem of Algebra tells us that there are two solutions to (). We have already seen that these two solutions are z=i and z=i. Then p factorizes as p(z)=z2+1=(zi)(z+i).

Example 51
Question. Find all the complex solutions to (5.12)z41=0.

Solution The associated polynomial equation is p(z)=0,p(z):=z41. Since p has degree n=4, the Fundamental Theorem of Algebra tells us that there are 4 solutions to (). Let us find such solutions. We use the well known identity a2b2=(a+b)(ab),a,bR, to factorize p. We get: p(z)=(z41)=(z2+1)(z21). We know that z2+1=0 has solutions z=±i. Instead
z21=0 has solutions x=±1. Hence, the four solutions of () are given by z=1,1,i,i, and p factorizes as p(z)=z41=(z1)(z+1)(zi)(z+i).

Definition 52: Multiplicity
Suppose that the polynomial pn factorizes as pn(z)=an(zz1)k1(zz2)k2(zzm)km with an0, z1,,zmC and k1,,kmN, ki1. In this case pn has degree n=k1++km=i=1mki. Note that zi is solves the equation pn(z)=0 exactly ki times. We call ki the multiplicity of the solution zi.

Example 53

The equation (z1)(z2)2(z+i)3=0 has 6 solutions:

  • z=1 with multiplicity 1
  • z=2 with multiplicity 2
  • z=i with multiplicity 3

5.7 Solving polynomial equations

The non-factorized version of the polynomial (5.13)p(z)=(z1)(z2)2(z+i)3 from Example 115 is (5.14)p(z)=z6(53i)z5+(515i)z4+(11+23i)z3(24+7i)z2+(128i)z+4i Question: How do we figure out the factorization at () if we are given p in the form at ()? More in general, consider the following question:

Question 54
The Fundamental Theorem of Algebra states that (5.15)pn(z)=0 has n complex solutions. How do we find such solutions in practice?

Answer: There is no general formula to solve () when n5. This is the content of the Abel-Ruffini Theorem.

Theorem 55: Abel-Ruffini
There is no elementary solution formula to the polynomial equation pn(z)=0, with pn polynomial of degree n, with n5.

Similarly to the Fundamental Theorem of Algebra, the proof of the Abel-Ruffini Theorem is out of reach for now. A proof can be carried out, for example, using Galois Theory.

There are however explicit formulas for solving () when pn has degree n=2,3,4.

5.7.1 Quadratic polynomials

Consider polynomial equations of order n=2, that is, equations of the form (5.16)az2+bz+c=0. When the coefficients a,b,c are real, the solutions are given by the well-known quadratic formula.

Proposition 56: Quadratic formula
Let a,b,cR,a0 and consider the equation at (). Define Δ:=b24acR. The following hold:

  1. If Δ>0 then () has two distinct real solutions z1,z2R given by z1=bΔ2a,z2=b+Δ2a.
  2. If Δ=0 then () has one real solution zR with multiplicity 2. Such solution is given by z=z1=z2=b2a.
  3. If Δ<0 then () has two distinct complex solutions z1,z2C given by z1=biΔ2a,z2=b+iΔ2a, where ΔR, since Δ>0.

In all cases, the polynomial at () factorizes as az2+bz+c=a(zz1)(zz2).

Example 57

Question. Solve the following equations:

  1. 3z26z+2=0
  2. 4z28z+4=0
  3. z2+2z+3=0

Solution.

  1. We have that Δ=(6)2432=12>0 Therefore the equation has two distinct real solutions, given by z=(6)±1223=6±126=1±33 In particular we have the factorization 3z26z+2=3(z133)(z1+33).

  2. We have that Δ=(8)2444=0. Therefore there exists one solution with multiplicity 2. This is given by z=(8)24=1. In particular we have the factorization 4z28x+4=4(z1)2.

  3. We have Δ=22413=8<0. Therefore there are two complex solutions given by z=2±i821=1±i2. In particular we have the factorization z2+2z+3=(z+1i2)(z+1+i2).

So far we have considered the polynomial equation (5.17)az2+bz+c=0, for a,b,cR and a0.

Question 58
What if a,b,cC?

If a,b,cC then we might have Δ:=b24acC. Therefore it is not clear how to compute Δ. However, we can still use the quadratic equation to solve (), as outlined in the next Proposition.

Proposition 59: Quadratic formula with complex coefficients
Let a,b,cC,a0. The two solutions to az2+bz+c=0 are given by z1=b+S12a,z2=b+S22a, where S1 and S2 are the two solutions to z2=Δ,Δ:=b24ac.

Remark 60
We want to check that the new formulas z1=b+S12a,z2=b+S22a,S12=S22=Δ, agree with the old formulas given in Proposition 116, when a,b,cR.

To this end, note that ΔR, when a,b,cR. In this case the equation z2=Δ has the following solutions:

  • If Δ>0 there are two real solutions S1=Δ,S2=Δ.
  • If Δ=0 then 0 is the only solution with multiplicity 2. Hence S1=S2=0.
  • If Δ<0, there are two complex solutions S1=iΔ,S2=iΔ.

Therefore the solutions z1=b+S12a,z2=b+S22a, given in Proposition 116 coincide with the ones given in Proposition 118.

Example 61
Question Find all the solutions to (5.18)12z2(3+i)z+(4i)=0.

Solution. We have Δ=((3+i))2412(4i)=8+6i8+2i=8i. Therefore ΔC. We have to find solutions S1 and S2 to the equation (5.19)z2=Δ=8i. We look for solutions of the form z=a+ib. Then we must have that z2=(a+ib)2=a2b2+2abi=8i. Thus a2b2=0,2ab=8. From the first equation we conclude that |a|=|b|. From the second equation we have that ab=4, and therefore a and b must have the same sign. Hence a=b, and 2ab=8a=b=±2. From this we conclude that the solutions to () are S1=2+2i,S2=22i. Hence the solutions to () are z1=3+i+S1212=3+i+S1=3+i+2+2i=5+3i, and z2=3+i+S2212=3+i+S2=3+i22i=1i. In particular, the given polynomial factorizes as 12z2(3+i)z+(4i)=12(zz1)(zz2)=12(z53i)(z1+i).

In the above example it was a bit laborious to compute S1 and S2. In and , we will explore a more general method to solve problems of the form zn=Δ.

5.7.2 Higher order polynomials

Consider now polynomial equations pn(z)=0, with pn of degree n=3,4. Although general solution formulas exist for these cases, they are exceedingly lengthy, making them impractical for manual calculations. For example, shows the solution formula to the quartic equation z4+az3+bz2+cz+d=0, when a,b,c,dR.

Figure 5.8: Quartic formula to solve z4+az3+bz2+cz+d=0. Image from Wikipedia.

A more productive use of time is learning how to perform long polynomial division. A quick tutorial is available here: Polynomial_division.pdf.

Example 62
Question. Divide 6z3+5z27 by 3z22z1.

Solution. Using polynomial division, see , we obtain 6z3+5z27=(3z22z1)(2z+3)+(8z4).

Figure 5.9: Example of polynomial long division between 6z3+5z27 and 3z22z1.

Sometimes, it is possible to solve equations of degree higher than 2, in case it is obvious from inspection that a certain number is a solution, e.g., when z=1,0,1 is a solution.

Example 63

Question. Consider the equation z37z2+6z=0.

  1. Check whether z=0,1,1 are solutions.
  2. Using your answer from Point 1, and polynomial division, find all the solutions.

Solution.

  1. By direct inspection we see that z=0 and z=1 are solutions.

  2. Since z=0 is a solution, we can factorize z37z2+6z=z(z27z+6). We could now use the quadratic formula on the term z27z+6 to find the remaining two roots. However, we have already observed that z=1 is a solution. Therefore z1 divides z27z+6. Using polynomial long division, see , we find that z27z+6z1=z6. Therefore the last solution is z=6, and z37z2+6z=z(z1)(z6).

Figure 5.10: Polynomial long division between z27z+6 and z1.

Example 64
Question. Find all the complex solutions to z37z+6=0.

Solution. It is easy to see z=1 is a solution. This means that z1 divides z37z+6. By using polynomial long division, see , we compute that z37z+6z1=z2+z6. We are now left to solve z2+z6=0. Using the quadratic formula, we see that the above is solved by z=2 and z=3. Therefore the given polynomial factorizes as z37z+6=(z1)(z2)(z+3).

Figure 5.11: Polynomial long division between z37z+6 and z1.
Example 65

Question. Consider the equation z3+5z2+(5i)z+(44i)=0.

  1. Check whether z=±i and z=0,±1 are solutions.
  2. By using polynomial division with complex coefficients, find all the solutions.

Solution.

  1. By direct inspection, we see that z=i is a solution.

  2. Since z=i is a solution, we know that zi divides z3+5z2+(5i)z+(44i). We now perform polynomial division by using the same method employed to divide polynomials with real coefficients: We just need to be mindful of the fact that coefficients are now complex, and thus, addition and multiplication have to be carried out in C. Using polynomial long division, see , we compute that z3+5z2+(5i)z+(44i)=(zi)(z2+(5+i)z+(44i)). We are now left to solve the second order equation z2+(5+i)z+(44i)=0. We compute that Δ=b24ac=(5+1)24(44i)=4(5+4i). We now need to solve z2=Δ. We look for a solution in the form of z=a+ib. z2=(a+ib)2=a2b2+2abi=8i. Thus a2b2=0,2ab=8. From the first equation we conclude that |a|=|b|. From the second equation we have that ab=4, and therefore a and b must have the same sign. Hence a=b, and 2ab=8a=b=±2. From this we conclude that the solutions to () are S1=2+2i,S2=22i. Hence the solutions to () are z1=3+i+S1212=3+i+S1=3+i+2+2i=5+3i, and z2=3+i+S2212=3+i+S2=3+i22i=1i. In particular, the given polynomial factorizes as 12z2(3+i)z+(4i)=12(zz1)(zz2)=12(z53i)(z1+i).

Figure 5.12: Polynomial long division between z3+5z2+(5i)z+(44i) and zi.

5.8 Roots of unity

Problem
Let nN. We want to find all complex solutions to (5.20)zn=1.

Note that z=1 is always a solution to () if n is even. In such case also z=1 is a solution. If we were only looking for solutions in R, these two would be the only solutions.

However, the Fundamental Theorem of Algebra, see Theorem 111, tells us that there are n complex solutions to ().

Question 66
Is there a way to find all n solutions?

Example 67
We have seen in Example 113 that the solutions to z4=1 are z=1,1,i,i. However, we could only compute these solutions thanks to the clever factorization z41=(z2+1)(z21)=(z+i)(zi)(z+1)(z1). It is not clear if and how this trick can be generalized to solve zn=1 for arbitrary nN.

The trick to find all n solutions to () is to use the exponential form.

Theorem 68
Let nN and consider the equation (5.21)zn=1. All the n solutions to () are given by zk=exp(i2πkn),k=0,,n1, where exp(x) denotes ex.

Proof
Rewrite 1 in exponential form: 1=|1|eiarg(1)=ei2πk,kZ. Therefore () is equivalent to zn=ei2πk. By the properties of the exponential, we see that the above is solved by zk=exp(i2πkn),kZ. By choosing k=0,,n1 we obtain n different solutions.

Definition 69
The n solutions to zn=1 are called the roots of unity.

Example 70
Question. Find all the solutions to z4=1.

Solution. The 4 solutions are given by zk=exp(i2πk4)=exp(iπk2), for k=0,1,2,3. We compute: z0=ei0=1,z1=eiπ2=i,z2=eiπ=1,z3=ei3π2=i. Note that for k=4 we would again get the solution z=ei2π=1.

Example 71
Question. Find all the solutions to z3=1.

Solution. The 3 solutions are given by zk=exp(i2πk3), for k=0,1,2. We compute: z0=ei0=1,z1=ei2π3,z2=ei4π3. We can write z1 and z2 in cartesian form: z1=ei2π3=cos(2π3)+isin(2π3)=12+32i and z2=ei4π3=cos(4π3)+isin(4π3)=1232i.

5.9 Roots in C

Let nN and cC. We want to give a meaning to cn. This means, we want to find all complex solutions to zn=c. The Fundamental Theorem of Algebra ensures that the above equation has n complex solutions. To find these solutions, we pass to the exponential form.

Theorem 72
Let nN, cC and consider the equation (5.22)zn=c. All the n solutions to () are given by zk=|c|nexp(iθ+2πkn),k=0,,n1, where |c|n is the n-th root of the real number |c|, and θ=arg(c).

Proof
Write c in exponential form: c=|c|eiθ=|c|ei(θ+2πk),kZ, where θ=arg(c). Therefore () is equivalent to zn=|c|ei(θ+2πk). By the properties of the exponential, we see that the above is solved by zk=|c|nexp(iθ+2πkn),kZ. By choosing k=0,,n1 we obtain n different solutions.

Warning: The n-th root function is multi-valued
The above result and the Fundamental Theorem of Algebra tell us that z0,,zn1 are n solutions to zn=c. This means that the complex number c has multiple n-th roots, exaclty n. In particular, the n-th root function in C is multi-valued: cn={z0,,zn1}.

Example 73
Question. Find all the zC such that z5=32.

Solution. Let c=32. We have |c|=|32|=32=25,θ=arg(32)=π. The 5 solutions are given by zk=(25)15exp(iπ1+2k5),kZ, for k=0,1,2,3,4. We get z0=2eiπ5z1=2ei3π5z2=2eiπ=2z3=2ei7π5z4=2ei9π5

Example 74
Question. Find all the zC such that z4=9(cos(π3)+isin(π3)).

Solution. Set c:=9(cos(π3)+isin(π3)). The complex number c is already in the trigonometric form, so that we can immediately obtain |c|=9,θ=arg(c)=π3. The 4 solutions are given by zk=94exp(iπ/3+2πk4)=3exp(iπ1+6k12) for k=0,1,2,3. We compute z0=3eiπ112z1=3eiπ712z2=3eiπ1312z3=3eiπ1912