CoCalc Logo Icon
StoreFeaturesDocsShareSupportNewsAboutSign UpSign In

Real-time collaboration for Jupyter Notebooks, Linux Terminals, LaTeX, VS Code, R IDE, and more,
all in one place. Commercial Alternative to JupyterHub.

| Download
Views: 17362

Math 152: Intro to Mathematical Software

2017-03-08

Kiran Kedlaya; University of California, San Diego

adapted from lectures by William Stein, University of Washington

** Lecture 24: Algebra and number theory (prep for cryptography): part 1**

(a/k/a preview of Math 187B, to be offered for the first time next quarter)

Announcements:

  • This week's peer grading due Thursday, March 9 at 8pm, as usual. (Ignore the date typo on the HW.)

  • Guest lectures next week:

    • Monday, March 13: Kartik Venkatram

    • Wednesday, March 15: Alina Bucur

  • No sections on Monday, March 13.

  • Office hours next week:

    • Zonglin: Thursday, March 16, 3:30-5:30 in APM 5748.

    • Me: Friday, March 17, 11-12 in APM 7202.

    • Peter: Friday, March 17, 12-1 and 3-4 in APM 6132.

  • Next week's homework due Friday, March 17 at 8pm. Peer grading due Sunday, March 19 at 8pm.

    • The first half of this assignment is "normal" (number theory and cryptography).

    • The second half is freeform: create your own lecture! See the assignment for details.

  • Please do your course evaluation (CAPE)! Evaluations close Monday, March 20 at 8am.

  • HW1-6 grades are available on TritonEd, plus some attendance and peer grading scores. Remember, there is no final exam for this course.

A little number theory

It is very very easy to compute the greatest common divisor of two integers, even very large ones, because of the Euclidean algorithm.

m = 2^300 + 23 n = 2^325 - 73 gcd(m,n)
1

The same technique can be used to find, given two integers mm and nn, two other integers aa and bb such that am+bn=gcd(m,n)am+bn = \gcd(m,n). This is sometimes called the extended Euclidean algorithm, which helps explain the notation in Sage.

d,a,b = xgcd(m,n)
d
1
a,b
(8912264838355107109259232009923446053807867955608007127038327247724194309508961925753300183976782, -265606189917180154003478050527675332242484925854444716186473585597401687786250171832838663)
a*m+b*n
1
xgcd(75, 20)
(5, -1, 4)

These values can be used to make explicit the Chinese remainder theorem: if gcd(m,n)=1\gcd(m,n) = 1, then every system of equations of the form [ x \equiv x_1 \pmod{m}, \qquad x \equiv x_2 \pmod{n} ] has a solution xZx \in \mathbb{Z} (which is unique up to adding a mulitple of mnmn).

x1 = 37 x2 = 265 x = x1*b*n + x2*a*m print(x%m, x%n)
(37, 265)

By contrast, finding the prime factorization of a positive integer can be difficult. Not always, though.

factor(m) ## Takes a bit longer.
3 * 71 * 16428731586668563557701 * 582123405828207087405156630161072170828764652011657810830061815423
factor(n) ## Takes even longer, we won't wait for it.
Error in lines 1-1 Traceback (most recent call last): File "/projects/sage/sage-7.5/local/lib/python2.7/site-packages/smc_sagews/sage_server.py", line 982, in execute exec compile(block+'\n', '', 'single') in namespace, locals File "", line 1, in <module> File "/projects/sage/sage-7.5/local/lib/python2.7/site-packages/sage/arith/misc.py", line 2469, in factor int_ = int_, verbose=verbose) File "sage/rings/integer.pyx", line 3722, in sage.rings.integer.Integer.factor (/projects/sage/sage-7.5/src/build/cythonized/sage/rings/integer.c:24549) F = factor_using_pari(n, int_=int_, debug_level=verbose, proof=proof) File "sage/rings/factorint.pyx", line 352, in sage.rings.factorint.factor_using_pari (/projects/sage/sage-7.5/src/build/cythonized/sage/rings/factorint.c:6854) p, e = n._pari_().factor(proof=proof) File "sage/libs/cypari2/gen.pyx", line 4200, in sage.libs.cypari2.gen.gen.factor (/projects/sage/sage-7.5/src/build/cythonized/sage/libs/cypari2/gen.c:124611) sig_on() File "src/cysignals/signals.pyx", line 97, in cysignals.signals.sig_raise_exception (build/src/cysignals/signals.c:1303) KeyboardInterrupt
factor(n+2) ## But this one is easy!
3 * 1597 * 38149 * 373972150436294039317292792855084259263821049800740324243817706369726376887048555985496579

What makes the last example particularly easy is that you can test for the very small prime factors quickly (trial division), and then it turns out to be much easier to tell whether or not a big number is prime than to actually factor it. There are other features that make some numbers easier to factor than others, but more on this later.

Modular exponentiation

One reason for this is that it is easy to do modular exponentiation, i.e., the remainder of a^b upon division by c.

How not to do this:

(2^m) % n ## No dice
Error in lines 1-1 Traceback (most recent call last): File "/projects/sage/sage-7.5/local/lib/python2.7/site-packages/smc_sagews/sage_server.py", line 982, in execute exec compile(block+'\n', '', 'single') in namespace, locals File "", line 1, in <module> File "sage/rings/integer.pyx", line 2040, in sage.rings.integer.Integer.__pow__ (/projects/sage/sage-7.5/src/build/cythonized/sage/rings/integer.c:14117) raise RuntimeError("exponent must be at most %s" % sys.maxsize) RuntimeError: exponent must be at most 9223372036854775807

What does work:

mod(2,n)^m
9905086126956094233499195448340275387614834137875862846928339996679518341373478678353938460510230
## Or use pure python. pow(2, m, n)
9905086126956094233499195448340275387614834137875862846928339996679518341373478678353938460510230
## Or do this in abstract-algebra style, by creating the ring of integers mod n and calculating there. R = IntegerModRing(n) R(2)^m
9905086126956094233499195448340275387614834137875862846928339996679518341373478678353938460510230

Question: how is this possible? Even working mod n, directly computing 2×...×22 \times ... \times 2 with mm factors is not feasible.

One application of this is the Fermat test for primality. This is based on the little Fermat theorem: if pp is a prime number, then for any positive integer aa, apaa^p - a is divisible by pp. (In the language of congruences, apa(modp)a^p \equiv a \pmod{p}.)

One possible proof (for the case a=2a=2, but this can be extended to a general argument): in the binomial expansion [ (x+y)^p = x^p + \binom{p}{1} x^{p-1}y + \cdots + \binom{p}{p-1} xy^{p-1} + y^p ] all of the intermediate coefficients are divisible by pp (so plugging in x=y=1x=y=1 gives the claim). For example...

P.<x,y> = PolynomialRing(ZZ) ## ZZ = the ring of integers (x+y)^7
x^7 + 7*x^6*y + 21*x^5*y^2 + 35*x^4*y^3 + 35*x^3*y^4 + 21*x^2*y^5 + 7*x*y^6 + y^7
P.<x,y> = PolynomialRing(IntegerModRing(7)) (x+y)^7
x^7 + y^7

The contrapositive of the little Fermat theorem is: if apaa^p-a is not divisible by pp, then pp is not prime. This test works extremely well!

mod(2,m)^m
287232307696181918529284364822969746470277192886883388361201039121560050435826997842095303
mod(2,n)^n ## Couldn't factor it, but definitely not prime
46062872339940065393677818108700647013368935224453932258385083576658008611496018876986624343170833
is_prime(n) ## Uses this and some other methods
False

This method is not foolproof: it can return false positives for primality. Look up Carmichael numbers.

mod(2,561)^561
2
mod(3,561)^561
3
mod(17, 561)^561
17

Consequently, this method cannot be used to certify that a particular number is prime, only that it is not prime. There are efficient algorithms that can certify primality, but they are somewhat harder to describe. One important one is the (Agrawal-Kayal-Saxena (AKS) algorithm)[https://en.wikipedia.org/wiki/AKS_primality_test], which is both extremely simple and polynomial-time (though not as efficient in practice as some other methods). Kayal and Saxena were undergraduates at the time they discovered it (2002)!

One way to say this is that "prime numbers are easy to factor". The reason this is not an empty statement is because you have to know when to stop factoring!

In any case, the Fermat test is so effective that any number that passes it is probably prime, and this is sometimes good enough (though not for cryptography!).

for n in range(10^4): m = 10^500 + n if mod(2,m)^m == 2 and not is_prime(m): print m print("no more examples found")
no more examples found

The fact that prime numbers are (relatively) easy to identify makes it possible to have functions like this:

next_prime(10^125) -10^125
237
%md ## Multiplicative order and primitive roots

Multiplicative order and primitive roots

Let a,na,n be integers with gcd(a,n)=1\gcd(a,n) = 1. Using the extended Euclidean algorithm as above, one can find a multiplicative inverse of aa modulo nn, i.e., a value bb such that ab1(modn)ab \equiv 1 \pmod{n}. In fact, Sage will do this automatically.

a = 577 n = 6825 gcd(a,n)
1
mod(a,n)^(-1)
2413
## Or if you prefer abstract algebra syntax: R = IntegerModRing(n) R(a)^(-1)
2413

As a consequence of the existence of the multiplicative inverse, there always exists a positive integer dd such that ad1(modn)a^d \equiv 1 \pmod{n}. (There must be two powers that coincide mod nn, and we can cancel the powers of aa from one side.)

If nn is prime, the little Fermat theorem implies that d=n1d = n-1 always works (although it need not be the smallest such value; more on this in a moment). For general nn, there is a generalization of the Fermat theorem due to Euler: we have aϕ(n)1(modn)a^{\phi(n)} \equiv 1 \pmod{n} where ϕ(n)\phi(n) denotes the Euler phi function (or totient function): if nn has prime factorization p1e1×p2e2×p_1^{e_1} \times p_2^{e_2} \times \cdots, then [ \phi(n) = (p_1 - 1)p_1^{e_1}-1(p_2-1)p_2^{e_2-1} \cdots. ] For those who know group theory: recall that this works because the residue classes mod nn which have no common factor with nn form a group under multiplication, and ϕ(n)\phi(n) is its order.

factor(561) euler_phi(561)
3 * 11 * 17 320
list(factor(561))
[(3, 1), (11, 1), (17, 1)]
for (d,e) in factor(561): print euler_phi(d), 560 % euler_phi(d) #Aha!
2 0 10 0 16 0

So now it makes sense to consider the smallest positive integer dd such that ad1(modn)a^d \equiv 1 \pmod{n}. This integer must divide ϕ(n)\phi(n), otherwise the remainder of ϕ(n)\phi(n) mod dd would be an even smaller value. (Or use group theory!) This dd is called the multiplicative order of dd mod nn.

for d in range(1, 17): print (d, multiplicative_order(mod(d,17)))
(1, 1) (2, 8) (3, 16) (4, 4) (5, 16) (6, 16) (7, 16) (8, 8) (9, 8) (10, 16) (11, 16) (12, 16) (13, 4) (14, 16) (15, 8) (16, 2)

Important result: if nn is prime, then there is always at least one value of aa for which the multiplicative order of aa mod nn is equal to the maximum possible value ϕ(n)=n1\phi(n) = n-1. Any such aa is called a primitive root mod nn. These play an important role in the use of discrete logarithms in cryptography.

(Abstract algebra interpretation; if nn is prime, then (Z/nZ)(\ZZ/n\ZZ)^* is a cyclic group!)