Arithmetic#
\(\lg\) is shorthand for \(\log_2\)
\(\log N\) is the power to which \(2\) must be raised to obtain \(N\).
\(\lceil \log N \rceil\) is the number of times \(N\) must be halved to obtain unity. This will be useful when a number is halved at each iteration of an algorithm.
\(\lceil \log (N + 1) \rceil\) is the number of bits in the binary representation of \(N\).
\(\lfloor \log N \rfloor\) is the depth of a complete binary tree with \(N\) nodes.
\(\log N = 1 + 1/2 + 1/3 + \dotsb + 1/N\) to within a constant factor.
Base-\(b\) representation of a non-negative integer#
Balanced base-\(b\) representation of a non-negative integer#
[ y ] 12-29-2023
Cadaeic Studios. “The Magic of Balanced Bases”.
Number of digits required to represent a non-negative integer in base \(b\)#
print(f' largest value')
print(f' 16-bit numbers {2** 16 - 1:85} = 2^ 16 - 1')
print(f' 20-bit numbers {2** 20 - 1:85} = 2^ 20 - 1')
print(f' 32-bit numbers {2** 32 - 1:85} = 2^ 32 - 1')
print(f' 40-bit numbers {2** 40 - 1:85} = 2^ 40 - 1')
print(f' 52-bit numbers {2** 52 - 1:85} = 2^ 52 - 1')
print(f' 53-bit numbers {2** 53 - 1:85} = 2^ 53 - 1')
print(f' 64-bit numbers {2** 64 - 1:85} = 2^ 64 - 1')
print(f'128-bit numbers {2**128 - 1:85} = 2^128 - 1')
print(f'256-bit numbers {2**256 - 1:85} = 2^256 - 1')
largest value
16-bit numbers 65535 = 2^ 16 - 1
20-bit numbers 1048575 = 2^ 20 - 1
32-bit numbers 4294967295 = 2^ 32 - 1
40-bit numbers 1099511627775 = 2^ 40 - 1
52-bit numbers 4503599627370495 = 2^ 52 - 1
53-bit numbers 9007199254740991 = 2^ 53 - 1
64-bit numbers 18446744073709551615 = 2^ 64 - 1
128-bit numbers 340282366920938463463374607431768211455 = 2^128 - 1
256-bit numbers 115792089237316195423570985008687907853269984665640564039457584007913129639935 = 2^256 - 1
How much does the size of a number change when its base changes?
The rule for converting logarithms from base \(a\) to base \(b\):
\(\log_b N = \frac{1}{\log_a b} \log_a N \iff \log_a N = \log_b N \times \log_a b\)
The size of integer \(N\) in base \(a\) is the same as its size in base \(b\) times the constant factor \(\log_a b\). Therefore, in big-\(O\) notation the base is irrelevant and the size is written simply as \(O(\log N)\).
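A quick numerical illustration (a sketch; the value of \(N\) is arbitrary): the size of \(N\) in decimal digits is about \(\log_{10} 2 \approx 0.301\) times its size in bits.

```python
import math

N = 10**50
bits = N.bit_length()    # size of N in base 2
digits = len(str(N))     # size of N in base 10

# The two sizes agree up to the constant factor log_10(2).
assert abs(digits / bits - math.log10(2)) < 0.01
```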
Addition#
Claim: the sum of any three single-digit numbers is at most two digits long#
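One way to verify the claim: in base \(b\) the largest single digit is \(b - 1\) and the largest two-digit number is \(b^2 - 1\), so

```latex
3(b - 1) = 3b - 3 \le b^2 - 1 \quad \text{for all } b \ge 2.
```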
Grade-School Binary Addition#
The rule that the sum of any three single-digit numbers is at most two digits long allows us to add two numbers in any base as follows: align their right-hand ends and perform a single right-to-left pass, computing the sum digit by digit and maintaining the overflow as a carry. Since each digitwise sum is at most two digits long, the carry is always a single digit, and so at any given step three single-digit numbers are added.
Carry:  1        1  1  1
           1  1  0  1  0  1    (53)
           1  0  0  0  1  1    (35)
        --------------------
        1  0  1  1  0  0  0    (88)
Complexity
Suppose both \(x\) and \(y\) are \(n\) bits long. Then \(x + y\) is at most \(n + 1\) bits long and each bit of this sum is computed in a fixed amount of time.
\(T(n) = c_1n + c_0 = O(n)\)
Is there a faster algorithm? In order to add two \(n\)-bit numbers they must at least be read and their sum written, which already requires \(n\) operations. Therefore, the addition algorithm is optimal up to multiplicative constants.

It is true that integers whose size in bits fits within the word length of today's computers (perhaps \(32\)) can be added in a single processor instruction. But it is often useful and necessary to handle numbers much larger than this, perhaps several thousand bits long. Adding and multiplying such large numbers on real computers is very much like performing the operations bit by bit.
Implementation
#include <stdio.h>

int main(void) {
    long b1;            /* first binary number, e.g. 110101 */
    long b2;            /* second binary number, e.g. 100011 */
    int i = 0;
    int carry = 0;
    int sum[24];        /* digits of the sum, least significant first */

    printf("Enter the first binary number: ");
    scanf("%ld", &b1);
    printf("Enter the second binary number: ");
    scanf("%ld", &b2);

    /* Single right-to-left pass: add digit by digit, carrying the overflow. */
    while (b1 != 0 || b2 != 0) {
        int digit = (int)(b1 % 10 + b2 % 10) + carry;
        sum[i++] = digit % 2;
        carry = digit / 2;
        b1 /= 10;
        b2 /= 10;
    }
    if (carry != 0)
        sum[i++] = carry;
    if (i == 0)         /* both inputs were zero */
        sum[i++] = 0;

    printf("Sum: ");
    while (--i >= 0)
        printf("%d", sum[i]);
    printf("\n");
    return 0;
}
Multiplication#
Claim: the product of any two single-digit numbers is at most two digits long#
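One way to verify the claim: in base \(b\) the largest product of two single digits is \((b - 1)^2\), and

```latex
(b - 1)^2 = b^2 - 2b + 1 \le b^2 - 1 \quad \text{for all } b \ge 1,
```

so the product always fits in two digits.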
Grade-School Binary Multiplication#
Algorithm
Compute the product \(z = xy\) of two non-negative integers \(x\) and \(y\) in base-\(b\) representation. Right-align \(x\) and \(y\), then construct a rhombus-shaped array of digitwise products via repeated multiplications of two single-digit numbers. To complete the multiplication, sum the columns with carry. We end up summing columns to construct integers

\( \begin{aligned} w_n = \sum_{i+j=n} x_i y_j \end{aligned} \)

where \(i\) and \(j\) run through all indices in the respective digit lists for \(x\) and \(y\). The sequence \((w_n)\) is not yet in general the base-\(b\) representation of the product \(z\). We must still perform the \(w_n\) additions with carry. A column sum \(w_n\) affects not only the digit \(z_n\) but sometimes higher-order digits beyond it. For example, if
\(w_0 = b + 5\)
then \(z_0 = 5\) but a \(1\) must be added to \(z_1\); that is, a carry occurs.
\( \begin{array}{rrrrrrrrrr} &&&&&& & & & x_{d-1} & x_{d-2} & x_{d-3} & x_{d-4} & \dots & x_3 & x_2 & x_1 & x_0 \\ &&&&&& & & & y_{d-1} & y_{d-2} & y_{d-3} & y_{d-4} & \dots & y_3 & y_2 & y_1 & y_0 \\ \hline &&&&&& & & & x_{d-1} y_0 & x_{d-2} y_0 & x_{d-3} y_0 & x_{d-4} y_0 & \dots & x_3 y_0 & x_2 y_0 & x_1 y_0 & x_0 y_0 \\ &&&&&& & & x_{d-1} y_1 & x_{d-2} y_1 & x_{d-3} y_1 & x_{d-4} y_1 & x_{d-5} y_1 & \dots & x_2 y_1 & x_1 y_1 & x_0 y_1 \\ &&&&&& & x_{d-1} y_2 & x_{d-2} y_2 & x_{d-3} y_2 & x_{d-4} y_2 & x_{d-5} y_2 & x_{d-6} y_2 & \dots & x_1 y_2 & x_0 y_2 \\ &&&&&& x_{d-1} y_3 & x_{d-2} y_3 & x_{d-3} y_3 & x_{d-4} y_3 & x_{d-5} y_3 & x_{d-6} y_3 & x_{d-7} y_3 & \dots & x_0 y_3 \\ &&&&&& \vdots & \vdots & \vdots & \vdots \\ & x_{d-1} y_{d-1} & x_{d-2} y_{d-1} & x_{d-3} y_{d-1} & x_{d-4} y_{d-1} & \dots & x_3 y_{d-1} & x_2 y_{d-1} & x_1 y_{d-1} & x_0 y_{d-1} \\ \hline z_{2d-1} & z_{2d-2} & z_{2d-3} & z_{2d-4} & z_{2d-5} & \dots & z_{d+2} & z_{d+1} & z_d & z_{d-1} & z_{d-2} & z_{d-3} & z_{d-4} & \dots & z_3 & z_2 & z_1 & z_0 \\ \end {array} \)
Complexity
If each of \(x\) and \(y\) to be multiplied has \(d\) digits in base \(b\) then there are \(d\) intermediate rows with lengths of up to \(2d\) digits, accounting for the shifts. The total time taken to add up these rows, two numbers at a time, is
\(\underbrace{O(d) + O(d) + \dotsb + O(d)}_{d - 1 \text{ times}} = O(d^2)\)
In other words, the total number of operations required to calculate \(xy\) is \(O(d^2)\) since that is how many entries appear in the rhombus-shaped array. Here, an operation is either a multiply or an add of two numbers each of size \(b\). Such a digitwise multiply is called a size-\(b\) multiply.
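The scheme above can be sketched in Python (an illustration, not a reference implementation; digits are stored least significant first, and the function name is mine):

```python
def gradeschool_multiply(x_digits, y_digits, b=10):
    """Multiply two base-b numbers given as digit lists, least significant first."""
    # Column sums w_n = sum over i + j = n of x_i * y_j (the rhombus columns).
    w = [0] * (len(x_digits) + len(y_digits))
    for i, xi in enumerate(x_digits):
        for j, yj in enumerate(y_digits):
            w[i + j] += xi * yj
    # Propagate carries to obtain the base-b digits z_n.
    z, carry = [], 0
    for wn in w:
        total = wn + carry
        z.append(total % b)
        carry = total // b
    while carry:
        z.append(carry % b)
        carry //= b
    while len(z) > 1 and z[-1] == 0:   # strip leading zeros
        z.pop()
    return z

# 53 * 35 = 1855; digits least significant first
assert gradeschool_multiply([3, 5], [5, 3]) == [5, 5, 8, 1]
```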
Squaring#
Squaring is the special case \(y = x\) of grade-school multiplication; the rhombus array becomes symmetric:

\( \begin{array}{rrrrrrrrrr} &&&&&& & & & x_{d-1} & x_{d-2} & x_{d-3} & x_{d-4} & \dots & x_3 & x_2 & x_1 & x_0 \\ &&&&&& & & & x_{d-1} & x_{d-2} & x_{d-3} & x_{d-4} & \dots & x_3 & x_2 & x_1 & x_0 \\ \hline &&&&&& & & & x_{d-1} x_0 & x_{d-2} x_0 & x_{d-3} x_0 & x_{d-4} x_0 & \dots & x_3 x_0 & x_2 x_0 & x_1 x_0 & x_0 x_0 \\ &&&&&& & & x_{d-1} x_1 & x_{d-2} x_1 & x_{d-3} x_1 & x_{d-4} x_1 & x_{d-5} x_1 & \dots & x_2 x_1 & x_1 x_1 & x_0 x_1 \\ &&&&&& & x_{d-1} x_2 & x_{d-2} x_2 & x_{d-3} x_2 & x_{d-4} x_2 & x_{d-5} x_2 & x_{d-6} x_2 & \dots & x_1 x_2 & x_0 x_2 \\ &&&&&& x_{d-1} x_3 & x_{d-2} x_3 & x_{d-3} x_3 & x_{d-4} x_3 & x_{d-5} x_3 & x_{d-6} x_3 & x_{d-7} x_3 & \dots & x_0 x_3 \\ &&&&&& \vdots & \vdots & \vdots & \vdots \\ & x_{d-1} x_{d-1} & x_{d-2} x_{d-1} & x_{d-3} x_{d-1} & x_{d-4} x_{d-1} & \dots & x_3 x_{d-1} & x_2 x_{d-1} & x_1 x_{d-1} & x_0 x_{d-1} \\ \hline z_{2d-1} & z_{2d-2} & z_{2d-3} & z_{2d-4} & z_{2d-5} & \dots & z_{d+2} & z_{d+1} & z_d & z_{d-1} & z_{d-2} & z_{d-3} & z_{d-4} & \dots & z_3 & z_2 & z_1 & z_0 \\ \end{array} \)
Al Khwarizmi’s multiplication by repeated halvings of the multiplier#
Multiply two decimal numbers \(x\) and \(y\): write them next to each other, then repeat the following step: divide the first number by \(2\), rounding down the result, and double the second number. Continue until the first number reaches unity; then strike out all the rows in which the first number is even and add up whatever remains in the second column.
                        parity of multiplier 11
11     13               11 = 5 x 2 + 1
 5     26                5 = 2 x 2 + 1
 2     52 (strike out)   2 = 1 x 2 + 0
 1    104                1 = 0 x 2 + 1
      ---
      143
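The procedure can be sketched in Python (the function name is mine):

```python
def al_khwarizmi_multiply(x, y):
    """Multiply by repeatedly halving x (rounding down) and doubling y."""
    total = 0
    while x >= 1:
        if x % 2 == 1:     # keep the rows where the first number is odd
            total += y
        x //= 2            # halve the multiplier, rounding down
        y *= 2             # double the multiplicand
    return total

assert al_khwarizmi_multiply(11, 13) == 143
```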
Multiplication à la Français#
This algorithm implements the following recursive rule.
\( x \cdot y = \begin{cases} 2(x \cdot \lfloor y/2 \rfloor) & \text{if \textit{y} is even} \\ x + 2(x \cdot \lfloor y/2 \rfloor) & \text{if \textit{y} is odd} \\ \end {cases} \)
INPUT   two n-bit integers x and y, where y >= 0
OUTPUT  their product

MULT(x, y)
1   if y = 0
2       return 0
3
4   z = MULT(x, floor(y/2))
5
6   if even(y)
7       return 2z
8   else
9       return 2z + x
EXAMPLE

The algorithm decomposes the multiplication as

\(x \cdot 25 = x \cdot 16 + x \cdot 8 + x \cdot 1\)

where the terms \(x \cdot 2^i\) come from repeated doubling.
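A direct Python transcription of MULT (a sketch):

```python
def mult(x, y):
    """Multiply via recursive halving of y (doubling is a left shift)."""
    if y == 0:
        return 0
    z = mult(x, y // 2)    # recurse on floor(y/2)
    if y % 2 == 0:
        return 2 * z
    return 2 * z + x

assert mult(13, 25) == 325
```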
Correctness
The recursive rule is transparently correct. Checking that the algorithm is correct is a matter of verifying that it mimics the rule and that it handles the base case \(y = 0\) properly.
Complexity
The algorithm terminates after \(n\) recursive calls because at each call \(y\) is halved (i.e., its number of bits is decreased by one). Each recursive call requires the following operations:
a division by \(2\) (a right shift)
a test for odd/even (looking up the last bit)
a multiplication by \(2\) (a left shift)
possibly, one addition
a total of \(O(n)\) bit operations. The total running time is
\(T(n) = O(n^2)\)
Can we do better? Intuitively, it seems that multiplication requires adding about \(n\) multiples of one of the inputs, and we know that each addition is linear, so it would appear that \(n^2\) bit operations are inevitable. But we will see that we can do better.
Division#
To divide an integer \(x\) by an integer \(y \ne 0\) means to find a quotient \(q\) and a remainder \(r\), where \(x = yq + r\) and \(0 \le r \lt y\).
INPUT   two n-bit integers x and y, where y >= 1
OUTPUT  the quotient and the remainder of x divided by y

DIVIDE(x, y)
1   if x = 0
2       return (q, r) = (0, 0)
3
4   (q, r) = DIVIDE(floor(x/2), y)
5   q = 2q
6   r = 2r
7   if odd(x)
8       r = r + 1
9   if r >= y
10      r = r - y
11      q = q + 1
12  return (q, r)
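A direct Python transcription of DIVIDE (a sketch):

```python
def divide(x, y):
    """Return (q, r) with x = y*q + r and 0 <= r < y, via recursive halving of x."""
    if x == 0:
        return (0, 0)
    q, r = divide(x // 2, y)   # recurse on floor(x/2)
    q, r = 2 * q, 2 * r        # undo the halving
    if x % 2 == 1:
        r += 1                 # restore the bit lost to the floor
    if r >= y:                 # r can exceed y by at most y - 1 here
        r -= y
        q += 1
    return (q, r)

assert divide(100, 7) == (14, 2)
```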
Complexity

Like multiplication, division takes quadratic time.
\(T(n) = O(n^2)\)
Terms#
[ w ] Balanced Ternary
[ w ] Binary Multiplier
[ w ] Multiplication Algorithm
[ w ] Multiply-Accumulate
[ w ] Signed-Digit Representation
Acknowledgements#
2005
Crandall, Richard & Carl Pomerance. Prime Numbers: A Computational Perspective. Springer.
2006
Dasgupta, Sanjoy; Christos Papadimitriou; & Umesh Vazirani. Algorithms. Chapter 1. McGraw-Hill.
2018
Rosen, Kenneth. Discrete Mathematics and its Applications 8e. McGraw-Hill.
2023
Vaughan, Robert. A Course of Elementary Number Theory.