Developers Field: Insertion sort

Insertion sort is an elementary sorting algorithm. It has a time complexity of Θ(n²), thus being slower than heapsort, merge sort and also shellsort. Insertion sort is well suited for sorting small data sets or for the insertion of new elements into a sorted sequence.

Let a₀, ..., a_n-1 be the sequence to be sorted. At the beginning and after each iteration of the algorithm the sequence consists of two parts: the first part a₀, ..., a_i-1 is already sorted, the second part a_i, ..., a_n-1 is still unsorted (i element

0, ..., n).

In order to insert element a_i into the sorted part, it is compared with a_i-1, a_i-2 etc. When an element a_j with a_j

a_i is found, a_i is inserted behind it. If no such element is found, then a_i is inserted at the beginning of the sequence.

After inserting element a_i the length of the sorted part has increased by one. In the next iteration, a_i+1 is inserted into the sorted part etc. While at the beginning the sorted part consists of element a₀ only, at the end it consists of all elements a₀, ..., a_n-1.

Example: The following table shows the steps for sorting the sequence 5 7 0 3 4 2 6 1. On the left side the sorted part of the sequence is shown in red. For each iteration, the number of positions the inserted element has moved is shown in brackets. Altogether this amounts to 17 steps.

5	7	0	3	4	2	6	1	(0)
5	7	0	3	4	2	6	1	(0)
0	5	7	3	4	2	6	1	(2)
0	3	5	7	4	2	6	1	(2)
0	3	4	5	7	2	6	1	(2)
0	2	3	4	5	7	6	1	(4)
0	2	3	4	5	6	7	1	(1)
0	1	2	3	4	5	6	7	(6)

The following simulation illustrates the realization of the algorithm. The corresponding program is given subsequently.

Simulation of insertion sort

The following function insertionsort sorts an integer array a[0], ..., a[n-1].

The sorting function is encapsulated in a class InsertionSorter. The method sort passes the array to be sorted to an array a, sets n to its length and calls insertionsort.

The statement to sort an array b with insertion sort is

InsertionSorter.sort(b);

The program follows.

public class InsertionSorter
{
    private static int[] a;
    private static int n;

    public static void sort(int[] a0)
    {
        a=a0;
        n=a.length;
        insertionsort();
    }

    private static void insertionsort()
    {
        int i, j, t;
        for (i=1; i<n; i++)
        {
            j=i;
            t=a[j];
            while (j>0 && a[j-1]>t)
            {
                a[j]=a[j-1];
                j--;
            }
            a[j]=t;
        }
    }

}    // end class InsertionSorter

The worst case occurs when in every step the proper position for the element that is inserted is found at the beginning of the sorted part of the sequence. I.e. in the while-loop sequences of length 1, 2, 3, ..., n-1 are scanned. Altogether, these are (n-1)·n / 2 element

Θ(n²) operations. This case occurs when the original sequence is sorted in decreasing order.

It is possible to find the inserting position of element a_i faster, namely by binary search. However, moving the elements to the right in order to make room for the element to be inserted takes linear time anyway.

The exact number of steps required by insertion sort is given by the number of inversions of the sequence.

Definition: Let a = a₀, ..., a_n-1 be a finite sequence. An inversion is a pair of index positions, where the elements of the sequence are out of order. Formally, an inversion is a pair (i, j), where i < j and a_i > a_j.¹)

Example: Let a = 5, 7, 4, 9, 7. Then, (0, 2) is an inversion, since a₀ > a₂, namely 5 > 4. Also, (1, 2) is an inversion, since 7 > 4, and (3, 4) is an inversion, since 9 > 7. There are no other inversions in this sequence.

We now count the inversions (i, j) of a sequence a separately for each position j. The resulting value of v_j gives the number of elements a_i to the left of a_j which are greater than a_j.

For instance, for the sequence a = 5, 7, 4, 9, 7 we have v₂ = 2, since the two inversions (0, 2) and (1, 2) have 2 as their second component. Here, 5 and 7 are greater than 4.

Definition: The inversion sequence v = v₀, ..., v_n-1 of a sequence a = a₀, ..., a_n-1 is defined by

v_j = |{ (i, j) | i < j and

a_i > a_j }|

for j = 0, ..., n-1.

Example: The above sequence a = 5, 7, 4, 9, 7 has the inversion sequence v = 0, 0, 2, 0, 1.

Obviously, we have v_i

i for all i = 0, ..., n-1. If all v_i are 0, then and only then the sequence a is sorted. If the sequence a is a permutation, then it is uniquely determined by its inversion sequence v. The permutation n-1, ..., 0 has inversion sequence 0, ..., n-1.

Theorem: Let a = a₀, ..., a_n-1 be a sequence and v = v₀, ..., v_n-1 be its inversion sequence. Then the number of steps T(a) required by insertion sort to sort sequence a is

T(a) = sum over

_{i = 0, ..., n-1} v_i

Proof: Obviously, insertion sort takes v_i steps in each iteration i. Thus, the overall number of steps is equal to the sum of all v_i.

Example: The following table shows the sequence a of the first example and its corresponding inversion sequence. For instance,v₅ = 4 since 4 elements to the left of a₅ = 2 are greater than 2 (namely 5, 7, 3 and 4). The total number of inversions is 17.

`i`	0	1	2	3	4	5	6	7
`a`_i	5	7	0	3	4	2	6	1
`v`_i	0	0	2	2	2	4	1	6

Thus, insertion sort requires 17 steps to sort this sequence (see first example).

http://www.iti.fh-flensburg.de/lang/algorithmen/sortieren/insert/insertionen.htm

http://www.programiz.com/article/insertion-sort-algorithm-programming

Monday, 17 November 2014

Insertion sort

Idea

Implementation

Simulation of insertion sort

Analysis

No comments:

Post a Comment

Recent Post

Should I use ConfigureAwait(true) or ConfigureAwait(false)?

Report Abuse

5	7	0	3	4	2	6	1	(0)
5	7	0	3	4	2	6	1	(0)
0	5	7	3	4	2	6	1	(2)
0	3	5	7	4	2	6	1	(2)
0	3	4	5	7	2	6	1	(2)
0	2	3	4	5	7	6	1	(4)
0	2	3	4	5	6	7	1	(1)
0	1	2	3	4	5	6	7	(6)

5	7	0	3	4	2	6	1	(0)
5	7	0	3	4	2	6	1	(0)
0	5	7	3	4	2	6	1	(2)
0	3	5	7	4	2	6	1	(2)
0	3	4	5	7	2	6	1	(2)
0	2	3	4	5	7	6	1	(4)
0	2	3	4	5	6	7	1	(1)
0	1	2	3	4	5	6	7	(6)

5	7	0	3	4	2	6	1	(0)
5	7	0	3	4	2	6	1	(0)
0	5	7	3	4	2	6	1	(2)
0	3	5	7	4	2	6	1	(2)
0	3	4	5	7	2	6	1	(2)
0	2	3	4	5	7	6	1	(4)
0	2	3	4	5	6	7	1	(1)
0	1	2	3	4	5	6	7	(6)