Algorithms Series 15 Days Quick - Fifth Date Five Classic Search

Posted by atl_andy on Wed, 15 May 2019 18:49:17 +0200

Algorithms Series 15 Days Quick - Fifth Date Five Classic Search

From: http://blog.csdn.net/m13666368773/article/details/7516436

Do you know that in fact, there is an O(1) search, the so-called second kill.

Hash Search:

Yeah, he's hash lookup. When it comes to hash, you must mention hash functions. Ha-ha, this thing has already formed in our minds.

Inherent thinking. You must know the corresponding relationship in Hash.

For example, "5" is a number to be saved, and then I throw it to the hash function, which returns me a "2", then the "5" at this time.“

And "2" to establish a corresponding relationship, this relationship is the so-called "hash relationship", in practical applications also formed "2" is key, "5" is value.

Then some friends will ask how to do hashing. First of all, to do hashing must abide by two principles:

(1) Keys are as dispersed as possible, that is, I throw a "6" and "5" to you, and you return a "2", so the hash function is not perfect.

(2) Hash function is as simple as possible, that is to say, throw a "6" to you, and your hash function will take an hour to give me, which is not good.

In fact, there are five ways to do hash:

The first one is "direct address method".

It's easy to understand that key=Value+C; this "C" is a constant. Value+C is actually a simple hash function.

The second one is "dividing to get the remainder".

It's easy to understand, key=value%C; the explanation is the same as above.

The third is "digital analysis".

This is interesting, for example, a set of values 1 = 112233, 2 = 112633, and 3 = 119033.

For such numbers, we analyze the fluctuation of the two numbers in the middle, while the others remain unchanged. So we can take the value of the key.

key1=22,key2=26,key3=90.

The fourth one is "the square is in the middle". Ignore here. See fame.

The fifth is folding method.

This is interesting, for example, value=135790, requiring key to be a hash value of two digits. Then we change value to 13 + 57 + 90 = 160.

Then remove the high "1" and then key=60, haha, that's their hash relationship. The purpose of doing this is that key and every value are alike.

Close, to achieve the "hash address" as scattered as possible.

So-called often walking by the river, there are no wet shoes. Hash is the same. The design of your hash function is so good that it will hit the building one time or another. So the question thrown to us is

That is, if we resolve the hash address conflict.

In fact, there are also two common ways to resolve conflicts:

The first is "Open Address Method".

The so-called "open address" is actually an unused address in an array. That is to say, where a conflict occurs, the element that comes next (in two ways)

Linear detection and function detection) Look for the "open address" after the array and insert themselves into it.

The second is "link method".

It doesn't matter if you don't understand this for a while, so I'll introduce the principle of putting a pointer field on each element, where conflicts occur, and then the one that follows.

When an element throws its own data domain to the element in conflict, a list is formed where the conflict occurs.

There is so much verbosity above, that is to say, I want you to have some reference and means in the two aspects of "designing hash" and "resolving conflict".

So here's the code.

The design function adopts "dividing and residual method".

In the aspect of conflict, the method of "open address linear detection" is adopted.

using System;

using System.Collections.Generic;

using System.Linq;

using System.Text;



namespace HashSearch

{

     class Program

     {

         //"Division and Remainder"

         static int hashLength = 13;



         //Original data

         static List<int> list = new List<int>() { 13, 29, 27, 28, 26, 30, 38 };



         //Hash table length

         static int[] hash = new int[hashLength];



         static void Main(string[] args)

         {

             //Create hash

             for (int i = 0; i < list.Count; i++)

             {

                 InsertHash(hash, hashLength, list[i]);

             }



             Console.WriteLine("Hash Data:" + string.Join(",", hash));



             while (true)

             {

                 Console.WriteLine("\n Please enter the number you want to find:");

                 int result = int.Parse(Console.ReadLine());

                 var index = SearchHash(hash, hashLength, result);



                 if (index != -1)

                     Console.WriteLine("number" + result + "The location of the index is:" + index);

                 else

                     Console.WriteLine("Whining," + result + " stay hash No one found it!");



             }

         }



         ///<summary>

/// Hash table to retrieve data

///</summary>

///<param name="dic"></param>

///<param name="hashLength"></param>

///<param name="key"></param>

///<returns></returns>

         static int SearchHash(int[] hash, int hashLength, int key)

         {

             //hash function

             int hashAddress = key % hashLength;



             //To specify that the hash Adrress corresponding value exists but is not the key value, the open addressing method is used to solve the problem.

             while (hash[hashAddress] != 0 && hash[hashAddress] != key)

             {

                 hashAddress = (++hashAddress) % hashLength;

             }



             //Finding an open cell indicates a search failure

             if (hash[hashAddress] == 0)

                 return -1;

             return hashAddress;



         }



         ///<summary>

/// Data insertion into Hash table

///</summary>

/// <param-name="dic">hash table</param>

///<param name="hashLength"></param>

///<param name="data"></param>

         static void InsertHash(int[] hash, int hashLength, int data)

         {

             //hash function

             int hashAddress = data % 13;



             //If the key exists, it means that it has been occupied by others, and the conflict must be resolved at this time.

             while (hash[hashAddress] != 0)

             {

                 //Find it by open addressing

                 hashAddress = (++hashAddress) % hashLength;

             }



             //Store data in a dictionary

             hash[hashAddress] = data;

         }

     }

}

Result:

Index lookup:

When referring to "index", we estimate that the first reaction is "database index". Right, in fact, the primary key to establish "index" is to facilitate our search in massive data.

As for the knowledge of indexing, it is estimated that everyone knows better than I do. I'll give you a brief introduction.

We write our own algorithms to implement the three terms commonly used in index lookup:

First: main table, this is very simple, to find the object.

Second: Index items. Generally, we use functions to divide a main table into several sub-tables. Each sub-table establishes an index. This index is called index items.

Third: Index table, the collection of index items is index table.

Generally, "index item" contains three contents: index, start, length.

First: index, which is the key word that the index points to the main table.

Second: start, which is the location of index in the main table.

Third: length, which is the interval length of a subtable.

using System;

using System.Collections.Generic;

using System.Linq;

using System.Text;



namespace IndexSearchProgram

{

     class Program

     {

         ///<summary>

/// Index Item Entities

///</summary>

         class IndexItem

         {

             //Values corresponding to the main table

             public int index;

             //Starting position of main table record interval

             public int start;

             //Length of main table record interval

             public int length;

         }



         static void Main(string[] args)

         {

             Console.WriteLine("The original data are as follows:" + string.Join(",", students));





             int value = 205;



             Console.WriteLine("\n insert data" + value);



             //Insert 205 into a collection, overindex

             var index = insert(value);



             //If the insertion is successful, get the location of 205 elements

             if (index == 1)

             {

                 Console.WriteLine("\n Data after insertion:" + string.Join(",", students));

                 Console.WriteLine("\n Data element: 205 is located in the array " + indexSearch(205) + "position");

             }



             Console.ReadLine();

         }



         ///<summary>

/// Student master list

///</summary>

         static int[] students = {

                                    101,102,103,104,105,0,0,0,0,0,

                                    201,202,203,204,0,0,0,0,0,0,

                                    301,302,303,0,0,0,0,0,0,0

                                 };

         ///<summary>

/// Student Index Table

///</summary>

         static IndexItem[] indexItem = {

                                   new IndexItem(){ index=1, start=0, length=5},

                                   new IndexItem(){ index=2, start=10, length=4},

                                   new IndexItem(){ index=3, start=20, length=3},

                                 };



         ///<summary>

/// Find data

///</summary>

///<param name="key"></param>

///<returns></returns>

         public static int indexSearch(int key)

         {

             IndexItem item = null;



             //Establishment of indexing rules

             var index = key / 100;



             //First go to the index.

             for (int i = 0; i < indexItem.Count(); i++)

             {

                 if (indexItem[i].index == index)

                 {

                     item = new IndexItem() { start = indexItem[i].start, length = indexItem[i].length };

                     break;

                 }

             }



             //If item is null, the search fails in the index

             if (item == null)

                 return -1;



             for (int i = item.start; i < item.start + item.length; i++)

             {

                 if (students[i] == key)

                 {

                     return i;

                 }

             }

             return -1;

         }



         ///<summary>

/// Insert data

///</summary>

///<param name="key"></param>

///<returns></returns>

         public static int insert(int key)

         {

             IndexItem item = null;

             //Indexing rules

             var index = key / 100;

             int i = 0;

             for (i = 0; i < indexItem.Count(); i++)

             {

                 //Get the index

                 if (indexItem[i].index == index)

                 {

                     item = new IndexItem()

                     {

                         start = indexItem[i].start,

                         length = indexItem[i].length

                     };

                     break;

                 }

             }

             if (item == null)

                 return -1;

             //Update master table

             students[item.start + item.length] = key;

             //Update index table

             indexItem[i].length++;

             return 1;

         }

     }

}

Result:

ps: Hash lookup time complexity O(1).

Index lookup time complexity: In the case of Demo above, it is equal to O(n/3)+O(length)

Topics: Database

Programmer Think

Algorithms Series 15 Days Quick - Fifth Date Five Classic Search

Algorithms Series 15 Days Quick - Fifth Date Five Classic Search

Hot Topics