A Small World is inspired by the concept of “six degrees of separation,” which suggests that any two people on Earth are, on average, six or fewer social connections apart. In graph theory, a small-world network is characterized by a high clustering coefficient and short average path lengths between nodes. To get more insights into small-world networks, we can refer to the Watts Strogatz Model, which is a mathematical model that generates graphs with small-world properties.
On top of the small-world concept, we have a problem to solve: similarity search in high-dimensional spaces. Similarity Search is a common task in various applications, such as image retrieval, recommendation systems, and natural language processing. The goal is to find the most similar items to a given query item based on a defined similarity metric. The state-of-the-art solution to this problem is the Hierarchical Navigable Small World (HNSW) algorithm. HNSW is designed to efficiently find approximate nearest neighbors in high-dimensional spaces by leveraging the properties of small-world networks.
Nearest Neighbor Search is a fundamental problem in machine learning and data mining, where the goal is to find the closest data points (neighbors) to a given query point based on a defined distance metric (e.g., Euclidean distance).
Skip Lists are a probabilistic data structure that allows for fast search, insertion, and deletion operations by maintaining multiple levels of linked lists. The data in skip lists is organized in layers, where each higher layer acts as an “express lane” for the lower layers, enabling quicker traversal through the list.