Go KNN

TODO

CI for SIMD
MIPS multithread and SIMD

For GPU implementations, see:

FAISS - https://github.com/facebookresearch/faiss
Annoy - https://github.com/spotify/annoy

Installation

go get github.com/cartersusi/go-knn

Usage

Importing

import "github.com/cartersusi/go-knn"

Creating Tensors

Supported Scalars:

float32
float64

Supported Dimensions/Ranks:

1
2

Matrix:

matrix := [][]float32{
	{0.1, 0.2, 0.3, 0.4},
	{0.4, 0.5, 0.6, 0.7},
}
m := &knn.Tensor[float32]{}
m.New(matrix)

Vectors:

vector := []float32{0.2, 0.3, 0.4, 0.5}
v := &knn.Tensor[float32]{}
v.New(vector)

Import/Export

v, err := knn.Import[float32]("vector.tensor")

err = knn.Export(data, "data.tensor")

Searching

Supported SIMD:

ARM NEON, see more at go-simd

New Instance

s := &knn.Search{
	Data: m,		  // 2D Tensor 
	Query: v,		  // 1D Tensor
	Multithread: true,	  // Enable Multithreading (default = false), MIPS not supported
	MaxWorkers:  m.Shape[0],  // Specify MaxWorkers (default = n_cpu_cores)
	SIMD: true //Use SIMD operations, uses float32, it will cast you floats to float32 if using float64
}

New Query

Seach.Query uses the address of a 1D Tensor, so it can be quickly changed for a new iteration.

for query := range all_queries {
	s.Query = query
	nn, _ := s.L1(2)
}

Example using OpenAI Ada (L1)

package main

import (
	"context"
	"fmt"
	"log"

	"github.com/cartersusi/go-knn"
	openai "github.com/sashabaranov/go-openai"
)

func main() {
	openai_token := "my-key"
	client := openai.NewClient(openai_token)

	sentences := []string{
		"The sailor enjoys sailing on a boat in the sea.",
		"The carpenter enjoys building houses with wood.",
		"The athlete enjoys running on the track.",
		"The chef enjoys cooking in the kitchen.",
		"The doctor enjoys helping patients in the hospital.",
		"The scientist enjoys conducting experiments in the laboratory.",
		"The teacher enjoys teaching students in the classroom.",
		"The artist enjoys painting in the studio.",
	}

	query_sentence := "I am a scientist who enjoys fishing when I'm not in the lab."

	var vector []float32
	var matrix [][]float32

	for _, sentence := range sentences {
		queryReq := openai.EmbeddingRequest{
			Input: []string{sentence},
			Model: openai.AdaEmbeddingV2,
		}

		queryResponse, err := client.CreateEmbeddings(context.Background(), queryReq)
		if err != nil {
			log.Fatal("Error creating query embedding:", err)
		}

		matrix = append(matrix, queryResponse.Data[0].Embedding)
	}

	queryReq := openai.EmbeddingRequest{
		Input: []string{query_sentence},
		Model: openai.AdaEmbeddingV2,
	}

	queryResponse, err := client.CreateEmbeddings(context.Background(), queryReq)
	if err != nil {
		log.Fatal("Error creating target embedding:", err)
	}

	vector = queryResponse.Data[0].Embedding

	d := &knn.Tensor[float32]{}
	d.New(matrix)
	
	q := &knn.Tensor[float32]{}
	q.New(vector)

	s := &knn.Search[float32]{
		Data:        d,
		Query:       q,
		SIMD:        true,
		Multithread: true,
	}

	nn, err := s.L1(2)
	if err != nil {
		fmt.Println(err)
	}

	fmt.Println("Query Sentence:", query_sentence)
	fmt.Println("Nearerst neighbor[0]:", sentences[nn.Indices[0]])
	fmt.Println("Nearerst neighbor[1]:", sentences[nn.Indices[1]])
	fmt.Println("Indices:", nn.Indices)
	fmt.Println("Values:", nn.Values)
}

Runtimes (M1 Mac)

Matrix Shape [10000 10000]
Vector Shape [10000 0]

SIMD Multithread: 41.213166ms
SIMD : 223.483208ms
Unrolled Multithread: 111.073541ms
Unrolled: 627.538667ms

SIMD Multithread
Indices: [5833 2932]
Values: [3226.6335 3252.8088]

SIMD
Indices: [5833 2932]
Values: [3226.6335 3252.8088]

Unrolled Multithread
Indices: [5833 2932]
Values: [3226.6335 3252.8088]

Unrolled
Indices: [5833 2932]
Values: [3226.6335 3252.8088]

Sources:

TPU-KNN: K Nearest Neighbor Search at Peak FLOP/s

go-simd

slow-to-simd

Name		Name	Last commit message	Last commit date
Latest commit History 62 Commits
.github/workflows		.github/workflows
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
go.mod		go.mod
go.sum		go.sum
knn.go		knn.go
knn_test.go		knn_test.go
logger.go		logger.go
mat.go		mat.go
mat_test.go		mat_test.go
results.go		results.go
results_test.go		results_test.go
tensor.go		tensor.go
tensor_test.go		tensor_test.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Go KNN

TODO

Installation

Usage

Importing

Creating Tensors

Searching

Example using OpenAI Ada (L1)

Runtimes (M1 Mac)

Sources:

About

Releases 14

Packages

Languages

License

cartersusi/go-knn

Folders and files

Latest commit

History

Repository files navigation

Go KNN

TODO

Installation

Usage

Importing

Creating Tensors

Searching

Example using OpenAI Ada (L1)

Runtimes (M1 Mac)

Sources:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 14

Packages 0

Languages

Packages