Skip to content

pyvec is a from-scratch vector database that uses TF-IDF and cosine similarity

License

Notifications You must be signed in to change notification settings

adammikulis/pyvec

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This program takes data and applies the Term Frequency/Inverse Document Frequency algorithm to vectorize text. It then allows the user to compare/query the text utilizing cosine similarity.

All test data is currently in the project, click Run All in the main notebook and query the database.

About

pyvec is a from-scratch vector database that uses TF-IDF and cosine similarity

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published