simhash
Here are 59 public repositories matching this topic...
Dynatrace hash library for Java
-
Updated
Jul 16, 2024 - Java
Python web crawler designed to scrape websites
-
Updated
Jul 10, 2024 - Python
Implementacija algoritama predstavljenih na predmetu Analiza velikih skupova podataka (AVSP)
-
Updated
May 19, 2024 - Java
In this repository you can find an implementation of LSH (Local | Sensitive Hashing) and Finesse algorithms, designed to find similar data based on their hashes
-
Updated
Mar 22, 2024 - C++
Proof-of-concept for measuring similarity of phoneme sequences using locality sensitive hashing (LSH).
-
Updated
Jan 11, 2024 - Jupyter Notebook
Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.
-
Updated
Aug 28, 2023 - Python
🐾 Create a behavioral fingerprint based on your zsh command line history
-
Updated
Aug 14, 2023 - Python
Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics.
-
Updated
Jul 20, 2023 - Go
Locality Sensitive Hashing
-
Updated
Jul 12, 2023 - Rust
基于springboot和Google开源simhash算法实现的作业查重/抄袭检测/文本相似度分析可视化系统,,集成jplag、MOSS、singleCloud工具套件进行多方位查重 Ref: https://github.com/ALuShu/checksystem
-
Updated
Mar 9, 2023 - JavaScript
Find duplicate text files.
-
Updated
May 3, 2024 - Python
A library for cosine similarity & simhash calculation
-
Updated
Dec 30, 2022 - Elixir
SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation, Simhash and SimhashIndex
-
Updated
Nov 18, 2022 - Python
Improve this page
Add a description, image, and links to the simhash topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the simhash topic, visit your repo's landing page and select "manage topics."