DingoDB

A distributed multi-modal vector database

DingoDB is a distributed multi-modal vector database. It combines the features of a data lake and a vector database, so it can store any type of data (key-value, PDF, audio, video, etc.) regardless of size. With DingoDB you can build your own Vector Ocean, the next-generation data architecture after the data warehouse and the data lake, as introduced by DataCanvas. This lets you analyze both structured and unstructured data in real time through a single SQL interface with low latency.

DingoDB Projects

Welcome to DingoDB. The documentation is available at https://dingodb.readthedocs.io. The main DingoDB projects are:

  • DingoDB: A unified SQL engine that parses and computes over both structured and unstructured data.
  • Dingo-Store: A strongly consistent distributed storage system based on the Raft protocol.
  • Dingo-Deploy: The deployment project for compute nodes and storage nodes.

Key Features

As a Distributed Vector Database for Any Data

  1. Stores a wide range of data types, including but not limited to embeddings, audio files, text, videos, images, PDFs, and annotations.
  2. Supports low-latency querying and vector search through a single SQL interface.
  3. Uses hybrid search over both structured and unstructured data, combining operations such as metadata queries and vector queries.
  4. Ingests data dynamically and builds the corresponding indexes in real time.
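
The hybrid search idea in item 3 can be illustrated with a small, self-contained sketch. This is not DingoDB's implementation or API: the row layout, the `hybrid_search` function, and the `kind` metadata field are all hypothetical, and a brute-force cosine ranking stands in for whatever index a real vector database would use. It only shows the shape of the operation, a metadata filter followed by a vector ranking.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def hybrid_search(rows, query_vector, predicate, top_k=2):
    """Hypothetical helper: filter rows on metadata, then rank by vector similarity."""
    # Step 1: structured (metadata) filter.
    candidates = [row for row in rows if predicate(row)]
    # Step 2: unstructured (vector) ranking, most similar first.
    ranked = sorted(
        candidates,
        key=lambda row: cosine_similarity(row["embedding"], query_vector),
        reverse=True,
    )
    return ranked[:top_k]


# Toy data; the field names are assumptions made for this example only.
ROWS = [
    {"id": 1, "kind": "pdf", "embedding": [1.0, 0.0]},
    {"id": 2, "kind": "audio", "embedding": [0.9, 0.1]},
    {"id": 3, "kind": "pdf", "embedding": [0.0, 1.0]},
    {"id": 4, "kind": "pdf", "embedding": [0.7, 0.7]},
]

result = hybrid_search(ROWS, [1.0, 0.0], lambda row: row["kind"] == "pdf")
result_ids = [row["id"] for row in result]
```

Here the audio row is excluded by the metadata predicate even though its embedding is close to the query, which is the point of combining the two kinds of condition in one query.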

As a Distributed Relational Database

  1. MySQL Compatibility: Built on the Apache Calcite SQL engine, DingoDB parses, optimizes, and executes standard SQL statements and can handle parts of the TPC-H and TPC-DS (see TPC) queries. It is compatible with MySQL Shell and the MySQL-JDBC-Driver client, so it integrates with web services, BI tools, and more.
  2. Supports High-Frequency Write Operations: Using Raft and the log-structured key-value store RocksDB, DingoDB can handle high-frequency INSERT, UPDATE, DELETE, and short-QUERY operations while maintaining strong data consistency.
  3. Facilitates Point Queries and Multi-dimensional Analysis Simultaneously: DingoDB can push down expressions to accelerate queries and carry out multi-dimensional analysis with low latency.
  4. Distributed Storage Capabilities: As a distributed storage engine, DingoDB can store very large volumes of data, and clusters can be scaled horizontally as the data grows.
  5. High Data Reliability and Recovery: Built on Raft, DingoDB manages multiple replicas of the data to provide high reliability. It maintains data consistency in the event of disk or machine failures and offers a fast automatic recovery mechanism.
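
To make the Raft-based reliability claim concrete, the sketch below works through the standard majority-quorum arithmetic that Raft relies on: a write is committed once a majority of replicas have it, so a group of n replicas stays available as long as a majority survives. The function names are illustrative, and the replica counts are examples only; the text above does not say how many replicas DingoDB uses.

```python
def quorum_size(replicas):
    """Smallest number of replicas that forms a majority."""
    if replicas < 1:
        raise ValueError("a Raft group needs at least one replica")
    return replicas // 2 + 1


def tolerated_failures(replicas):
    """How many replicas can fail while a majority is still reachable."""
    return replicas - quorum_size(replicas)


# Example replica counts (assumptions for illustration, not DingoDB defaults).
summary = {n: (quorum_size(n), tolerated_failures(n)) for n in (3, 4, 5)}
```

With three replicas the group tolerates one failure, and with five it tolerates two. Going from three to four replicas raises the quorum without tolerating any extra failure, which is why odd group sizes are the usual choice.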

Documentation

The documentation of DingoDB is located on the website: https://dingodb.readthedocs.io or in the docs/ directory of the source code.

Special Thanks

DingoDB is sponsored by DataCanvas, a platform for real-time data science and data processing.

Repositories

Showing 10 of 33 repositories
  • dingo Public

    A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ultra-low latency.

    Java 944 Apache-2.0 193 1 0 Updated Jul 16, 2024
  • pydingo Public

    Python SDK for DingoDB.

    Python 6 Apache-2.0 7 0 0 Updated Jul 16, 2024
  • curve Public Forked from ketor/curve

    Curve is a sandbox project hosted by the CNCF Foundation. It's cloud-native, high-performance, and easy to operate. Curve is an open-source distributed storage system for block and shared file storage.

    C++ 0 Apache-2.0 517 0 0 Updated Jul 16, 2024
  • brpc Public Forked from apache/brpc

    brpc is an industrial-grade RPC framework written in C++, often used in high-performance systems such as search, storage, machine learning, advertising, and recommendation. "brpc" means "better RPC".

    C++ 0 Apache-2.0 3,970 0 0 Updated Jul 16, 2024
  • dingo-store Public

    A distributed Key-Value Storage using Raft

    C++ 132 Apache-2.0 60 0 1 Updated Jul 15, 2024
  • dingo-sdk Public
    C++ 1 Apache-2.0 4 0 0 Updated Jul 15, 2024
  • glog Public Forked from google/glog

    C++ implementation of the Google logging module

    C++ 0 BSD-3-Clause 2,124 0 0 Updated Jul 13, 2024
  • dingo-perftest-pysdk Public

    Perf test for pyDingo

    0 Apache-2.0 0 0 0 Updated Jul 13, 2024
  • tantivy-search Public Forked from ketor/tantivy-search

    Tantivy C++ wrapper for DingoDB

    Rust 0 Apache-2.0 3 0 0 Updated Jul 11, 2024
  • tantivy Public Forked from quickwit-oss/tantivy

    Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust

    Rust 0 MIT 630 0 0 Updated Jul 11, 2024
