Roadmap 2023
This is the Databend roadmap for 2023 🚀, synced from #9448.
After a full year of research and development in 2022, Databend's functionality and stability improved significantly, and several users began running it in production. Databend has helped them greatly reduce costs and operational complexity.
Main tasks
Features
| Task | Status | Comments |
|---|---|---|
| Update #9261 | DONE | |
| Privileges | IN PROGRESS | |
| Alter table | IN PROGRESS | |
| Window function | PLAN | |
| Lambda function and high-order functions | PLAN | |
| TimestampTz data type | PLAN | |
| Materialized view | PLAN | |
| Support SET_VAR hints #8833 | PLAN | |
| Parquet reader | PLAN | |
| DataFrame | PLAN | |
| Data Sharing (community version) | IN PROGRESS | |
| Concurrent query enhancement | PLAN | |
| Distributed COPY #8594 | PLAN | |
| Support Decimal data type #2931 | PLAN | high-priority |
Improvements
| Task | Status | Comments |
|---|---|---|
| New expression #9411 | DONE | |
| Error message | PLAN | |
Planner
| Task | Status | Comments |
|---|---|---|
| Scalar expression normalization | PLAN | |
| Column constraint framework | PLAN | |
| Functional dependency framework #7438 | PLAN | |
| Join reorder | IN PROGRESS | |
| CBO | IN PROGRESS | |
| Support TPC-DS | PLAN | |
| Support optimization tracing | PLAN | Easy to debug/study. |
Cache
| Task | Status | Comments |
|---|---|---|
| Unified cache layer | IN PROGRESS | |
| Metadata cache | IN PROGRESS | |
| Index data cache | IN PROGRESS | |
| Block data cache | PLAN | |
Data Storage
| Task | Status | Comments |
|---|---|---|
| Fuse engine re-clustering | PLAN | |
| Fuse engine orphan data cleanup | PLAN | |
| Fuse engine segment tree | PLAN | Support large datasets (PB scale) in one table |
Distributed Query Execution
| Task | Status | Comments |
|---|---|---|
| Visualized profiling | IN PROGRESS | |
| Aggregation spilling | IN PROGRESS | |
Resource Quota
| Task | Status | Comments |
|---|---|---|
| Session-level quota control (CPU/Memory) | IN PROGRESS | |
| User-level quota control (CPU/Memory) | PLAN | |
Schema-Less Search
| Task | Status | Comments |
|---|---|---|
| JSON indexing | PLAN | high-priority |
| Fulltext index #3915 | PLAN | high-priority |
| Array functions #7931 | PLAN | high-priority |
| Faiss index #9699 | PLAN | |
LakeHouse
| Task | Status | Comments |
|---|---|---|
| Apache Hive | IN PROGRESS | |
| Apache Iceberg | IN PROGRESS | |
| Delta Lake | IN PROGRESS | |
| Querying external storage (Parquet) | IN PROGRESS | |
Integrations
| Task | Status | Comments |
|---|---|---|
| dbt integration | IN PROGRESS | |
| Airbyte integration | IN PROGRESS | |
| Datadog Vector integration with the Rust driver | IN PROGRESS | |
| DataX integration with the Java driver | IN PROGRESS | |
| CDC with Flink | PLAN | |
| CDC with Kafka | PLAN | |
Meta
| Task | Status | Comments |
|---|---|---|
| Jepsen test | IN PROGRESS | |
| Store membership in raft | PLAN | |
| Nonblocking snapshot building | PLAN | |
| Snapshot file format impl | PLAN | |
| Upgrade on-disk store format | PLAN | |
Testing
| Task | Status | Comments |
|---|---|---|
| SQLlogic Test | IN PROGRESS | Support more test cases |
| SQLancer Test | IN PROGRESS | Support more types and more cases |
| Fuzzer Test | PLAN | |