01-intro.md 10.2 KB
Newer Older
D
dingbo 已提交
1
---
陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
2 3
sidebar_label: Introduction
title: TDengine Introduction
D
dingbo 已提交
4 5 6
toc_max_heading_level: 2
---

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
7
TDengine is a high-performance, scalable time-series database with SQL support. Its code, including its cluster feature is open source under GNU AGPL v3.0. Besides the database engine, it provides caching, stream processing, data subscription and other functionalities to reduce the complexity and cost of development and operation. TDengine differentiates itself from other TSDBs with the following advantages.
D
dingbo 已提交
8

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
9
- **High Performance**: TDengine outperforms other time series databases in data ingestion and querying while significantly reducing storage cost and compute costs, with an innovatively designed and purpose-built storage engine.
D
dingbo 已提交
10

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
11
- **Scalable**: TDengine provides out-of-box scalability and high-availability through its native distributed design. Nodes can be added through simple configuration to achieve greater data processing power. In addition, this feature is open source.
D
dingbo 已提交
12

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
13
- **SQL Support**: TDengine uses SQL as the query language, thereby reducing learning and migration costs, while adding SQL extensions to handle time-series data better, and supporting convenient and flexible schemaless data ingestion.
D
dingbo 已提交
14

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
15
- **All in One**: TDengine has built-in caching, stream processing and data subscription functions. It is no longer necessary to integrate Kafka/Redis/HBase/Spark or other software in some scenarios. It makes the system architecture much simpler, cost-effective and easier to maintain.
D
dingbo 已提交
16

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
17
- **Seamless Integration**: Without a single line of code, TDengine provide seamless, configurable integration with third-party tools such as Telegraf, Grafana, EMQX, Prometheus, StatsD, collectd, etc. More third-party tools are being integrated.
D
dingbo 已提交
18

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
19
- **Zero Management**: Installation and cluster setup can be done in seconds. Data partitioning and sharding are executed automatically. TDengine’s running status can be monitored via Grafana or other DevOps tools.
D
dingbo 已提交
20

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
21
- **Zero Learning Cost**: With SQL as the query language, support for ubiquitous tools like Python, Java, C/C++, Go, Rust, Node.js connectors, there is zero learning cost.
D
dingbo 已提交
22

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
23
- **Interactive Console**: TDengine provides convenient console access to the database to run ad hoc queries, maintain the database, or manage the cluster without any programming.
D
dingbo 已提交
24

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
25
With TDengine, the total cost of ownership of time-seriess data platform can be greatly reduced. Because 1: with its superior performance, the computing and storage resources are reduced significantly; 2:with SQL support, it can be seamlessly integrated with many third party tools, and learning cost/migration cost is reduced significantly; 3: with its simple architecture and zero management, the operation and maintainence cost is reduced. 
D
dingbo 已提交
26

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
27
In the time-series data processing platform, TDengine stands in a role like this diagram below:
D
dingbo 已提交
28

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
29
![TDengine Technical Ecosystem ](eco_system.png)
D
dingbo 已提交
30

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
31
<center>Figure 1. TDengine Technical Ecosystem</center>
D
dingbo 已提交
32

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
33
## Suited Scenarios for TDengine
D
dingbo 已提交
34

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
35
As a high-performance, scalable and SQL supported time-series database, TDengine's typical application scenarios include but are not limited to IoT, Industrial Internet, Connected Vehicles, IT operation and maintenance, energy, financial market and other fields. But you shall note that TDengine is a purpose-built database and does tons of optimization based on the characteristics of time series data, it cannot be used to process data from web crawlers, social media, e-commerce, ERP, CRM, etc. This section makes a more detailed analysis of the applicable scenarios.
D
dingbo 已提交
36

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
37
### Characteristics and Requirements of Data Sources
D
dingbo 已提交
38

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
39
From the perspective of data sources, designers can analyze the applicability of TDengine in target application systems as follows.
D
dingbo 已提交
40

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
41 42 43 44 45
| **Data Source Characteristics and Requirements**         | **Not Applicable** | **Might Be Applicable** | **Very Applicable** | **Description**                                              |
| -------------------------------------------------------- | ------------------ | ----------------------- | ------------------- | :----------------------------------------------------------- |
| A massive amount of total data                              |                    |                         | √                   | TDengine provides excellent scale-out functions in terms of capacity, and has a storage structure with matching high compression ratio to achieve the best storage efficiency in the industry.|
| Data input velocity is extremely high |                    |                         | √                   | TDengine's performance is much higher than that of other similar products. It can continuously process larger amounts of input data in the same hardware environment, and provides a performance evaluation tool that can easily run in the user environment. |
| A huge number of data sources                            |                    |                         | √                   | TDengine is optimized specifically for a huge number of data sources. It is especially suitable for efficiently ingesting, writing and querying data from billions of data sources. |
D
dingbo 已提交
46

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
47
### System Architecture Requirements
D
dingbo 已提交
48

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
49 50 51 52 53
| **System Architecture Requirements**              | **Not Applicable** | **Might Be Applicable** | **Very Applicable** | **Description**                                              |
| ------------------------------------------------- | ------------------ | ----------------------- | ------------------- | ------------------------------------------------------------ |
| A simple and reliable system architecture |                    |                         | √                   | TDengine's system architecture is very simple and reliable, with its own message queue, cache, stream computing, monitoring and other functions. There is no need to integrate any additional third-party products. |
| Fault-tolerance and high-reliability      |                    |                         | √                   | TDengine has cluster functions to automatically provide high-reliability and high-availability functions such as fault tolerance and disaster recovery. |
| Standardization support                    |                    |                         | √                   | TDengine supports standard SQL and also provides extensions specifically to analyze time-series data. |
D
dingbo 已提交
54

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
55
### System Function Requirements
D
dingbo 已提交
56

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
57 58 59 60
| **System Function Requirements**              | **Not Applicable** | **Might Be Applicable** | **Very Applicable** | **Description**                                              |
| ------------------------------------------------- | ------------------ | ----------------------- | ------------------- | ------------------------------------------------------------ |
| Complete data processing algorithms built-in |                    | √                    |                     | While TDengine implements various general data processing algorithms, industry specific algorithms and special types of processing will need to be implemented at the application level.|
| A large number of crosstab queries             |                    | √                    |                     | This type of processing is better handled by general purpose relational database systems but TDengine can work in concert with relational database systems to provide more complete solutions. |
D
dingbo 已提交
61

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
62
### System Performance Requirements
D
dingbo 已提交
63

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
64 65 66 67 68
| **System Performance Requirements**              | **Not Applicable** | **Might Be Applicable** | **Very Applicable** | **Description**                                              |
| ------------------------------------------------- | ------------------ | ----------------------- | ------------------- | ------------------------------------------------------------ |
| Very large total processing capacity     |                    |                      | √                   | TDengine’s cluster functions can easily improve processing capacity via multi-server coordination. |
| Extremely high-speed data processing           |                    |                      | √                   | TDengine’s storage and data processing are optimized for IoT, and can process data many times faster than similar products.|
| Extremely fast processing of fine-grained data |                    |                      | √                   | TDengine has achieved the same or better performance than other relational and NoSQL data processing systems. |
D
dingbo 已提交
69

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
70
### System Maintenance Requirements
D
dingbo 已提交
71

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
72 73 74 75 76
| **System Maintenance Requirements**              | **Not Applicable** | **Might Be Applicable** | **Very Applicable** | **Description**                                              |
| ------------------------------------------------- | ------------------ | ----------------------- | ------------------- | ------------------------------------------------------------ |
| Native high-reliability         |                    |                      | √                   | TDengine has a very robust, reliable and easily configurable system architecture to simplify routine operation. Human errors and accidents are eliminated to the greatest extent, with a streamlined experience for operators. |
| Minimize learning and maintenance costs |                    |                      | √                   | In addition to being easily configurable, standard SQL support and the Taos shell for ad hoc queries makes maintenance simpler, allows reuse and reduces learning costs.|
| Abundant talent supply               | √                  |                      |                     | Given the above, and given the extensive training and professional services provided by TDengine, it is easy to migrate from existing solutions or create a new and lasting solution based on TDengine.|
D
dingbo 已提交
77

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
78
## Benchmark comparision between TDengine and other databases
D
dingbo 已提交
79

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
80 81 82 83 84 85
- [Writing Performance Comparison of TDengine and InfluxDB ](https://tdengine.com/2022/02/23/4975.html)
- [Query Performance Comparison of TDengine and InfluxDB](https://tdengine.com/2022/02/24/5120.html)
- [TDengine vs InfluxDB、OpenTSDB、Cassandra、MySQL、ClickHouse](https://www.tdengine.com/downloads/TDengine_Testing_Report_en.pdf)
- [TDengine vs OpenTSDB](https://tdengine.com/2019/09/12/710.html)
- [TDengine vs Cassandra](https://tdengine.com/2019/09/12/708.html)
- [TDengine vs InfluxDB](https://tdengine.com/2019/09/12/706.html)