01-intro.md 13.1 KB
Newer Older
D
dingbo 已提交
1
---
D
dingbo 已提交
2
title: Introduction
D
dingbo 已提交
3 4 5
toc_max_heading_level: 2
---

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30
## TDengine Major Features

TDengine is a high-performance, scalable time-series database with SQL support. Its code, including its cluster feature is open source under GNU AGPL v3.0. Besides the database engine, it provides [caching](/develop/cache), [stream processing](/develop/continuous-query), [data subscription](/develop/subscribe)  and other functionalities to reduce the complexity and cost of development and operation. The major features are listed below:

1. Besides [using SQL to insert](/develop/insert-data/sql-writing),supports [Schemaless writing](/reference/schemaless/),and supports [InfluxDB LINE](/develop/insert-data/influxdb-line)[OpenTSDB Telnet](/develop/insert-data/opentsdb-telnet), [OpenTSDB JSON ](/develop/insert-data/opentsdb-json) and other protocols.
2. Support seamless integration with third-party data collection agent like [Telegraf](/third-party/telegraf)[Prometheus](/third-party/prometheus)[StatsD](/third-party/statsd)[collectd](/third-party/collectd)[icinga2](/third-party/icinga2), [Tcollector](/third-party/tcollector), [EMQ](/third-party/emq-broker), [HiveMQ](/third-party/hive-mq-broker). Without a line of code, those agents can write data points into TDengine just by configuration. 
3. Support [all kinds of queries](/query-data), including aggregation, nested query, downsampling, interpolation, etc.
4. Support [user defined functions](/develop/udf)
5. Support [caching](/develop/cache). TDengine always save the last data point in cache, so Redis is not needed in some scenarios.
6. Support [continuous query](/develop/continuous-query).
7. Support [data subscription](/develop/subscribe),and the filter condition can be specified.
8. Support [cluster](/cluster/), so it can gain more processing by adding more nodes. The high availability is supported by replication. 
9. Provide interactive [command line intrerface](/reference/taos-shell) for management, maintainence and ad-hoc query.
10. Provide many ways to [import](/operation/import), [export](/operation/export) data.
11. Provide [monitoring](/operation/monitor) on TDengine running instances.
12. Provide [connectors](/reference/connector/) for [C/C++](/reference/connector/cpp), [Java](/reference/connector/java), [Python](/reference/connector/python), [Go](/reference/connector/go), [Rust](/reference/connector/rust), [Node.js](/reference/connector/node) and other programming languages.
13. Provide [REST API](/reference/rest-api/).
14. Support the seamless integration with [Grafana](/third-party/grafana) for visualization.
15. Support seamless integration with Google Data Studio.

For more detailed features, please read through the whole document. 

## TDenginge Highlights

TDengine makes full use of the [characteristics of time series data](https://tdengine.com/2019/07/09/86.html), such as structured, no transaction, rarely delete or update, etc., and builds its own innovative storage engine and computing engine to differentiate itself from other TSDBs with the following advantages.
D
dingbo 已提交
31

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
32
- **High Performance**: TDengine outperforms other time series databases in data ingestion and querying while significantly reducing storage cost and compute costs, with an innovatively designed and purpose-built storage engine.
D
dingbo 已提交
33

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
34
- **Scalable**: TDengine provides out-of-box scalability and high-availability through its native distributed design. Nodes can be added through simple configuration to achieve greater data processing power. In addition, this feature is open source.
D
dingbo 已提交
35

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
36
- **SQL Support**: TDengine uses SQL as the query language, thereby reducing learning and migration costs, while adding SQL extensions to handle time-series data better, and supporting convenient and flexible schemaless data ingestion.
D
dingbo 已提交
37

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
38
- **All in One**: TDengine has built-in caching, stream processing and data subscription functions. It is no longer necessary to integrate Kafka/Redis/HBase/Spark or other software in some scenarios. It makes the system architecture much simpler, cost-effective and easier to maintain.
D
dingbo 已提交
39

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
40
- **Seamless Integration**: Without a single line of code, TDengine provide seamless, configurable integration with third-party tools such as Telegraf, Grafana, EMQX, Prometheus, StatsD, collectd, etc. More third-party tools are being integrated.
D
dingbo 已提交
41

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
42
- **Zero Management**: Installation and cluster setup can be done in seconds. Data partitioning and sharding are executed automatically. TDengine’s running status can be monitored via Grafana or other DevOps tools.
D
dingbo 已提交
43

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
44
- **Zero Learning Cost**: With SQL as the query language, support for ubiquitous tools like Python, Java, C/C++, Go, Rust, Node.js connectors, there is zero learning cost.
D
dingbo 已提交
45

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
46
- **Interactive Console**: TDengine provides convenient console access to the database to run ad hoc queries, maintain the database, or manage the cluster without any programming.
D
dingbo 已提交
47

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
48
With TDengine, the total cost of ownership of time-seriess data platform can be greatly reduced. Because 1: with its superior performance, the computing and storage resources are reduced significantly; 2:with SQL support, it can be seamlessly integrated with many third party tools, and learning cost/migration cost is reduced significantly; 3: with its simple architecture and zero management, the operation and maintainence cost is reduced. 
D
dingbo 已提交
49

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
50
## TDengine Technical Ecosystem
陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
51
In the time-series data processing platform, TDengine stands in a role like this diagram below:
D
dingbo 已提交
52

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
53
![TDengine Technical Ecosystem ](eco_system.png)
D
dingbo 已提交
54

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
55
<center>Figure 1. TDengine Technical Ecosystem</center>
D
dingbo 已提交
56

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
57 58
On the left side, there are data collection agents like OPC-UA, MQTT, Telegraf and Kafka. On the right side, visualization/BI tools, HMI, Python/R, IoT App can be connected. TDengine itself provides interactive command line interface and web interface for management and maintainence.

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
59
## Suited Scenarios for TDengine
D
dingbo 已提交
60

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
61
As a high-performance, scalable and SQL supported time-series database, TDengine's typical application scenarios include but are not limited to IoT, Industrial Internet, Connected Vehicles, IT operation and maintenance, energy, financial market and other fields. But you shall note that TDengine is a purpose-built database and does tons of optimization based on the characteristics of time series data, it cannot be used to process data from web crawlers, social media, e-commerce, ERP, CRM, etc. This section makes a more detailed analysis of the applicable scenarios.
D
dingbo 已提交
62

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
63
### Characteristics and Requirements of Data Sources
D
dingbo 已提交
64

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
65
From the perspective of data sources, designers can analyze the applicability of TDengine in target application systems as follows.
D
dingbo 已提交
66

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
67 68 69 70 71
| **Data Source Characteristics and Requirements**         | **Not Applicable** | **Might Be Applicable** | **Very Applicable** | **Description**                                              |
| -------------------------------------------------------- | ------------------ | ----------------------- | ------------------- | :----------------------------------------------------------- |
| A massive amount of total data                              |                    |                         | √                   | TDengine provides excellent scale-out functions in terms of capacity, and has a storage structure with matching high compression ratio to achieve the best storage efficiency in the industry.|
| Data input velocity is extremely high |                    |                         | √                   | TDengine's performance is much higher than that of other similar products. It can continuously process larger amounts of input data in the same hardware environment, and provides a performance evaluation tool that can easily run in the user environment. |
| A huge number of data sources                            |                    |                         | √                   | TDengine is optimized specifically for a huge number of data sources. It is especially suitable for efficiently ingesting, writing and querying data from billions of data sources. |
D
dingbo 已提交
72

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
73
### System Architecture Requirements
D
dingbo 已提交
74

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
75 76 77 78 79
| **System Architecture Requirements**              | **Not Applicable** | **Might Be Applicable** | **Very Applicable** | **Description**                                              |
| ------------------------------------------------- | ------------------ | ----------------------- | ------------------- | ------------------------------------------------------------ |
| A simple and reliable system architecture |                    |                         | √                   | TDengine's system architecture is very simple and reliable, with its own message queue, cache, stream computing, monitoring and other functions. There is no need to integrate any additional third-party products. |
| Fault-tolerance and high-reliability      |                    |                         | √                   | TDengine has cluster functions to automatically provide high-reliability and high-availability functions such as fault tolerance and disaster recovery. |
| Standardization support                    |                    |                         | √                   | TDengine supports standard SQL and also provides extensions specifically to analyze time-series data. |
D
dingbo 已提交
80

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
81
### System Function Requirements
D
dingbo 已提交
82

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
83 84 85 86
| **System Function Requirements**              | **Not Applicable** | **Might Be Applicable** | **Very Applicable** | **Description**                                              |
| ------------------------------------------------- | ------------------ | ----------------------- | ------------------- | ------------------------------------------------------------ |
| Complete data processing algorithms built-in |                    | √                    |                     | While TDengine implements various general data processing algorithms, industry specific algorithms and special types of processing will need to be implemented at the application level.|
| A large number of crosstab queries             |                    | √                    |                     | This type of processing is better handled by general purpose relational database systems but TDengine can work in concert with relational database systems to provide more complete solutions. |
D
dingbo 已提交
87

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
88
### System Performance Requirements
D
dingbo 已提交
89

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
90 91 92 93 94
| **System Performance Requirements**              | **Not Applicable** | **Might Be Applicable** | **Very Applicable** | **Description**                                              |
| ------------------------------------------------- | ------------------ | ----------------------- | ------------------- | ------------------------------------------------------------ |
| Very large total processing capacity     |                    |                      | √                   | TDengine’s cluster functions can easily improve processing capacity via multi-server coordination. |
| Extremely high-speed data processing           |                    |                      | √                   | TDengine’s storage and data processing are optimized for IoT, and can process data many times faster than similar products.|
| Extremely fast processing of fine-grained data |                    |                      | √                   | TDengine has achieved the same or better performance than other relational and NoSQL data processing systems. |
D
dingbo 已提交
95

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
96
### System Maintenance Requirements
D
dingbo 已提交
97

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
98 99 100 101 102
| **System Maintenance Requirements**              | **Not Applicable** | **Might Be Applicable** | **Very Applicable** | **Description**                                              |
| ------------------------------------------------- | ------------------ | ----------------------- | ------------------- | ------------------------------------------------------------ |
| Native high-reliability         |                    |                      | √                   | TDengine has a very robust, reliable and easily configurable system architecture to simplify routine operation. Human errors and accidents are eliminated to the greatest extent, with a streamlined experience for operators. |
| Minimize learning and maintenance costs |                    |                      | √                   | In addition to being easily configurable, standard SQL support and the Taos shell for ad hoc queries makes maintenance simpler, allows reuse and reduces learning costs.|
| Abundant talent supply               | √                  |                      |                     | Given the above, and given the extensive training and professional services provided by TDengine, it is easy to migrate from existing solutions or create a new and lasting solution based on TDengine.|
D
dingbo 已提交
103

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
104
## Benchmark comparision between TDengine and other databases
D
dingbo 已提交
105

陶建辉(Jeff)'s avatar
陶建辉(Jeff) 已提交
106 107 108 109 110 111
- [Writing Performance Comparison of TDengine and InfluxDB ](https://tdengine.com/2022/02/23/4975.html)
- [Query Performance Comparison of TDengine and InfluxDB](https://tdengine.com/2022/02/24/5120.html)
- [TDengine vs InfluxDB、OpenTSDB、Cassandra、MySQL、ClickHouse](https://www.tdengine.com/downloads/TDengine_Testing_Report_en.pdf)
- [TDengine vs OpenTSDB](https://tdengine.com/2019/09/12/710.html)
- [TDengine vs Cassandra](https://tdengine.com/2019/09/12/708.html)
- [TDengine vs InfluxDB](https://tdengine.com/2019/09/12/706.html)