11-optimize.md 9.3 KB
Newer Older
1 2 3 4
---
title: Performance Optimization
---

5
After a TDengine cluster has been running for a long enough time, because of data insertion, table deletion and deletion of expired data, there may be fragments in data files and query performance may be impacted. To resolve the problem of fragments, since version 2.1.3.0 a new SQL command `COMPACT` can be used to defragment data files.
6 7 8 9 10

```sql
COMPACT VNODES IN (vg_id1, vg_id2, ...)
```

11
`COMPACT` can be used to defragment one or more vgroups. The defragmentation work will be scheduled in the task queue for execution by TDengine. `SHOW VGROUPS` command can be used to get the vgroup ids to be used in `COMPACT` command. There is a column `compacting` in the output of `SHOW GROUPS` to indicate the compaction status of the vgroup: 2 means the vgroup is waiting in task queue for compaction, 1 means compaction is in progress, and 0 means the vgroup has not been scheduled for compaction.
12

13
Please note that a lot of disk I/O is required for defragementation operations. During defragmentation the performance may be impacted significantly for data insertion and query. Data insertion may even be blocked for very short periods, in extreme cases.
14 15 16

## Optimize Storage Parameters

17
The data in different use cases may have different characteristics, such as the days to keep, number of replicas, collection interval, record size, number of collection points, compression or not, etc. To achieve best efficiency in storage, the parameters in the table below can be used. All of them can either be configured in `taos.cfg`, as default parameters, or can be set in the command `create database`. For detailed definition of these parameters please refer to [Configuration Parameters](/reference/config/).
18 19 20 21 22 23 24

| #   | Parameter | Unit | Definition                                                                     | **Value Range**                                                                                 | **Default Value** |
| --- | --------- | ---- | ------------------------------------------------------------------------------ | ----------------------------------------------------------------------------------------------- | ----------------- |
| 1   | days      | Day  | The time range of the data stored in a single data file                        | 1-3650                                                                                          | 10                |
| 2   | keep      | Day  | The number of days the data is kept in the database                            | 1-36500                                                                                         | 3650              |
| 3   | cache     | MB   | The size of each memory block                                                  | 1-128                                                                                           | 16                |
| 4   | blocks    | None | The number of memory blocks used by each vnode                                 | 3-10000                                                                                         | 6                 |
25
| 5   | quorum    | None | The number of required confirmations in case of multiple replicas               | 1-2                                                                                             | 1                 |
26
| 6   | minRows   | None | The minimum number of rows in a data file                                      | 10-1000                                                                                         | 100               |
27
| 7   | maxRows   | None | The maximum number of rows in a data file                                       | 200-10000                                                                                       | 4096              |
28 29
| 8   | comp      | None | Whether to compress the data                                                   | 0:uncompressed; 1: One Phase compression; 2: Two Phase compression                             | 2                 |
| 9   | walLevel  | None | wal sync level (named as "wal" in create database )                            | 1:wal enabled without fsync; 2:wal enabled with fsync                                         | 1                 |
30
| 10  | fsync     | ms   | The time to wait for invoking fsync when walLevel is set to 2; 0 means no wait | 0-3000                                                                                            |
31 32
| 11  | replica   | none | The number of replications                                                     | 1-3                                                                                             | 1                 |
| 12  | precision | none | Time precision                                                                 | ms: millisecond; us: microsecond;ns: nanosecond                                                 | ms                |
33
| 13  | update    | none | Whether to allow updating data                                                 | 0: not allowed; 1: a whole row must be updated; 2: a portion of columns in a row can be updated | 0                 |
34 35
| 14  | cacheLast | none | Whether the latest data of a table is cached in memory                         | 0: not cached; 1: the last row is cached; 2: the latest non-NULL value of each column is cached | 0                 |

36
Even for a specific use case, there may be multiple kinds of data with different characteristics. In this case it's best to put data with the same characteristics in the same database. There may be multiple databases in a system and each database can be configured with different storage parameters to achieve the best performance. The above parameters can be used when creating a database to override the default setting in the configuration file.
37 38 39 40 41

```sql
 CREATE DATABASE demo DAYS 10 CACHE 32 BLOCKS 8 REPLICA 3 UPDATE 1;
```

42
The above SQL statement creates a database named `demo`, in which each data file stores 10 days of data, the size of each memory block is 32 MB and 8 blocks are allocated to each vnode, there are 3 replicas and update operations are allowed. All other parameters not specified in the command, will default to the values in the configuration file `taos.cfg`.
43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69

Once a database is created, only some parameters can be changed and be effective immediately while others are can't.

| **Parameter** | **Alterable** | **Value Range**  | **Syntax**                             |
| ------------- | ------------- | ---------------- | -------------------------------------- |
| name          |               |                  |                                        |
| create time   |               |                  |                                        |
| ntables       |               |                  |                                        |
| vgroups       |               |                  |                                        |
| replica       | **YES**       | 1-3              | ALTER DATABASE <dbname\> REPLICA _n_   |
| quorum        | **YES**       | 1-2              | ALTER DATABASE <dbname\> QUORUM _n_    |
| days          |               |                  |                                        |
| keep          | **YES**       | days-365000      | ALTER DATABASE <dbname\> KEEP _n_      |
| cache         |               |                  |                                        |
| blocks        | **YES**       | 3-1000           | ALTER DATABASE <dbname\> BLOCKS _n_    |
| minrows       |               |                  |                                        |
| maxrows       |               |                  |                                        |
| wal           |               |                  |                                        |
| fsync         |               |                  |                                        |
| comp          | **YES**       | 0-2              | ALTER DATABASE <dbname\> COMP _n_      |
| precision     |               |                  |                                        |
| status        |               |                  |                                        |
| update        |               |                  |                                        |
| cachelast     | **YES**       | 0 \| 1 \| 2 \| 3 | ALTER DATABASE <dbname\> CACHELAST _n_ |

**Explanation:** Prior to version 2.1.3.0, `taosd` server process needs to be restarted for these parameters to take in effect if they are changed using `ALTER DATABASE`.

70
When trying to join a new dnode into a running TDengine cluster, all the parameters related to the cluster in the new dnode configuration must be consistent with the cluster, otherwise it can't join the cluster. The parameters that are checked when joining a dnode are listed below. For detailed definition of these parameters please refer to [Configuration Parameters](/reference/config/).
71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92

- numOfMnodes
- mnodeEqualVnodeNum
- offlineThreshold
- statusInterval
- maxTablesPerVnode
- maxVgroupsPerDb
- arbitrator
- timezone
- balance
- flowctrl
- slaveQuery
- adjustMaster

For the convenience of debugging, the log setting of a dnode can be changed temporarily. The temporary change will be lost once the server is restarted.

```sql
ALTER DNODE <dnode_id> <config>
```

- dnode_id: from output of "SHOW DNODES"
- config: the parameter to be changed, as below
93
  - resetlog: close the old log file and create the new one
94 95
  - debugFlag: 131 (INFO/ERROR/WARNING), 135 (DEBUG), 143 (TRACE)

96
For example:
97 98 99 100

```
alter dnode 1 debugFlag 135;
```