07-python.mdx 20.4 KB
Newer Older
1 2
---
title: TDengine Python Connector
D
danielclow 已提交
3 4
sidebar_label: Python
description: This document describes taospy, the TDengine Python connector.
5 6 7 8 9
---

import Tabs from "@theme/Tabs";
import TabItem from "@theme/TabItem";

10
`taospy` is the official Python connector for TDengine. taospy provides a rich API that makes it easy for Python applications to use TDengine. `taospy` wraps both the [native interface](/reference/connector/cpp) and [REST interface](/reference/rest-api) of TDengine, which correspond to the `taos` and `taosrest` modules of the `taospy` package, respectively.
11 12
In addition to wrapping the native and REST interfaces, `taospy` also provides a set of programming interfaces that conforms to the [Python Data Access Specification (PEP 249)](https://peps.python.org/pep-0249/). It is easy to integrate `taospy` with many third-party tools, such as [SQLAlchemy](https://www.sqlalchemy.org/) and [pandas](https://pandas.pydata.org/).

13
`taos-ws-py` is an optional package to enable using WebSocket to connect TDengine.
14

15
The direct connection to the server using the native interface provided by the client driver is referred to hereinafter as a "native connection"; the connection to the server using the REST or WebSocket interface provided by taosAdapter is referred to hereinafter as a "REST connection" or "WebSocket connection".
16

17
The source code for the Python connector is hosted on [GitHub](https://github.com/taosdata/taos-connector-python).
D
danielclow 已提交
18
## Supported platforms
19

20
- The [supported platforms](/reference/connector/#supported-platforms) for the native connection are the same as the ones supported by the TDengine client.
21 22 23 24
- REST connections are supported on all platforms that can run Python.

## Version selection

25
We recommend using the latest version of `taospy`, regardless of the version of TDengine.
26 27 28 29 30 31 32 33 34 35

## Supported features

- Native connections support all the core features of TDengine, including connection management, SQL execution, bind interface, subscriptions, and schemaless writing.
- REST connections support features such as connection management and SQL execution. (SQL execution allows you to: manage databases, tables, and supertables, write data, query data, create continuous queries, etc.).

## Installation

### Preparation

36
1. Install Python. The recent taospy package requires Python 3.6.2+. The earlier versions of taospy require Python 3.7+.  The taos-ws-py package requires Python 3.7+. If Python is not available on your system, refer to the [Python BeginnersGuide](https://wiki.python.org/moin/BeginnersGuide/Download) to install it.
37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54
2. Install [pip](https://pypi.org/project/pip/). In most cases, the Python installer comes with the pip utility. If not, please refer to [pip documentation](https://pip.pypa.io/en/stable/installation/) to install it.
If you use a native connection, you will also need to [Install Client Driver](/reference/connector#Install-Client-Driver). The client install package includes the TDengine client dynamic link library (`libtaos.so` or `taos.dll`) and the TDengine CLI.

### Install via pip

#### Uninstalling an older version

If you have installed an older version of the Python Connector, please uninstall it beforehand.

```
pip3 uninstall taos taospy
```

:::note
Earlier TDengine client software includes the Python connector. If the Python connector is installed from the client package's installation directory, the corresponding Python package name is `taos`. So the above uninstall command includes `taos`, and it doesn't matter if it doesn't exist.

:::

55
#### To install `taospy`
56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81

<Tabs>
<TabItem label="Install from PyPI" value="pypi">

Install the latest version of:

```
pip3 install taospy
```

You can also specify a specific version to install:

```
pip3 install taospy==2.3.0
```

</TabItem>
<TabItem label="Install from GitHub" value="github">

```
pip3 install git+https://github.com/taosdata/taos-connector-python.git
```

</TabItem>
</Tabs>

82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97
#### Install `taos-ws-py` (Optional)

The taos-ws-py package provides the way to access TDengine via WebSocket.

##### Install taos-ws-py with taospy

```bash
pip3 install taospy[ws]
```

##### Install taos-ws-py only

```bash
pip3 install taos-ws-py
```

D
danielclow 已提交
98
### Verify
99

G
gccgdb1234 已提交
100
<Tabs defaultValue="rest">
G
gccgdb1234 已提交
101
<TabItem value="native" label="native connection">
102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118

For native connection, you need to verify that both the client driver and the Python connector itself are installed correctly. The client driver and Python connector have been installed properly if you can successfully import the `taos` module. In the Python Interactive Shell, you can type.

```python
import taos
```

</TabItem>
<TabItem value="rest" label="REST connection">

For REST connections, verifying that the `taosrest` module can be imported successfully can be done in the Python Interactive Shell by typing.

```python
import taosrest
```

</TabItem>
119 120 121 122 123 124 125 126
<TabItem  value="ws" label="WebSocket connection">

For WebSocket connection, verifying that the `taosws` module can be imported successfully can be done in the Python Interactive Shell by typing.

```python
import taosws
```

127
</TabItem>
128 129 130 131 132 133 134 135 136
</Tabs>

:::tip
If you have multiple versions of Python on your system, you may have various `pip` commands. Be sure to use the correct path for the `pip` command. Above, we installed the `pip3` command, which rules out the possibility of using the `pip` corresponding to Python 2.x versions. However, if you have more than one version of Python 3.x on your system, you still need to check that the installation path is correct. The easiest way to verify this is to type `pip3 install taospy` again in the command, and it will print out the exact location of `taospy`, for example, on Windows.

```
C:\> pip3 install taospy
Looking in indexes: https://pypi.tuna.tsinghua.edu.cn/simple
Requirement already satisfied: taospy in c:\users\username\appdata\local\programs\python\python310\lib\site-packages (2.3.0)
D
danielclow 已提交
137
```
138 139 140

:::

D
danielclow 已提交
141
## Establishing a connection
142 143 144 145 146

### Connectivity testing

Before establishing a connection with the connector, we recommend testing the connectivity of the local TDengine CLI to the TDengine cluster.

G
gccgdb1234 已提交
147
<Tabs defaultValue="rest">
148 149
<TabItem value="native" label="native connection">

150
Ensure that the TDengine instance is up and that the FQDN of the machines in the cluster (the FQDN defaults to hostname if you are starting a stand-alone version) can be resolved locally, by testing with the `ping` command.
151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166

```
ping <FQDN>
```

Then test if the cluster can be appropriately connected with TDengine CLI:

```
taos -h <FQDN> -p <PORT>
```

The FQDN above can be the FQDN of any dnode in the cluster, and the PORT is the serverPort corresponding to this dnode.

</TabItem>
<TabItem value="rest" label="REST connection" groupId="connect">

167
For REST connections, make sure the cluster and taosAdapter component, are running. This can be tested using the following `curl ` command.
168 169 170 171 172 173 174 175 176 177

```
curl -u root:taosdata http://<FQDN>:<PORT>/rest/sql -d "select server_version()"
```

The FQDN above is the FQDN of the machine running taosAdapter, PORT is the port taosAdapter listening, default is `6041`.
If the test is successful, it will output the server version information, e.g.

```json
{
178 179 180 181 182 183 184 185 186 187 188 189 190
  "code": 0,
  "column_meta": [
    [
      "server_version()",
      "VARCHAR",
      7
    ]
  ],
  "data": [
    [
      "3.0.0.0"
    ]
  ],
191 192 193 194
  "rows": 1
}
```

195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216
</TabItem>
<TabItem value="ws" label="WebSocket connection" groupId="connect">

For WebSocket connection, make sure the cluster and taosAdapter component, are running. This can be testetd using the following `curl` command.

```
curl -i -N -d "show databases" -H "Authorization: Basic cm9vdDp0YW9zZGF0YQ==" -H "Connection: Upgrade" -H "Upgrade: websocket" -H "Host: <FQDN>:<PORT>" -H "Origin: http://<FQDN>:<PORT>" http://<FQDN>:<PORT>/rest/sql
```

The FQDN above is the FQDN of the machine running taosAdapter, PORT is the port taosAdapter listening, default is `6041`.

If the test is successful, it will output the server version information, e.g.

```json
HTTP/1.1 200 OK
Content-Type: application/json; charset=utf-8
Date: Tue, 21 Mar 2023 09:29:17 GMT
Transfer-Encoding: chunked

{"status":"succ","head":["server_version()"],"column_meta":[["server_version()",8,8]],"data":[["2.6.0.27"]],"rows":1}
```

217 218 219 220 221 222 223
</TabItem>
</Tabs>

### Using connectors to establish connections

The following example code assumes that TDengine is installed locally and that the default configuration is used for both FQDN and serverPort.

G
gccgdb1234 已提交
224
<Tabs defaultValue="rest">
225 226 227
<TabItem value="native" label="native connection" groupId="connect">

```python
D
dingbo 已提交
228
{{#include docs/examples/python/connect_native_reference.py}}
229 230 231 232 233 234 235 236
```

All arguments of the `connect()` function are optional keyword arguments. The following are the connection parameters specified.

- `host` : The FQDN of the node to connect to. There is no default value. If this parameter is not provided, the firstEP in the client configuration file will be connected.
- `user` : The TDengine user name. The default value is `root`.
- `password` : TDengine user password. The default value is `taosdata`.
- `port` : The starting port of the data node to connect to, i.e., the serverPort configuration. The default value is 6030, which will only take effect if the host parameter is provided.
wafwerar's avatar
wafwerar 已提交
237
- `config` : The path to the client configuration file. On Windows systems, the default is `C:\TDengine\cfg`. The default is `/etc/taos/` on Linux/macOS.
238 239 240 241 242 243 244 245 246 247 248 249 250 251 252
- `timezone` : The timezone used to convert the TIMESTAMP data in the query results to python `datetime` objects. The default is the local timezone.

:::warning
`config` and `timezone` are both process-level configurations. We recommend that all connections made by a process use the same parameter values. Otherwise, unpredictable errors may occur.
:::

:::tip
The `connect()` function returns a `taos.TaosConnection` instance. In client-side multi-threaded scenarios, we recommend that each thread request a separate connection instance rather than sharing a connection between multiple threads.

:::

</TabItem>
<TabItem value="rest" label="REST connection">

```python
D
dingbo 已提交
253
{{#include docs/examples/python/connect_rest_examples.py:connect}}
254 255 256 257
```

All arguments to the `connect()` function are optional keyword arguments. The following are the connection parameters specified.

B
Bo Ding 已提交
258
- `url`: The URL of taosAdapter REST service. The default is <http://localhost:6041>.
259 260
- `user`: TDengine user name. The default is `root`.
- `password`: TDengine user password. The default is `taosdata`.
D
danielclow 已提交
261
- `timeout`: HTTP request timeout. Enter a value in seconds. The default is `socket._GLOBAL_DEFAULT_TIMEOUT`. Usually, no configuration is needed.
262

263 264 265 266 267 268 269 270 271 272
</TabItem>

<TabItem value="websocket" label="WebSocket connection">

```python
{{#include docs/examples/python/connect_websocket_examples.py:connect}}
```

The parameter of `connect()` is the url of TDengine, and the protocol is `taosws` or `ws`.

273 274 275
</TabItem>
</Tabs>

D
danielclow 已提交
276
## Example program
277 278 279

### Basic Usage

G
gccgdb1234 已提交
280
<Tabs defaultValue="rest">
281 282 283 284 285 286 287
<TabItem value="native" label="native connection">

##### TaosConnection class

The `TaosConnection` class contains both an implementation of the PEP249 Connection interface (e.g., the `cursor()` method and the `close()` method) and many extensions (e.g., the `execute()`, `query()`, `schemaless_insert()`, and `subscribe()` methods).

```python title="execute method"
D
dingbo 已提交
288
{{#include docs/examples/python/connection_usage_native_reference.py:insert}}
289 290 291
```

```python title="query method"
D
dingbo 已提交
292
{{#include docs/examples/python/connection_usage_native_reference.py:query}}
293 294 295 296 297 298 299 300 301 302 303
```

:::tip
The queried results can only be fetched once. For example, only one of `fetch_all()` and `fetch_all_into_dict()` can be used in the example above. Repeated fetches will result in an empty list.
:::

##### Use of TaosResult class

In the above example of using the `TaosConnection` class, we have shown two ways to get the result of a query: `fetch_all()` and `fetch_all_into_dict()`. In addition, `TaosResult` also provides methods to iterate through the result set by rows (`rows_iter`) or by data blocks (`blocks_iter`). Using these two methods will be more efficient in scenarios where the query has a large amount of data.

```python title="blocks_iter method"
D
dingbo 已提交
304
{{#include docs/examples/python/result_set_examples.py}}
305 306 307 308 309 310
```
##### Use of the TaosCursor class

The `TaosConnection` class and the `TaosResult` class already implement all the functionality of the native interface. If you are familiar with the interfaces in the PEP249 specification, you can also use the methods provided by the `TaosCursor` class.

```python title="Use of TaosCursor"
D
dingbo 已提交
311
{{#include docs/examples/python/cursor_usage_native_reference.py}}
312 313 314 315 316 317 318 319 320 321 322 323
```

:::note
The TaosCursor class uses native connections for write and query operations. In a client-side multi-threaded scenario, this cursor instance must remain thread exclusive and cannot be shared across threads for use, otherwise, it will result in errors in the returned results.

:::

</TabItem>
<TabItem value="rest" label="REST connection">

##### Use of TaosRestCursor class

D
danielclow 已提交
324
The `TaosRestCursor` class is an implementation of the PEP249 Cursor interface.
325 326

```python title="Use of TaosRestCursor"
D
dingbo 已提交
327
{{#include docs/examples/python/connect_rest_examples.py:basic}}
328
```
D
danielclow 已提交
329
- `cursor.execute`: Used to execute arbitrary SQL statements.
330 331 332 333 334 335 336 337
- `cursor.rowcount` : For write operations, returns the number of successful rows written. For query operations, returns the number of rows in the result set.
- `cursor.description` : Returns the description of the field. Please refer to [TaosRestCursor](https://docs.taosdata.com/api/taospy/taosrest/cursor.html) for the specific format of the description information.

##### Use of the RestClient class

The `RestClient` class is a direct wrapper for the [REST API](/reference/rest-api). It contains only a `sql()` method for executing arbitrary SQL statements and returning the result.

```python title="Use of RestClient"
D
dingbo 已提交
338
{{#include docs/examples/python/rest_client_example.py}}
339 340 341 342
```

For a more detailed description of the `sql()` method, please refer to [RestClient](https://docs.taosdata.com/api/taospy/taosrest/restclient.html).

343 344 345 346 347 348
</TabItem>
<TabItem value="websocket" label="WebSocket connection">

```python
{{#include docs/examples/python/connect_websocket_examples.py:basic}}
```
D
danielclow 已提交
349

350 351
- `conn.execute`: can use to execute arbitrary SQL statements, and return the number of rows affected.
- `conn.query`: can use to execute query SQL statements, and return the query results.
D
danielclow 已提交
352

353 354 355
</TabItem>
</Tabs>

356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435
### Usage with req_id

By using the optional req_id parameter, you can specify a request ID that can be used for tracing.

<Tabs defaultValue="rest">
<TabItem value="native" label="native connection">

##### TaosConnection class

The `TaosConnection` class contains both an implementation of the PEP249 Connection interface (e.g., the `cursor()` method and the `close()` method) and many extensions (e.g., the `execute()`, `query()`, `schemaless_insert()`, and `subscribe()` methods).

```python title="execute method"
{{#include docs/examples/python/connection_usage_native_reference_with_req_id.py:insert}}
```

```python title="query method"
{{#include docs/examples/python/connection_usage_native_reference_with_req_id.py:query}}
```

:::tip
The queried results can only be fetched once. For example, only one of `fetch_all()` and `fetch_all_into_dict()` can be used in the example above. Repeated fetches will result in an empty list.
:::

##### Use of TaosResult class

In the above example of using the `TaosConnection` class, we have shown two ways to get the result of a query: `fetch_all()` and `fetch_all_into_dict()`. In addition, `TaosResult` also provides methods to iterate through the result set by rows (`rows_iter`) or by data blocks (`blocks_iter`). Using these two methods will be more efficient in scenarios where the query has a large amount of data.

```python title="blocks_iter method"
{{#include docs/examples/python/result_set_with_req_id_examples.py}}
```
##### Use of the TaosCursor class

The `TaosConnection` class and the `TaosResult` class already implement all the functionality of the native interface. If you are familiar with the interfaces in the PEP249 specification, you can also use the methods provided by the `TaosCursor` class.

```python title="Use of TaosCursor"
{{#include docs/examples/python/cursor_usage_native_reference_with_req_id.py}}
```

:::note
The TaosCursor class uses native connections for write and query operations. In a client-side multi-threaded scenario, this cursor instance must remain thread exclusive and cannot be shared across threads for use, otherwise, it will result in errors in the returned results.

:::

</TabItem>
<TabItem value="rest" label="REST connection">

##### Use of TaosRestCursor class

The `TaosRestCursor` class is an implementation of the PEP249 Cursor interface.

```python title="Use of TaosRestCursor"
{{#include docs/examples/python/connect_rest_with_req_id_examples.py:basic}}
```
- `cursor.execute`: Used to execute arbitrary SQL statements.
- `cursor.rowcount` : For write operations, returns the number of successful rows written. For query operations, returns the number of rows in the result set.
- `cursor.description` : Returns the description of the field. Please refer to [TaosRestCursor](https://docs.taosdata.com/api/taospy/taosrest/cursor.html) for the specific format of the description information.

##### Use of the RestClient class

The `RestClient` class is a direct wrapper for the [REST API](/reference/rest-api). It contains only a `sql()` method for executing arbitrary SQL statements and returning the result.

```python title="Use of RestClient"
{{#include docs/examples/python/rest_client_with_req_id_example.py}}
```

For a more detailed description of the `sql()` method, please refer to [RestClient](https://docs.taosdata.com/api/taospy/taosrest/restclient.html).

</TabItem>
<TabItem value="websocket" label="WebSocket connection">

```python
{{#include docs/examples/python/connect_websocket_with_req_id_examples.py:basic}}
```

- `conn.execute`: can use to execute arbitrary SQL statements, and return the number of rows affected.
- `conn.query`: can use to execute query SQL statements, and return the query results.

</TabItem>
</Tabs>

436 437
### Used with pandas

G
gccgdb1234 已提交
438
<Tabs defaultValue="rest">
439 440 441
<TabItem value="native" label="native connection">

```python
D
dingbo 已提交
442
{{#include docs/examples/python/conn_native_pandas.py}}
443 444 445 446 447 448
```

</TabItem>
<TabItem value="rest" label="REST connection">

```python
D
dingbo 已提交
449
{{#include docs/examples/python/conn_rest_pandas.py}}
450 451
```

452 453 454 455 456 457 458
</TabItem>
<TabItem value="websocket" label="WebSocket connection">

```python
{{#include docs/examples/python/conn_websocket_pandas.py}}
```

459 460 461 462 463 464 465 466 467 468 469
</TabItem>
</Tabs>

### Other sample programs

| Example program links | Example program content |
| ------------------------------------------------------------------------------------------------------------- | ------------------- ---- |
| [bind_multi.py](https://github.com/taosdata/taos-connector-python/blob/main/examples/bind-multi.py) | parameter binding, bind multiple rows at once |
| [bind_row.py](https://github.com/taosdata/taos-connector-python/blob/main/examples/bind-row.py) | bind_row.py
| [insert_lines.py](https://github.com/taosdata/taos-connector-python/blob/main/examples/insert-lines.py) | InfluxDB line protocol writing |
| [json_tag.py](https://github.com/taosdata/taos-connector-python/blob/main/examples/json-tag.py) | Use JSON type tags |
D
danielclow 已提交
470
| [tmq.py](https://github.com/taosdata/taos-connector-python/blob/main/examples/tmq.py)                         | TMQ subscription              |
471 472 473 474 475

## Other notes 

### Exception handling

476
All errors from database operations are thrown directly as exceptions and the error message from the database is passed up the exception stack. The application is responsible for exception handling. For example:
477 478

```python
D
dingbo 已提交
479
{{#include docs/examples/python/handle_exception.py}}
480 481 482 483
```

### About nanoseconds

484
Due to the current imperfection of Python's nanosecond support (see link below), the current implementation returns integers at nanosecond precision instead of the `datetime` type produced by `ms` and `us`, which application developers will need to handle on their own. And it is recommended to use pandas' to_datetime(). The Python Connector may modify the interface in the future if Python officially supports nanoseconds in full.
485 486 487 488 489 490 491 492 493 494 495 496

1. https://stackoverflow.com/questions/10611328/parsing-datetime-strings-containing-nanoseconds
2. https://www.python.org/dev/peps/pep-0564/

## Important Update

[**Release Notes**] (https://github.com/taosdata/taos-connector-python/releases)

## API Reference

- [taos](https://docs.taosdata.com/api/taospy/taos/)
- [taosrest](https://docs.taosdata.com/api/taospy/taosrest)
D
danielclow 已提交
497 498 499 500
  
## Frequently Asked Questions

Welcome to [ask questions or report questions](https://github.com/taosdata/taos-connector-python/issues).