Operation¶
The following aspects apply to managing Trino Gateway and the connected Trino clusters.
Query History UI - check query plans etc.¶
Trino Gateway records history of recent queries and displays links to check query
details page in respective trino cluster.
Cluster UI - add and modify cluster information¶
The cluster page is used to configure Trino Gateway for multiple Trino clusters. Existing cluster information can also be modified using the edit button.
Graceful shutdown¶
Trino Gateway supports graceful shutdown of Trino clusters. Even when a cluster is deactivated, any submitted query states can still be retrieved based on the Query ID.
To graceful shutdown a trino cluster without query losses, the steps are:
- Deactivate the cluster by turning off the 'Active' switch. This ensures that no new incoming queries are routed to the cluster.
- Poll the Trino cluster coordinator URL until the queued query count and the running query count are both zero.
- Terminate the Trino coordinator and worker Java processes.
To gracefully shutdown a single worker process, refer to the Trino documentation for more details.
Query routing options¶
- The default router selects the cluster randomly to route the queries.
- If you want to route the queries to the least loaded cluster for a user
so the cluster with the fewest running or queued queries,
use
QueryCountBasedRouter
. You can enable it by adding the module name to themodules
section of the config file:
taskDelaySeconds
under
monitor
section in the config file. The default interval is 60
Monitoring ¶
Trino Gateway provides a metrics endpoint that uses the OpenMetrics format at
/metrics
. Use it to monitor Trino Gateway instances with Prometheus and
other compatible systems with the following Prometheus configuration:
Trino Gateway health endpoints¶
Trino Gateway provides two API endpoints to indicate the current status of the server:
/trino-gateway/livez
always returns status code 200, indicating the server is alive. However, it might not respond if the Trino Gateway is too busy, stuck, or taking a long time for garbage collection./trino-gateway/readyz
returns status code 200, indicating the server has completed initialization and is ready to serve requests. This means the initial connection to the database and the first round of health check on Trino clusters are completed. Otherwise, status code 503 is returned.