Amazon OpenSearch Service clusters provide a wealth of operational metrics accessible by means of CloudWatch and the Amazon OpenSearch Service console to help efficient efficiency monitoring and alert creation. But, pinpointing resiliency and efficiency challenges inside your cluster can show daunting. The method of figuring out resource-intensive queries or understanding efficiency degradation tendencies might be time-consuming.
To handle these challenges, we launched Cluster insights, which presents a unified dashboard delivering curated insights together with actionable mitigation steps. The dashboard shows detailed metrics on the node, index, and shard ranges, coupled with a concise abstract of safety and resiliency finest practices to uphold peak resiliency and availability.
This weblog will information you thru organising and utilizing Cluster Insights, together with key options and metrics. By the conclusion, you’ll perceive easy methods to use Cluster insights to acknowledge and deal with efficiency and resiliency points inside your OpenSearch Service clusters.
Getting Began with Cluster insights
Cluster insights is accessible at no extra value to OpenSearch Service customers working OpenSearch model 2.17 or later. Accessing Cluster insights requires admin-level permissions on your OpenSearch area. Cluster insights is accessible solely by means of the OpenSearch UI. OpenSearch UI provides help to a number of knowledge sources, zero downtime upgrades on your dashboard expertise, and curated workspaces for efficient crew collaborations. You first must affiliate a knowledge supply (your clusters) with an OpenSearch UI utility. Detailed steps are described within the person information. Your OpenSearch UI console expertise will seem like following screenshots.
To entry Cluster insights utilizing the OpenSearch UI utility:
- Within the Amazon OpenSearch Service console, navigate to OpenSearch UI (Dashboards) and select the Software URL to entry your OpenSearch UI utility.

- OpenSearch UI utility, select the settings icon on the left-bottom nook, then select Knowledge administration.

- On the Knowledge administration overview web page, or beneath Handle knowledge within the left navigation, choose Cluster insights.

Cluster insights overview
The Cluster insights – Overview acts as a touchdown web page to point out well being and insights for all related OpenSearch domains. It’s organized into 5 sections:

- Present cluster standing – Shows cluster well being standing (Inexperienced, Yellow, and Crimson) in a donut chart.
- Insights pattern – Tracks concern patterns over the previous 30 days, serving to you determine rising issues and monitor decision progress. This pattern evaluation turns into significantly beneficial when monitoring the affect of operational adjustments or troubleshooting recurring points.
- Present open insights – Reveals the depend and severity breakdown of at present lively insights throughout your clusters.
- OpenSearch service clusters – Lists all domains with their important statistics similar to well being standing, insights depend, nodes, shards, and lively queries.
- Prime insights by severity – Prioritizes points that want rapid consideration. Every perception comes with a transparent description and particular suggestions, remodeling advanced monitoring knowledge into actionable duties. This prioritized view helps groups can deal with crucial points first, whether or not they’re addressing shard measurement issues, disk house points, or efficiency bottlenecks.
Collectively, these sections present a complete view of your OpenSearch Service infrastructure so you possibly can assess cluster well being, determine tendencies, and take motion on crucial points from a single dashboard.
Cluster well being
Once you select a selected cluster from the OpenSearch domains on the Cluster insights – Overview web page, you will notice cluster-specific particulars together with well being standing, lively insights, and efficiency metrics. The overview part shows cluster well being together with important metrics together with depend of shards, nodes, indices, and a complete doc measurement. You may as well assessment the configuration finest practices adopted by area throughout resiliency and safety areas.
The decrease part comprises a desk of actionable insights that presents an in depth view of present points. This desk mirrors the insights from the touchdown web page however focuses particularly on points affecting the chosen cluster. You may observe high-severity points similar to low disk house and shard depend issues, in addition to medium-severity considerations which will affect cluster efficiency.
Every perception entry serves as an interactive aspect – choosing any concern reveals an in-depth evaluation full with root trigger identification and particular remediation steps. The desk consists of necessary metadata similar to technology timestamps, severity ranges, advice counts, and present standing, so customers can prioritize and deal with points successfully.

Perception particulars
Each perception provides detailed evaluation and actionable suggestions. Take the Shard Depend perception for example: choosing it reveals a complete breakdown of the difficulty. You’ll see that your OpenSearch cluster has breached the variety of shards allowed on the nodes primarily based on its JVM heap measurement, together with an in depth checklist of affected assets.

The detailed view features a useful resource map that exactly identifies every impacted node and index, displaying crucial data similar to node IDs, shard counts, and the indices contributing to the difficulty.
The suggestions are organized into two ranges: cluster-level suggestions deal with total structure enhancements, similar to scaling your cluster or adjusting international shard allocation settings. Index-level suggestions present particular actions for particular person indices—for instance, you may see recommendations to maneuver idle shards to UltraWarm storage. These are shards with none search or indexing operations for the final 10 days and are a minimum of 5 days outdated, making them perfect candidates for heat storage to scale back the lively shard depend. All of this steering is accessible instantly inside the Cluster insights interface, eliminating the necessity to swap between totally different instruments or consoles.
Node, Index, Shard, and Question view
Subsequent to cluster well being, you possibly can assessment Node, Index, Shard, and Question particulars for a selected cluster. These views current crucial metrics similar to useful resource (CPU, reminiscence, disk) utilization, search and index latency.
Node view
The Node view tab supplies a complete view of particular person node efficiency throughout your cluster. This desk shows crucial metrics for every node together with warmth rating indicating total node well being, useful resource utilization (CPU, reminiscence, disk), search and indexing latency and charges, together with fast hyperlinks to view high N shards and queries working on every node.
This view helps you determine nodes experiencing excessive useful resource utilization or efficiency degradation. You may drill deeper into every node by clicking on the node ID to view detailed time-based metrics displaying useful resource utilization tendencies over time. Moreover, you possibly can click on the highest N shards hyperlink to navigate on to the Shard View, mechanically filtered to point out solely the shards working on the chosen node, permitting you to pinpoint which particular shards are contributing to efficiency points.

Index view
The Index view tab reveals efficiency metrics aggregated on the index degree. For every index, you possibly can monitor doc depend and storage measurement, search latency and price, indexing latency and price, and entry high N queries affecting the index. This angle is efficacious for understanding which indices are driving cluster load and figuring out optimization alternatives on the index configuration degree.

Shard view
The Shard view tab provides essentially the most granular view of cluster efficiency by displaying metrics for particular person shards. Every row reveals shard ID and its assigned node, index affiliation and useful resource strain metrics (CPU, reminiscence), together with search and indexing latency per shard. This detailed view lets you pinpoint particular shards inflicting efficiency points, determine shard placement imbalances, and take focused remediation actions.

Question view
The Question view on the Cluster insights web page solves presents reside dashboards that break down execution stats, CPU and reminiscence utilization, and completion progress for each question. This helps monitor which queries are driving the largest useful resource consumption (the Prime-N queries). With intuitive donut charts and scoreboards displaying distribution by node, index, and person, this interface helps operators to rapidly pinpoint efficiency bottlenecks and heavy workloads, supporting focused optimization and assured scaling selections.

Question insights
Along with Cluster insights, you may as well get Question insights to view the precise queries working and latencies throughout Increase, Question, and Fetch phases that gives beneficial insights for search builders to additional fine-tune their queries.

Conclusion
Cluster insights transforms OpenSearch Service cluster administration from reactive troubleshooting to proactive optimization. By offering unified dashboards with warmth rating, and finest practices throughout stability, resiliency, and safety pillars, it provides visibility into your search infrastructure on the account degree.
The actionable suggestions and step-by-step remediation steering assist customers of all expertise ranges successfully resolve advanced points like shard imbalances and useful resource bottlenecks.
The combination with Question insights delivers real-time visibility into useful resource consumption patterns in order that groups can determine and optimize performance-critical queries by means of detailed profiling and latency evaluation.
For extra data, see the AWS OpenSearch Service Person Information for extra particulars.



