|
Sign-In Sources Mailing lists Downloads Live collector Documentation Admins Contacts Welcome |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Documentation index Tabulations![]() Views are accessed through the tabulations in the site's pages header. Each view has a persistent filters bar and column selector (folded under the tabs), a column filters bar (top), and a tool bar (bottom). Persistent filters bar![]() The orange thick line under the tabs can be unfolded by clicking on it or through the ctrl+: or ctrl+; shortcuts. The filters set there are stored in database and are applied on all collector views. They persist across navigation sessions. Filters can apply on data not presented in the current view or embed hardcoded complex queries, like 'nodes with services'. ![]() Upon filter selection, a tag cloud appears to display all possible values of the filter. Clicking a tag will set the filter value box. The filter value follow the same syntax rules as the column filters. A filter/value is added by clicking on the '+' blue icon and removed by clicking on the 'x' red icon. Column selectorView columns can be toggled on and off from the column selector. Each view has a default set of column displayed. User column selection is stored in database, thus persistent across user sessions and different browsing devices. Column filters barEach column of the table supports filtering. Filters are applied upon <enter> in any filter box. Filter box expressions are and'ed. Filter syntax supports :
Service tabs![]() In all views, clicking on a service name will open the service tabs to display the most comprehensive data set the collector knows about this service. propertiesopensvc versionPushed to database daily by the opensvc nodeware cronjob. unackowledged errorsA count of the unacknowledged errors on this service found in the actions table. typeService type can be PRD, DEV, ... commentPushed to database daily by the opensvc nodeware cronjob. This information resides in the service env file on the nodes. last updateTimestamp updated upon service information receive from the nodeware cronjob. container nameHost Name of the virtual machine encapsulated in the service, if any. container typeVirtualisation driver to handle the encapsulated virtual machine, if any. responsiblesList of administrators contact names for this service. responsibles mailList of administrators contact emails for this service. primary nodeHost name of the node where the service should be running in optimal situation. nodesAll nodes where the service may be running in degraded situation (not in disaster recovery situation). drp nodeHost name of the node the service should be running in a disaster recovery situation. drp nodesAll nodes where the service may be running in disaster recovery situation. statusDisplay the per-node synthetic service status. resourcesDisplay the per-node per-resource detailled service status. envDisplay the raw service configuration file pushed by the nodeware. topology![]() Sketches service-nodes-storage-sites relationships Node tabs![]() In all views, clicking on a node name will open the node tabs to display the most comprehensive data set the collector knows about this node. server, os, mem, cpu, location, powerInformation displayed in these tabs are extracted from the asset table. Assets can be loaded from a tiers enterprise asset manager or filled-in manually. statsGenerate and display graphs of performance metrics on the user-selected time range. The metrics are gathered daily from tools like glance and sar running on the nodes. The opensvc nodeware relays this information to database. metricsMetrics collection depend on the operating system. Some of them are:
screencastTool barThe tool bar hosts generic tools and view-specific tools.
dashboardUsageThis page display all site's problems detected by the collector. The session filters applies, so administrators can only see the problems affecting the nodes and services they are responsible for. This is the one-stop page for administrators morning checks, and for production team health check. Layout:
services viewUsageThis view presents services health. Main usage:
Screenshot![]() FieldsContainer typeThe container type reflect the kind of service integration selected for the service. This setting is set in the service configuration file on the nodes as the mode parameter. The possible values are, as of January 2010 :
Responsible
Prefered nodeUpon node reboot, only the services whose prefered node is set to this node are started. During those services start-up, the nodeware verifies the service is not running on another node and aborts the start-up if so. Service typeThis property is used by the nodeware to sanity check actions asked by the admin.
AppThe application code is a way to group services dedicated/paid by some corporate entity or project. You can setup any application code you want in your services configuration file, using the app parameter. StatusStatus gizmos present the overall service health on top, and individual resource type status below. Color codes are:
ToolsActionsDashboard notifications![]() ![]() ![]() ![]() actions viewUsageThis view displays the journal of all services events with their status. Main usage:
Screenshot![]() FieldsServiceService name the action applied to. Private collectors usually report shortnames, shared collectors usually report service names with domain names. AppThe application code is a way to group services dedicated/paid by some corporate entity or project. You can setup any application code you want in your services configuration file, using the app parameter. Responsible
NodeNode where the action has be executed. The node name is a link to the asset view. ActionThe executed action name. An action usually aggregates a number of log lines, plus a line with no log message as a header. The default action view has an 'empty' log filter active so you are presented only actions without their logs. You can drill down a specific action by clicking on its pid.
Status
BeginBegin timestamp of the action. EndEnd timestamp of the action. PidProcess identifier of the session handling the action on the node. Click to active a pid filter with this value. LogThe action log as it is displayed on the node standart output. Dashboard notifications![]() checks viewUsageThe OpenSVC nodeware gathers check test values from the nodes and pushes them to the collector. Current implemented checks are:
More checks will be implemented when client need arise. Each check is validated against high and low threshold values. If the current check value is outside boundaries, the check raises a warning in the dashboard and the value and crossed threshold are highlighted red in the checks view. A check without value update for more than 24 hours will have its update date highlighted red. These checks instances should not be trusted. Screenshot![]() Dashboard notifications![]() Setting thresholdsCheck types have default high and low threshold values. These values can be customized for each node and instance of the check. Two customization methods are proposed:
Check default values can be adjusted by clicking on the check type in the dashboard warning line or in the checks view. ![]() packages viewUsageThe OpenSVC nodeware gathers installed package list from the nodes and pushes it to the collector. Centralized package lists allow administrators to easily:
Packages database entries not refreshed for more than 24 hours will have its update date highlighted red. Information for such a package should not be trusted. The toolbox of the node view proposes a tool to highlight package differences between selected nodes. Screenshot![]() Dashboard notifications![]() Clicking on the number of difference pops up the details. For exemple, on the BSD cluster above, the details show the lsof and opensvc packages are installed only on the 'bsd1' node. ![]() patches viewUsageThe OpenSVC nodeware gathers installed patches list from the nodes (Solaris) and pushes it to the collector. Centralized patches lists allow administrators to easily:
Patch database entries not refreshed for more than 24 hours will have its update date highlighted red. Information for such a patch should not be trusted. Screenshot![]() Service availabilityUsageThis view presents service availability data the collector collects. A chart helps assess the problems at a glance, and a table displays each unavailability time range and aggregate availability ratio on the selected period. The begin and end points of the analyzed period can be adjusted. Each unavailability time range can be either:
An acknowledged range will show the ack message, ack author and ack date when the cursor moves over the range dates. Clicking on a date will display a state transition graph to help determine which ressource caused to outage. Screenshot![]() operating system lifecycleUsageThis view charts the operating system dispatch of all the nodes it collects. A first chart presents the dispatch per operating system family. Subsequent charts focus on each family. These charts allow administrators and managers to analyse:
Screenshot![]() ![]() Obsolescence managementThe collector helps plan operating systems and hardware upgrades through constant notification of items beyond user-defined warning and alerting thresholds. This view is accessed through the 'admin' tab. Obsolescence setup![]() Operating systems and hardware are collected from the asset database. The refresh button parses the database in search of new items to add. The count of nodes matching each different os and hardware type is displayed in the 'count' column. Clicking on the count number displays the list of nodes. The 'Warn date' and 'Alert date' thresholds can be set by clicking on the table cell. Obsolescence notificationsThe notifications appear in the dashboard view. Missing obsolescence information![]() This notification table is displayed only to users with the 'Manager' role. It shows how many operating system and hardware type items have no warn and alert date set by the managers. Clicking on a number in the table will redirect to the obsolescence configuration view, with appropriate filters set. Nodes above obsolescence thresholds![]() Alerts and warnings are in separate tables. Operating systems and hardware are in separate tables. Empty lists are not displayed. |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Copyright © 2010 OpenSVC - Powered by web2py |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||