Contents
Alerts and Notifications
You can set up Device42 Alerts to notify you about application, asset, software, discovery, and other conditions that affect your IT environment. Alerts can generate notifications within the Device42 UI or be integrated with external systems such as PagerDuty, Opsgenie, etc.
As of v16.14.00, Device42 has refined and improved its Alerts functionality.
Set Up Alerts
To create an alert, select Analytics > Setup Alerts from the Device42 menu bar and then click Add Alert on the Select Alert to view page.
- Enter a Name for the alert, and then select the Rule Type for the alert. Each Rule Type has associated Triggers, which define the conditions that will generate an alert.
- Select the Trigger you want for the Rule Type you selected.
- Select the Alert Type.
- If you select D42 Notification, select or add the Escalation Profile to use for the alert. You use Escalation Profiles to set up email notification escalation tiers. Select Tools > Admins & Permissions > Escalation Profiles to add or edit escalation profiles.
- If you select Alert Integration, select or add the alert integration (for example, PagerDuty, Opsgenie, etc.) to use for the alert. Select Tools > Integrations > External Integrations to view or add external integrations. Also see https://www.device42.com/integrations/ for more information about Device42 integrations.
As you construct your alert, Device42 displays the alert definition based on the options you select or enter. The example below shows an Operating System Count alert that sends a notification to the Alert Group 1 escalation profile.
Depending on the alert Rule Type you select, you may also see an Apply To: option that you use to select the objects you want to apply the alert to. Use the Available objects list on the left to select the objects you want to add to the Chosen objects list on the right.
Additional Options
- Priority — Critical, High Priority, Medium Priority, Low Priority
- Resend Notification — Amount of time (in seconds) to wait before sending an email notification for the alert. Leave this field blank to send the email only once.
- Seconds Before Action — Amount of time (in seconds) that the alert condition (the trigger) exists before the Device42 generates the alert.
- Job Interval Factor Before Action — (Displayed for specific power alerts in place of Seconds Before Action.) Trigger the action only if the condition persists for the Job Interval Factor times the polling rate for associated jobs. Default value is 1.
- Notes — Any notes about the alert.
- Notification Template — Text fields you can use to create templates for the Subject and Message of the alert email.
When you are done, click Save at the bottom of the page to save the alert.
Alert Rule Types and Triggers
Business Application
- New Affinity Group Dependency Found
- Affinity Group Dependency Removed
CRAC (Computer Room Air Conditioner)
- Inlet Temperature
- Return Temperature
Discovery
- Scan Alerts
- Power Circuit Mismatch
- Job failed to start, because it’s still running
- Job failed to start, because system experienced a problem
- Job failed to start on time
Operating System
- Count
- License Percent used
Part Model
- Number Available
- % Available
Power Circuit
- Avg Watts
- Avg Current
- Avg Volts
Power Unit
- Is Offline
- Is Not Responding
- Bank Power (A)
- Bank % of Rated Power (A)
- Infeed Power (A)
- Infeed % of Rated Power (A)
- Sensor humidity (percentage)
- Sensor temperature (fahrenheit)
- Sensor temperature (celsius)
Software
- Count
- License Percent Used
- Prohibited Software Installed
Subnet
- Percent Used
System
- Days Until License Expiration
- Percent of Licensed devices
- Percent of Licensed IPs
- Any Remote Collector Disconnected
- Any WDS Disconnected
UPS
- Battery % Capacity Remaining
- Battery Time Remaining
- Actual Voltage
- Actual Current
- Load %
- A Battery is Low
- A Battery is Depleted
- A Battery Needs Replacing
- A Battery Has a Fault
Notifications
If you select D42 Notification as the Alert Type when you create an alert, Device42 generates a notification when the alert condition triggers the alert. Device42 adds all notifications to the Notifications page.
The Device42 Dashboard now includes two links – My Notifications and All Notifications – to the Notifications page, and there is a bell icon in the menu bar that animates when you receive new notifications. You can hover over the icon to see a list of recent notifications or jump the the Notifications page.
You can also select Analytics > Notifications from the menu bar to go to the Notifications page.
The page displays notifications for all triggered alerts whose Alert Type is Device42 Notification. Click My Notifications or All Notifications to see the notification lists.
Select the alerts you want and select the Action drop-down menu to perform the following action on the notifications
- Delete selected Notifications – deletes the notification. If the condition reoccurs, Device42 generates a new notification.
- Export selected items to CSV – create a notification CSV.
- Acknowledge Alerts – acknowledge the alert that generated the notification. Any alert recipient can acknowledge alerts.
- Clear Conditions – clear the alert condition that generated the notification. This indicates that the condition has been fixed and Device42 will generate a new alert and notification if the condition reoccurs. Note that only one notification is sent until the condition is marked as cleared.
Customizing Alerts with Notification Variables
Rule Type | Trigger | parameters supported |
Business Application |
New Affinity Group Dependency Found | %(from_model)s – from_model %(to_model)s – to_model the device by default (for now we support only device-to-device connections in Business Apps) %(from_name)s – from_name %(to_name)s – to_name- the name of object (device) %(business_app)s – the Business App name |
Affinity Group Dependency Removed | ||
CRAC |
Inlet Temperature | %(sensor_rule_id)s – Alert Rule ID %(sensor_rule_name)s – Alert Rule Name %(device_type)s – Device / PDU %(device_name)s – Device name %(device_id)s – Device ID %(device_ip)s – Device IP %(element_type)s – Infeed, output, bank, etc. %(element_name)s – Name if available, if not – ID %(element_id)s – Element ID %(measure)s – Current/power, status, etc. %(value)s – Value that triggered action %(units)s – W(atts), Amps, etc. %(status)s – Critcal/OK %(remote_collector_id)s – RC ID %(remote_collector_name)s – RC Name %(date)s – Date when alert was triggered |
Return Temperature | ||
Discovery |
Scan Alerts | %(scan_type)s %(interval)s %(scanned_num)s %(failed_num)s %(failed_auth_num)s |
Power Circuit Mismatch | %(discovered_circuits_num)s %(current_circuits_num)s %(bcpm_name)s |
|
Job failed to start,because it’s still running | %(task_type)s %(job_name)s – may be present if name of task is present during internal processing %(rc_id)s %(job_id)s %(additional_info)s -displays additional error info, but only if available (not guaranteed to be there) |
|
Job failed to start,because system experienced a problem | %(task_type)s %(rc_id)s %(job_id)s %(additional_info)s – displays additional error info, but only if available (not guaranteed to be there) |
|
Job failed to start ontime | %(job_name)s %(job_id)s |
|
Operating System |
Count | %(str_item_type)s – ‘Software’ or ‘OS’ %(value)s – real current value %(element_name)s -Software name %(license_key)s %(operator)s – from Alert %(trigger_value)s – value from Alert |
License Percent Used | %(str_item_type)s – ‘Software’ or ‘OS’ %(value)s – real current value %(element_name)s -Software name %(license_key)s %(operator)s – from Alert %(trigger_value)s – value from Alert |
|
Part Model |
Number Available | %(type)s – type name %(prop_name)s – name or model # %(name)s- value of name or model # %(value) – current value %(operator) – alert condition operator %(val)s- alert condition value |
% Available | %(type)s – type name %(prop_name)s – name or model # %(name)s- value of name or model # %(value) – current value %(operator) – alert condition operator %(val)s- alert condition value |
|
Power Circuit |
Avg Watts | %(sensor_rule_id)s – Alert Rule ID %(sensor_rule_name)s – Alert Rule Name %(device_type)s – Device / PDU %(device_name)s – Device name %(device_id)s – Device ID %(device_ip)s – Device IP %(element_type)s – Infeed, output, bank, etc. %(element_name)s – Name if available, if not – ID %(element_id)s – Element ID %(measure)s – Current/power, status, etc. %(value)s – Value that triggered action %(units)s – W(atts), Amps, etc. %(status)s – Critcal/OK %(remote_collector_id)s – RC ID %(remote_collector_name)s – RC Name %(date)s – Date when alert was triggered |
Avg Currents | ||
Avg Volts | ||
Power Unit |
Is Offline | %(sensor_rule_id)s – Alert Rule ID %(sensor_rule_name)s – Alert Rule Name %(device_type)s – Device / PDU %(device_name)s – Device name %(device_id)s – Device ID %(device_ip)s – Device IP %(element_type)s – Infeed, output, bank, etc. %(element_name)s – Name if available, if not – ID %(element_id)s – Element ID %(measure)s – Current/power, status, etc. %(value)s – Value that triggered action %(units)s – W(atts), Amps, etc. %(status)s – Critcal/OK %(remote_collector_id)s – RC ID %(remote_collector_name)s – RC Name %(date)s – Date when alert was triggered |
Is Not Responding | ||
Bank Power(A) | ||
Bank % of Rated Power(A) | ||
Infeed Power() | ||
Infeed % of Rated Power() | ||
Infeed Power(A) | ||
Infeed % of Rated Power(A) | ||
Sensor humidity(percentage) | ||
Sensor temperature (fahrenheit) | ||
Sensor temperature (celsius) | ||
Software |
Count | %(str_item_type)s – ‘Software’ or ‘OS’ %(value)s – real current value %(element_name)s -Software name %(license_key)s %(operator)s – from Alert %(trigger_value)s – value from Alert |
License Percent Used | %(str_item_type)s – ‘Software’ or ‘OS’ %(value)s – real current value %(element_name)s -Software name %(license_key)s %(operator)s – from Alert %(trigger_value)s – value from Alert |
|
Prohibited Software Installed | %(software_name)s %(device_name)s %(user_name)s |
|
Subnet |
Persent used | subnet name – %(subnet_name)s usage percent – %(percent_used)s |
System |
Days Until License Expiration | %(days_to_expiration) |
Percent of Licensed devices | %(device_used_percentage)s %(device_used)s %(total_device_licensed) |
|
Percent of Licensed Ips | %(ip_used_percentage)s %(ip_used)s %(total_ip_licensed)s |
|
Any Remote Collector Disconnected | %(remote_collector_name)s | |
Any WDS Disconnected | %(wds_ip)s | |
UPS |
Battery % Capacity Remaining | %(sensor_rule_id)s – Alert Rule ID %(sensor_rule_name)s – Alert Rule Name %(device_type)s – Device / PDU %(device_name)s – Device name %(device_id)s – Device ID %(device_ip)s – Device IP %(element_type)s – Infeed, output, bank, etc. %(element_name)s – Name if available, if not – ID %(element_id)s – Element ID %(measure)s – Current/power, status, etc. %(value)s – Value that triggered action %(units)s – W(atts), Amps, etc. %(status)s – Critcal/OK %(remote_collector_id)s – RC ID %(remote_collector_name)s – RC Name %(date)s – Date when alert was triggered |
Battery Time Remaining | ||
Actual Voltage | ||
Actual Current | ||
Load % | ||
A Battery is Low | ||
A Battery is Depleted | ||
A Battery Needs Replacing | ||
A Battery has a Fault |