Setup Alerts and Notifications

Alerts and Notifications

You can set up Device42 Alerts to notify you about application, asset, software, discovery, and other conditions that affect your IT environment. Alerts can generate notifications within the Device42 UI or be integrated with external systems such as PagerDuty, Opsgenie, etc.

As of v16.14.00, Device42 has refined and improved its Alerts functionality.

Set Up Alerts

To create an alert, select Analytics > Setup Alerts from the Device42 menu bar and then click Add Alert on the Select Alert to view page.

  • Enter a Name for the alert, and then select the Rule Type for the alert. Each Rule Type has associated Triggers, which define the conditions that will generate an alert.
  • Select the Trigger you want for the Rule Type you selected.
  • Select the Alert Type.
    • If you select D42 Notification, select or add the Escalation Profile to use for the alert. You use Escalation Profiles to set up email notification escalation tiers. Select Tools > Admins & Permissions > Escalation Profiles to add or edit escalation profiles.
    • If you select Alert Integration, select or add the alert integration (for example, PagerDuty, Opsgenie, etc.) to use for the alert. Select Tools > Integrations > External Integrations to view or add external integrations. Also see https://www.device42.com/integrations/ for more information about Device42 integrations.

As you construct your alert, Device42 displays the alert definition based on the options you select or enter. The example below shows an Operating System Count alert that sends a notification to the Alert Group 1 escalation profile.

Depending on the alert Rule Type you select, you may also see an Apply To: option that you use to select the objects you want to apply the alert to. Use the Available objects list on the left to select the objects you want to add to the Chosen objects list on the right.

Additional Options

  • Priority — Critical, High Priority, Medium Priority, Low Priority
  • Resend Notification — Amount of time (in seconds) to wait before sending an email notification for the alert. Leave this field blank to send the email only once.
  • Seconds Before Action — Amount of time (in seconds) that the alert condition (the trigger) exists before the Device42 generates the alert.
  • Job Interval Factor Before Action — (Displayed for specific power alerts in place of Seconds Before Action.) Trigger the action only if the condition persists for the Job Interval Factor times the polling rate for associated jobs. Default value is 1.

  • Notes — Any notes about the alert.
  • Notification Template — Text fields you can use to create templates for the Subject and Message of the alert email.

When you are done, click Save at the bottom of the page to save the alert.

Alert Rule Types and Triggers

Business Application

  • New Affinity Group Dependency Found
  • Affinity Group Dependency Removed

CRAC (Computer Room Air Conditioner)

  • Inlet Temperature
  • Return Temperature

Discovery

  • Scan Alerts
  • Power Circuit Mismatch
  • Job failed to start, because it’s still running
  • Job failed to start, because system experienced a problem
  • Job failed to start on time

Operating System

  • Count
  • License Percent used

Part Model

  • Number Available
  • % Available

Power Circuit

  • Avg Watts
  • Avg Current
  • Avg Volts

Power Unit

  • Is Offline
  • Is Not Responding
  • Bank Power (A)
  • Bank % of Rated Power (A)
  • Infeed Power (A)
  • Infeed % of Rated Power (A)
  • Sensor humidity (percentage)
  • Sensor temperature (fahrenheit)
  • Sensor temperature (celsius)

Software

  • Count
  • License Percent Used
  • Prohibited Software Installed

Subnet

  • Percent Used

System

  • Days Until License Expiration
  • Percent of Licensed devices
  • Percent of Licensed IPs
  • Any Remote Collector Disconnected
  • Any WDS Disconnected

UPS

  • Battery % Capacity Remaining
  • Battery Time Remaining
  • Actual Voltage
  • Actual Current
  • Load %
  • A Battery is Low
  • A Battery is Depleted
  • A Battery Needs Replacing
  • A Battery Has a Fault

Notifications

If you select D42 Notification as the Alert Type when you create an alert, Device42 generates a notification when the alert condition triggers the alert. Device42 adds all notifications to the Notifications page.

The Device42 Dashboard now includes two links – My Notifications and All Notifications – to the Notifications page, and  there is a bell icon in the menu bar that animates when you receive new notifications. You can hover over the icon to see a list of recent notifications or jump the the Notifications page.

You can also select Analytics > Notifications from the menu bar to go to the Notifications page.

The page displays notifications for all triggered alerts whose Alert Type is Device42 Notification. Click My Notifications or All Notifications to see the notification lists.

Select the alerts you want and select the Action drop-down menu to perform the following action on the notifications

  • Delete selected Notifications – deletes the notification. If the condition reoccurs, Device42 generates a new notification.
  • Export selected items to CSV – create a notification CSV.
  • Acknowledge Alerts – acknowledge the alert that generated the notification. Any alert recipient can acknowledge alerts.
  • Clear Conditions – clear the alert condition that generated the notification. This indicates that the condition has been fixed and Device42 will generate a new alert and notification if the condition reoccurs. Note that only one notification is sent until the condition is marked as cleared.

Customizing Alerts with Notification Variables

The new alerts engine offers powerful new alerting variables that let you configure custom alert emails that can include useful data for each Alert Rule Type Device42 supports the following variables for embedding within alert emails when creating alerts.
You can use all the tags in both the alert message and subject line:
Alert %(sensor_rule_name)s triggered on RC# %(remote_collector_id)s
Rule Type Trigger parameters supported

Business Application

New Affinity Group Dependency Found %(from_model)s – from_model
%(to_model)s – to_model the device by default (for now we support only device-to-device connections in Business Apps)
%(from_name)s – from_name
%(to_name)s – to_name- the name of object (device)
%(business_app)s – the Business App name
Affinity Group Dependency Removed

CRAC

Inlet Temperature %(sensor_rule_id)s – Alert Rule ID
%(sensor_rule_name)s – Alert Rule Name
%(device_type)s – Device / PDU
%(device_name)s – Device name
%(device_id)s – Device ID
%(device_ip)s – Device IP
%(element_type)s – Infeed, output, bank, etc.
%(element_name)s – Name if available, if not – ID
%(element_id)s – Element ID
%(measure)s – Current/power, status, etc.
%(value)s – Value that triggered action
%(units)s – W(atts), Amps, etc.
%(status)s – Critcal/OK
%(remote_collector_id)s – RC ID
%(remote_collector_name)s – RC Name
%(date)s – Date when alert was triggered
Return Temperature

Discovery

Scan Alerts %(scan_type)s
%(interval)s
%(scanned_num)s
%(failed_num)s
%(failed_auth_num)s
Power Circuit Mismatch %(discovered_circuits_num)s
%(current_circuits_num)s
%(bcpm_name)s
Job failed to start,because it’s still running %(task_type)s
%(job_name)s – may be present if name of task is present during internal processing
%(rc_id)s
%(job_id)s
%(additional_info)s -displays additional error info, but only if available (not guaranteed to be there)
Job failed to start,because system experienced a problem %(task_type)s
%(rc_id)s
%(job_id)s
%(additional_info)s – displays additional error info, but only if available (not guaranteed to be there)
Job failed to start ontime %(job_name)s
%(job_id)s

Operating System

Count %(str_item_type)s – ‘Software’ or ‘OS’
%(value)s – real current value
%(element_name)s -Software name
%(license_key)s
%(operator)s – from Alert
%(trigger_value)s – value from Alert
License Percent Used %(str_item_type)s – ‘Software’ or ‘OS’
%(value)s – real current value
%(element_name)s -Software name
%(license_key)s
%(operator)s – from Alert
%(trigger_value)s – value from Alert

Part Model

Number Available %(type)s – type name
%(prop_name)s – name or model #
%(name)s- value of name or model #
%(value) – current value
%(operator) – alert condition operator
%(val)s- alert condition value
% Available %(type)s – type name
%(prop_name)s – name or model #
%(name)s- value of name or model #
%(value) – current value
%(operator) – alert condition operator
%(val)s- alert condition value

Power Circuit 

Avg Watts %(sensor_rule_id)s – Alert Rule ID
%(sensor_rule_name)s – Alert Rule Name
%(device_type)s – Device / PDU
%(device_name)s – Device name
%(device_id)s – Device ID
%(device_ip)s – Device IP
%(element_type)s – Infeed, output, bank, etc.
%(element_name)s – Name if available, if not – ID
%(element_id)s – Element ID
%(measure)s – Current/power, status, etc.
%(value)s – Value that triggered action
%(units)s – W(atts), Amps, etc.
%(status)s – Critcal/OK
%(remote_collector_id)s – RC ID
%(remote_collector_name)s – RC Name
%(date)s – Date when alert was triggered
Avg Currents
Avg Volts

Power Unit

Is Offline %(sensor_rule_id)s – Alert Rule ID
%(sensor_rule_name)s – Alert Rule Name
%(device_type)s – Device / PDU
%(device_name)s – Device name
%(device_id)s – Device ID
%(device_ip)s – Device IP
%(element_type)s – Infeed, output, bank, etc.
%(element_name)s – Name if available, if not – ID
%(element_id)s – Element ID
%(measure)s – Current/power, status, etc.
%(value)s – Value that triggered action
%(units)s – W(atts), Amps, etc.
%(status)s – Critcal/OK
%(remote_collector_id)s – RC ID
%(remote_collector_name)s – RC Name
%(date)s – Date when alert was triggered
Is Not Responding
Bank Power(A)
Bank % of Rated Power(A)
Infeed Power()
Infeed % of Rated Power()
Infeed Power(A)
Infeed % of Rated Power(A)
Sensor humidity(percentage)
Sensor temperature (fahrenheit)
Sensor temperature (celsius)

Software

Count %(str_item_type)s – ‘Software’ or ‘OS’
%(value)s – real current value
%(element_name)s -Software name
%(license_key)s
%(operator)s – from Alert
%(trigger_value)s – value from Alert
License Percent Used %(str_item_type)s – ‘Software’ or ‘OS’
%(value)s – real current value
%(element_name)s -Software name
%(license_key)s
%(operator)s – from Alert
%(trigger_value)s – value from Alert
Prohibited Software Installed %(software_name)s
%(device_name)s
%(user_name)s

Subnet

Persent used subnet name – %(subnet_name)s
usage percent – %(percent_used)s

System

Days Until License Expiration %(days_to_expiration)
Percent of Licensed devices %(device_used_percentage)s
%(device_used)s
%(total_device_licensed)
Percent of Licensed Ips %(ip_used_percentage)s
%(ip_used)s
%(total_ip_licensed)s
Any Remote Collector Disconnected %(remote_collector_name)s
Any WDS Disconnected %(wds_ip)s

UPS

Battery % Capacity Remaining %(sensor_rule_id)s – Alert Rule ID
%(sensor_rule_name)s – Alert Rule Name
%(device_type)s – Device / PDU
%(device_name)s – Device name
%(device_id)s – Device ID
%(device_ip)s – Device IP
%(element_type)s – Infeed, output, bank, etc.
%(element_name)s – Name if available, if not – ID
%(element_id)s – Element ID
%(measure)s – Current/power, status, etc.
%(value)s – Value that triggered action
%(units)s – W(atts), Amps, etc.
%(status)s – Critcal/OK
%(remote_collector_id)s – RC ID
%(remote_collector_name)s – RC Name
%(date)s – Date when alert was triggered
Battery Time Remaining
Actual Voltage
Actual Current
Load %
A Battery is Low
A Battery is Depleted
A Battery Needs Replacing
A Battery has a Fault