LOTC Time Synchronization

Contents

1Introduction
1.1Prerequisites
1.2Related Information
1.3Revision Information

2

Alarm Description

3

Procedure

Reference List

1   Introduction

This document is the Operating Instruction (OPI) for the alarm LOTC Time Synchronization.

Scope

This document covers the following topics:

Target Groups

This document is intended for personnel involved in alarm handling.

1.1   Prerequisites

This section describes the possible documents, tools, and conditions needed before performing steps to cease the alarm.

1.1.1   Documents

Not applicable.

1.1.2   Tools

Not applicable.

1.1.3   Conditions

Not applicable.

1.2   Related Information

The definition and explanation of acronyms and terminology, information about trademarks used, and typographic conventions can be found in the following documents:

1.3   Revision Information

Other than editorial changes, this document has been revised from revision C to D according to the following:

2   Alarm Description

The alarm is issued when the Network Time Protocol (NTP) server(s) cannot be contacted or if the local time is off by more than the threshold value of 10 seconds.

The following is as list of the alarm attributes:

Note:  
This view of the alarm attributes will be presented to the user from Common Operation and Maintenance (COM), only when the LDE adaptations for Component Based Architecture (CBA) have been installed and the LDE alarm model has been registered to COM.

Attribute Name

Attribute Value/Interpretation

Major Type

193

Minor Type

3341942785

Managed Object Class

SafNode

Specific Problem

LOTC Time Synchronization

Event Type

6(1)

Additional Information

Not applicable.

Perceived Severity

Critical: There are time differences between the blades in the cluster that exceeds the threshold value


Major: The configured ntp servers are not accepted by ntpd (unusable), no ntp server is reachable (unreachable) or a peer cannot be selected (rejected)


Minor: Some of the configured ntp servers are unreachable

(1)   Environmental


1. The possible cause for the Critical severity alarm is:

2. The possible causes for the Major severity alarm are:

As a result of the fault, the time within the cluster might not be synchronized.

3. The possible cause for the Minor severity alarm is:

Note:  
The initial synchronization time for a newly started or rebooted cluster can be up to 20 minutes.

Note:  
There may be a long delay before the configured ntp servers are reported as unreachable by the local ntpd. This delay depends on various polling times of ntpd. Therefore, any alarm will not be raised until after these polling periods have completely elapsed.

3   Procedure

To clear the alarm, perform the following steps:

  1. If the affected node is a payload node, go to Step 3.
  2. Check that the NTP server(s) listed in the cluster configuration are correct and have network connectivity from the cluster nodes.

    If the name or address of any NTP server must be updated, see the following document for further information about how to configure an NTP server:

  3. The NTP service will be restarted with the new servers when the lde-config --reload command is issued
  4. Wait 20 minutes.

    If the alarm ceases, exit this procedure.

  5. Reboot the affected node.
    Warning!

    As a consequence of a reboot, any application may lose sessions or traffic. Therefore, restart only one node at a time and only if the state of the cluster as whole is stable and running.

  6. Wait up to 20 minutes.
  7. If the alarm does not cease, contact the next level of maintenance support. Further actions are outside the scope of this operating instruction.

Reference List

[1] LDE Glossary of Terms and Acronyms, TERMINOLOGY, 1/0033-APR 901 0551/4
[2] LDE Trademark Information, LIST, 1/006 51-APR 901 0551/4
[3] Typographic Conventions, DESCRIPTION, 1/1551-FCK 101 05
[4] LDE Management Guide, USER GUIDE, 1/1553-CAA 901 2978/4