Storage Engine, Unrepaired Data Inconsistency between Replicas, PLDB
Ericsson Centralized User Database

Contents

1Introduction
1.1Alarm Description
1.2Prerequisites

2

Procedure

Glossary

Reference List

1   Introduction

This instruction concerns alarm handling for the Storage Engine, Unrepaired Data Inconsistency between Replicas, PLDB alarm.

1.1   Alarm Description

This alarm is raised when the Data Repair procedure was invoked for the current Processing Layer Database (PLDB) master replica, but some of the inconsistencies between the current and the former master replicas have not been successfully repaired.

The alarm is issued in the following situations:

The possible alarm causes and the corresponding fault reasons, fault locations, and impacts are described in Table 1.

Table 1    Alarm Causes

Alarm Cause

Description

Fault Reason

Fault Location

Impact

Some of the detected data inconsistencies between the current and former PLDB master replicas could not be repaired.

Data Repair was executed with the identified LDAP entries that might not have been correctly replicated between the former and current master replicas, and it failed to repair some of these entries. These LDAP entries are recorded in the unrepaired log.

The repair of an entry may fail due to different reasons. These reasons are stated in the unrepaired log. The possible causes are that Data Repair was not able to repair at least one LDAP entry stored in the PLDB due to one of the following reasons:


  • The timestamp of the entry on the current master is later than the incident timestamp.

  • The entry has no timestamp.

  • The entry could not be fetched from the former master due to some errors.

  • The entry does not exist on the current master, and inserts are not performed.

  • Concurrent write access was detected between repair and provisioning or traffic through the CDC mechanism.

  • The repair process was interrupted.

  • LDAP error happened during repair (either during fetching the entry from the current master or updating it).

Current and former PLDB master replicas.

Some provisioning and traffic data updates may be missing in CUDB.

Incident timestamp refers to the time when the network incident, for example a network split or PLDB mastership change happened in the CUDB system. For more information, refer to CUDB Data Storage Handling, Reference [1].

CDC means Collision Detection Counter, refer to CUDB LDAP Interwork Description, Reference [2] for more information.

The following are the consequences for the node if the alarm is not acted upon:

The alarm attributes are listed and explained in Table 2.

Table 2    Alarm Attributes

Attribute Name

Attribute Value

Auto Cease

No

Module

STORAGE-ENGINE

Error Code

26

Time

Date when the alarm was raised.

Resource ID

.1.3.6.1.4.1.193.169.1.1.25.<TIMESTAMP>

Alarm Model Description

Unrepaired Data Inconsistency between Replicas, Storage Engine

Alarm Active Description

Storage Engine (PLDB): Unrepaired data inconsistency between replicas, major (task <TASKID>, blade <BLADE>)

ITU Alarm Event Type

processingErrorAlarm (4)

ITU Alarm Probable Cause

databaseInconsistency (160)

ITU Alarm Perceived Severity

(4) - Major

Originating Source IP

Node IP where the alarm was raised.

The alarm attributes are listed and explained in Table 2:

For further information about attribute descriptions, refer to CUDB Node Fault Management Configuration Guide, Reference [3]. The alarm must be cleared manually.

For the interpretation of the unrepaired logs, refer to CUDB Automatic Handling of Network Isolation Output Description, Reference [4].

1.2   Prerequisites

This section provides information on the documents, tools, and conditions that apply to the procedure.

1.2.1   Documents

Before starting this procedure, ensure that you have read the following documents:

1.2.2   Tools

Not applicable.

1.2.3   Conditions

Not applicable.

2   Procedure

Do the following:


Glossary

For the terms, definitions, acronyms, and abbreviations used in this document, refer to CUDB Glossary of Terms and Acronyms, Reference [5].


Reference List

CUDB Documents
[1] CUDB Data Storage Handling.
[2] CUDB LDAP Interwork Description.
[3] CUDB Node Fault Management Configuration Guide.
[4] CUDB Automatic Handling of Network Isolation Output Description.
[5] CUDB Glossary of Terms and Acronyms.
Other Ericsson Documents
[6] System Safety Information.
[7] Personal Health and Safety Information.