Abstract

The RAMS (reliability, availability, maintainability, and security) of the air braking system is an important indicator to measure the safety performance of the system; it can reduce the life cycle cost (LCC) of the rail transit system. Existing safety analysis methods are limited to the level of relatively simple factual descriptions and statistical induction, failing to provide a comprehensive safety evaluation on the basis of system structure and accumulated data. In this paper, a new method of safety analysis is described for the failure mode of the air braking system, GO-Bayes. This method combines the structural modeling of the GO method with the probabilistic reasoning of Bayes methods, introduces the probability into the analysis process of GO, performs reliability analysis of the air braking system, and builds a big data platform for the air braking system to guide the system maintenance strategy. An automatic train air braking system is taken as an example to verify the usefulness and accuracy of the proposed method. Using ExtendSim software shows the feasibility of the method and its advantages in comparison with fault tree analysis.

1. Introduction

Safety evaluation techniques of RAMS (reliability, availability, maintainability, and security) originated in the 1930s. As the awareness of the importance of safety has continued to grow, system safety engineering has become a mature discipline and has gradually been extended to the aviation, nuclear, petroleum, chemical, and general manufacturing industries. Many theories and methods have been proposed, such as safety checklists [1], safety analysis [2] and evaluation methods [3], event trees (ETs) [4], fault tree analysis (FTA) [5] and risk assessment techniques [6], mode evaluation, human factor analysis and classification system (HFACS) applied on UK rail safety [7], six-stage safety and other risk index evaluation methods, and artificial neural networks.

In the field of rail transit, RAMS [8] is widely used to analyze the reliability of equipment operation. In 1986, Sweden issued the first requirement for the RAMS index in the invitation to purchase high-speed trains, requiring suppliers to make commitments for their reliability, maintainability, and safety, and ensure that all indicators meet the specified value after being put into operation. In the 1970s, the failure data of the Shinkansen was analyzed in Japan, and the reliability of the locomotive was improved by improving the design. TGV trains in France have strict requirements on RAMS during the design phase, and their maintenance costs are decreasing year by year. Kang et al. [9] made an evaluation of the smooth state of RAMS on the high-speed railway track. Pu [10] introduces RAMS to the urban rail train monitoring system. Chen [11, 12] analyzed the key elements of RAMS management in the rail transit industry. Cooper and Herskovits [13] present a Bayes method for constructing probabilistic networks and make a credibility evaluation of the system. Dobrivoje et al. [14] qualitatively analyze the fault tree of the hydraulic booster in the commercial vehicle steering system. Holmes and Mallick [15] proposed a Bayes analysis of the piecewise-linear model. Adjusting the data yields a Bayes local linear model with predictive and local linear parameter distributions. Sun [16] analyzed the RAMS control measures taken at various stages of the signal system. Yasuoka et al. [17] explain how to use RAMS for railway signal system management. Yang et al. [18] introduced the RAMS analysis of the high-speed railway power supply system. The GO method is widely used in the RAMS analysis of rail transit systems [1821]; Huang et al. [22] apply RAMS to 9 electromechanical subsystems of Chengdu Rail transit line 1, but there are a few cases successfully used in air braking systems. Many scholars use the GO-FLOW method in the safety analysis of a high-speed railway [2328]. Some scholars apply big data to the operation and management of rail transit to guide maintenance strategies [2934].

In this paper, the GO-Bayes is proposed, its basic concepts are described systematically, structured bidirectional modeling, Bayes theory fusion, and various detection information, model-based inverse fault probability reasoning, and other method simulation using the Monte-Carlo method and through a subway transit air braking system example to verify its effectiveness and accuracy.

2. Train Air Braking System

Train air braking is also called a mechanical brake or friction brake. A complete brake system device consists of two parts: a brake control system and a brake actuation system. The brake control system consists of the brake signal generation and transmission device and the brake control device. The brake actuation system is usually called the basic brake device, with a brake shoe brake and a disc brake; there is no essential difference between them, and the way of modeling is the same; only the way of a working form is different. The system structure diagram is shown in Figure 1.

2.1. Basic Components of Air Braking System

A braking system’s basic components include an air compressor and filtration device (as shown by A5 in the schematic diagram in Figure 2, 1), a main duct, pneumatic suspension devices (L1, …), a braking airline and control system (B1, …, including a parking braking device B7 and a braking control section B13), main brakes (C1, …), and electronic antiskid devices (G1, …); the meaning of code letter is described in detail in Figure 3, 2.

The brake control system is composed of the following: A5, B2, B7.01, B15, B3.01, and L10—air feed and dust filtration device; B1, B5, B6, B11, B16, B19, B20, L3, L9, B7.02, and B7.07—stop valve; B8, B7.06, B13.04, B13.08, and B13.12—safety valve; U1—safety lift device for pantograph; B7—parking brake control device; B9—bake tank; B10—air evacuation valve; B13—common/emergency braking control unit; L1—pneumatic suspension devices; L5—pressure control valve; L6—hydrogen-oil differential pressure valve; L7—air spring; L8—dummy reservoir; G1—electronic antiskid devices; G2—measure speed gears; G3—speed transmitter; C1, C2, and C3—air hose; C4 and C5—damper brake shoe; C6 and C7—common/emergency braking basic unit; C8—parking brake basic device; B7.03—parking brake control magnetic valve; B7.04—double branch valve; B7.05 and B13.13—air switch; B13.02 and B13.03—common brake control magnetic valve; B13.05, B13.07, and B13.11—pressure sensor; B13.06—brake relay valve; B13.09—empty and load brake equipment; W1—linking equipment; and B13.10—emergency braking control magnetic valve. A dotted box represents a function unit; a solid box represents a control cabinet, as shown in Figure 3.

2.2. Air Braking System Failure Modes

The air braking system is composed of a series of components; these components in the long-term use of the process will have varying degrees of failure, such as air compressor failure, drying tower gas leakage, and subway vehicles in the operation of these faults. Record it so you can look through the table to get the probability of different parts failing. The failure rate is calculated as follows:

In the process of urban rail transit train operation, the maintenance management system records the relevant fault date, fault description and processing results, and other information, as shown in Table 1.

2.3. Problems in Train Air Braking System Safety Analysis

As we all know, vehicles have the characteristics of high speed, large passenger capacity, and complicated environment. According to the Guangzhou Rail transit’s operational failure statistics, among the numerous breakdowns in the vehicle, the bogies (33.3%) and the braking system (16.7%) have the highest failure frequency. As the key equipment of the vehicle, the braking system itself has the characteristics of frequentness and complexity. Once a failure occurs, it will directly affect the operation of the vehicle, even cause vehicle accidents. At present, the safety assessment of the vehicle braking system mainly adopts some traditional analysis methods, including the safety inspection table method, prehazard analysis (PHA) analysis, event tree analysis (ETA), failure mode and consequence analysis (FMEA) method, and fault tree analysis (FTA) methods, which are still traditional qualitative, quantitative, and combined methods; they are only limited to a simpler factual description and statistical induction level, and it is difficult to achieve comprehensive safety assessment from the system structure and information level.

Therefore, this paper introduces the Go-Bayes method to the safety analysis of the air braking system and describes the massive fault information as a probabilistic meaning, which greatly simplifies the analysis workload. The Go system-based modeling structure and Bayes probability theory can realize information fusion and data-driven comprehensive security analysis methods and achieve more effective safety assessment.

3. GO-Bayes Model of Air Braking System

The GO-Bayes method is based on the fault of the system unit. It uses the basic unit model and logic module to establish the analysis model according to the system working principle diagram or flow chart and calculates the reliability probability parameters according to certain operation rules. At the same time, the Bayes method is used to carry out the real-time fault deduction of the system’s security state and solve the reverse probability so as to realize the comprehensive evaluation of the system security. The GO-Bayes method is applied to the safety analysis of the train air braking system with the following innovative features: (1)The structural features of the air braking system are modular, and the Go model inherited by the Go-Bayes method exactly meets the modeling requirements(2)The development of state monitoring technology makes it possible to obtain more monitoring information, combining historical failure information and monitoring information effectively can better analyze the security state of the system, and the Bayes theory introduced by GO-Bayes method is the key method of information fusion(3)When the system fails, we can combine the real-time status monitoring information and experiential knowledge to locate fault points based on probability. The Bayes theory is used to obtain the “reverse probability” and achieve quantitative and qualitative analysis. It has a practical significance for the safety analysis and fault detection of air braking systems

The GO-Bayes method inherits the GO model diagram (schematic, flow diagram, or engineering). Therefore, the basic operator of the GO method is used in the modeling process, as shown in Figure 4.

In the modeling process of the GO-Bayes method, firstly, the basic unit models are summarized and give a calculation of the units. Secondly, according to the system structure, the system model is built in the basic unit, and the units are connected in sequence. The original input data is calculated by the algorithm of all the units in the system model, and the success rate of the system output is obtained. Similarly, based on the results of the forward real-time assessment, the inverse probability inference is reintroduced to obtain real-time failure analysis results. The system adopts a hierarchical structure of modules; that is, the system is composed of subsystems, and the subsystem is composed of basic components; only input/output data interfaces and other control interfaces are provided. The air braking system is a large system with a complex structure and a large model structure. It is easy to read and manage with a hierarchical model, as shown in Figure 5:

The common unit models in the project are the series model, parallel model, and voting model. For discussion purposes, make the following assumptions: (1)The system and unit have only two states: normal and faulty(2)The normal and faults between each unit are independent(3)Since some units are connected to other units through input and output ports, for example, air compressors have many components, but we only need to consider whether it can compress air normally. So these units can be modeled as a whole without considering the internal security status and connection of components

3.1. The GO-Bayes Model of Basic Units

The system adopts a hierarchical modeling structure and divides the braking system into six parts: air supply device, brake air circuit, air spring suspension, brake control unit, parking brake control device, and basic brake device. The detailed model of each basic unit is as follows. The meaning represented by the component number can be queried in Table 2: (1)Air Supply Device. The drive motor drives the air compressor device to work. After the compressed air is dried by the drying tower, it is injected into the total air cylinder for storage, as shown in Figure 2(2)Brake Gas Path. Compressed air is mainly transported through the main air duct of the train, and some shut-off valves, safety valves, etc. are required to ensure the circulation or cut-off of compressed air, as shown in Figure 6(3)Air Spring Suspension. Compressed air is delivered to air spring air cylinders and air springs through stop valves, filters, etc. The left and right height of the vehicle body is adjusted by the height valve and differential pressure valve, and the pressure information is transmitted to the brake control unit, as shown in Figure 7(4)Brake Control Device. The brake control device (BCU) is mainly connected to the compressed air delivered by the brake cylinder, the control command issued by the brake microprocessor control system (ECU), and the pressure of the air spring suspension device and so on. The control analog converter converts the control command to “precontrol air pressure” and controls the relay valve to output the brake cylinder after being restrained by the pressure switch and the weighing valve, as shown in Figure 8(5)Parking Brake Control Device. The parking brake device is directly connected to the brake compressed air, and the parking brake solenoid valve and pulse valve are controlled by the parking brake instruction to prevent the vehicle from running, as shown in Figure 9(6)Basic Braking Device. The basic braking device is an implementing part of the braking system; the compressed air is finally delivered to the brake air cylinder to push the brake shoe and the wheel to generate friction to complete the braking, as shown in Figure 10

In the establishment of the system GO-Bayes model, some units describe the object as a single component, and some units describe a set of functional devices, because the device and the internal parts of the low failure rate or failure data are not easy to obtain, such as air compressor, analog converter, ECU code, relay valve, cut-off valve, and parking braking solenoid valve. We will not list them one by one, but their work effect is only related to the probability of success of the unit; we can simplify the modeling complexity of the whole system by means of unit modeling. Therefore, the modeling process should be based on the specific characteristics of the object system flexible modeling. Another feature of the use of hierarchical block modeling is that one of the modules can be analyzed separately in the use of analysis of the entire system analysis; of course, the entire system analysis must pay attention to the correlation between modules such as common signal processing.

3.2. The Processing Method of the Common Signal in the Model

In the system model, there is a unit output to two or more than two successive units to be merged as the input of subsequent units; then the input data is called common signal, as shown in Figure 11.

The existence of the common signal will cause high-order subitems in the forward channel of the system. The value of these probability subitems is very small, which will affect the accuracy of the analysis. Therefore, we must preprocess the common signal.

The output of single 1-1 is the input of 2-2, 2-3, and 2-4. When 2-3 and 2-4 are used as 5-5 inputs, when calculating the unreliability, two input signals contain 1-1 output data and need correction. The same is the case that 2-2 and 5-5 also contain 1-1 of the output data as the input of 5-6.

The processing method is when the common signal branches are reassembled, the reliability of unit 1-1 will appear two times and higher times and then replace two times and higher order items into one item.

Set up , , , and ; when calculating the 5–5 output, it consists of common signals:

In the meanwhile,

Obviously,

It consists of a square term ; here, we replace with , and the result is shown as follows:

In the reverse fault probability inference, there is no need to deal with the common signal; that is to say, when the system comes to fault, , the reverse probability of unit 2-2 output signal is

The reverse probability of unit 5-5 output signal is

The reverse probability of unit 2-2 itself is

3.3. Calculation of Probability Indicators

Rail transits run continuously for a long time and are not easy to analyze. We can take the average processing method and average the fault events to the time unit of 100H. Suppose a 100H steady running time, as shown in Table 3. In the calculation process, indicates the failure probability of units when the system fails, indicates the normal probability of units when the system fails, indicates the failure probability of units when the system is normal, and indicates the normal probability of units when the system is normal. According to Bayes’ theorem, the posterior fault probability can be obtained as

As can be seen from the above equation, the posterior fault probability is closely related to the prior probability and presents a positive correlation. (1)Original data: the component failure rate data come mainly from the real-time train safety monitoring system. Assuming a steady-state operating time of 100 h, the details and calculation results are shown in Table 4.(2)Calculation results

3.4. Experimental Analysis

From the above safety analysis, we can draw the following conclusions: (1)When a system exhibits abnormal behavior, we have to obtain the real-time inverse probability through a fault backward reasoning method. The inverse probabilities of the components (3-B4, 0.01), (1-B5, 0.15), (3-D1, 0.04), (1-D2, 0.78), and (1-F11, 0.47) are significantly larger than those of the other inverse components, which indicates that these parts may be abnormal. We should therefore focus on tracking them. In addition, by using the system diagram model to analyze 3-B4, 1-B5, 3-D1, and 1-D2, which are working parts that are connected together, the abnormal output of 3-D1 indicates that the possibilities of failure of these four components are very large, with that of (1-D2, 0.78) being the highest. This represents the electronic control unit, which has a higher error rate than other components because of the many electronic circuit elements that it contains. While (1-F11, 0.47) is an independent failure, in fact, it represents the speed sensor with a self-resetting function. False detection of failure occurs frequently for this component. If an abnormal event is detected when its probability of failure is less than 1/2, it should be checked and then repaired if necessary. This is because the failure probability of the equipment is generally not very large. If there is a large probability of failure, it indicates that there is a possibility of error detection or improper operation, so the threshold value of 1/2 is set up(2)Traditional fault probability calculations, such as FTA, ETs, and HFACS, depend on the forward deduction of historical data. By contrast, the GO-Bayes method provides structural models of a system and an inverse reasoning probability. The model output and inverse probability reflect the system’s reliability more accurately than traditional fault probability

The GO method was initially developed to solve reliability problems for complex systems, and the proposed GO-Bayes method combines it with a Bayes approach and considers both normal and reverse probabilities, thus making the method adaptable to a complex system such as a transit braking system.

4. Big Data Platform of Air Braking System

The operating data of the braking system is transmitted to the ground equipment, and through the execution of big data mining, the operating status and events in the running process are analyzed. Through GO-Bayes analysis, the fault early warning and hidden trouble investigation of the equipment are finally achieved.

4.1. Big Data System Architecture

As shown in Figure 12, the system consists of five parts: log download device, wireless transmission device, data operation server, GO-Bayes fusion module, and visualization terminal.

4.2. Key Technologies

The big data platform mainly includes the following key technologies: (1)Big data storage and processing: the main storage and processing modes can be divided into two kinds of stream processing and batch processing. Stream processing is direct processing. It treats data as a stream. When new data arrives, it is immediately processed and returns the desired result. This mode is suitable for critical events with high real-time requirements: air compressor failure, dry tower leakage, etc. Batch processing is stored before postprocessing. This mode is suitable for parts that do not require high real-time performance(2)Data mining and fusion: air braking system will generate a large amount of data during operation, such as various status data and equipment maintenance data. Through the mining and fusion of the braking system’s operating data, the potential relationship between data can be excavated and the system can be obtained. As a result, we can realize the real-time, continuous, macroprecise grasp of the overall situation of the dynamic system and the accurate perception of local, multiangle, multilayer, and multimode(3)Data visualization: data visualization technology refers to the use of computer graphics and image processing technology, through the comparison of multidimensional data, mapping through graphics, animation, and other visual means; it is easy to reveal the laws of the data; the correlation between the various factors found abnormal data, etc., while facilitating the understanding of the results of big data analysis

5. Monte Carlo Simulation of Air Braking System GO-Bayes Method

Generally speaking, the calculation process of the GO-Bayes method is very complicated and requires significant computational resources. There is thus a high cost in terms of energy and time and a high risk of error when the method is used for sophisticated systems. Therefore, to deal with this issue, a Monte Carlo simulation approach is adopted here to find a numerical solution based on the GO-Bayes method. Historical data and current state variations are combined to provide a predictive analysis for future reliability and security.

5.1. Hypothesis Simulation of System Distribution

The failure process is an inherent physical property of the device, and it is a long-term process under steady-state operation, with a constant failure rate. Such a system is also known as a Markov model repairable system. Its reliability model has an exponential distribution.

There are components in the system, with each unit and the system as a whole having only two states, normal and faulty, represented by 1 and 0. The state function is defined as

is the state of the system at time and the state of each part at time .

Based on the analysis of the rail transit braking system using the GO-Bayes model, Monte Carlo simulation is used to check the results of the analytical method described above.

5.2. Simulation Process

(1)The first simulation before system failure: for the first simulation before system failure, each component of the system is absolutely new, and the simulation proceeds by examining the security and reliability of the system as time changes. The process is as follows: (a)The number of simulation is set (b)The maximum system failure time and the number of intervals for the statistics, , are set. The interval length is . It is assumed that all system failure times are within this interval(c)Uniform random numbers in the interval [0,1] are generated and converted into a random sample for each unit failure time, i.e., (d)Units in minimal cut sets are obtained by qualitative analysis using the GO-Bayes method, and failure time sample values are obtained from the units of each minimal cut set(e)The failure time of the system is calculated by obtaining the minimum sample value of the failure time of each minimal cut set, i.e., (f)If , then the cumulative number is increased by 1, and the procedure is repeated(g)The simulation is repeated, and the system failure time is accumulated in each interval until , where the simulation ends

The initial failure probabilities are 0 at the first simulation. In the new state, the working times, performance, and tendency are obtained as the time sequence values before the first failure. The system security trends are also evaluated. (2)Dynamic prediction and simulation of system security status: as changes in the security status of an urban rail transit train braking system occur mainly during operation, fault simulation and forecasting of future operating trends are critical aspects of a system safety study. This process is similar to simulation of the first failure. However, there are some differences: it is necessary to track the servicing time of each unit and confirm its current reliability. In other words, simulation must be done after determining the service time

The simulation process is subject to the following hypotheses: (a)The reliability state of each unit is steady during the maintenance periods resulting from system failure(b)Failed parts are removed immediately and repaired offline in the maintenance period. Meanwhile, new parts are installed to keep the system working in a normal state with no effect on its operation

The simulation process is changed on the basis of the simulation of the first failure time, using random samples of unit failure times, as shown in Figure 13, where is the present unit service time and is the remaining service time. The current failure time is sampled to predict the possible service times of the unit in the future.

The related unit service times are shown in Table 5.

We can derive the following results from this experiment, as shown in Figure 14: (1)The reliability curve before the first failure is the same as the regular probability curve, but they differ from each other after a maintenance and failure update(2)The trend of the normal probability curve after a system update is worse than before the system’s first failure. The reason is that some old units remain in their normal states in the updated system, which will make it easier for the system to get into a failure state. The reliability is also lower(3)After a system update, the reliability is worse than that of the initial system, although the system can operate as normal. As time goes on, the probability of normal operation is closer to the reliability of the system. It should be noted that reliability is fundamental to safety, while safety provides a reflection of the real-time state of reliability

6. Conclusions

From the monitor data of the experiment, the new structural GO-Bayes approach to RAMS presented in this paper is a comprehensive methodology for system safety analysis and evaluation modeling. Using a system diagram model, it is possible to obtain a system’s normal work probability output, which is essential for fault backward reasoning. This paper has discussed the basic components or units of a system and the related analytical results. Application of the proposed method to a rail transit vehicle air braking system has illustrated its potential contributions to safety analysis and assessment. Finally, it has been shown that the GO-Bayes method provides accurate results consistent with those obtained by a Monte Carlo simulation process and, with progress in simulation techniques, the respective results will become closer. The results can be used to trace, maintain, and improve system components and eventually ensure the safe operation of the system as a whole.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this article.

Acknowledgments

This work was supported by the China High Technologies Research Program (RCS2017ZJ001 and Z171100002117007).