The difference between reliability and availability

What is the difference between reliability and availability?

Terry O
 
View Printer Friendly Format
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Thanks Josh

That article was the inspiration for my question.

What do you think of the points made in the article?

Terry O
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
If it's reliable, it's available. However, if it's available it's not necessarily reliable.

Reliability is a state of knowing where the machine is along its failure curve. A spare can be available but without knowing its reliability it may not be available very long. However, if the spare gets you through the 'out of service equipments' repair then the pumping system is available; therefore, the process is available. Although this presents available, the MTBF will only be extended through a reliability program or so I think.
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Although RAM is important, I guess not many do RAM modelling and simulation.
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Terry,
PaulB is a well respected authority in the field.

quote:
That article was the inspiration for my question.

What do you think of the points made in the article?


I am not sure if the majority of us need the maths lesson to understand, or even if it will actually help understand the terms.
Availability is simply the time an item is able to operate (operating time+standby time) as a ratio to the time in service. It tells us how much the cash machine can yield if we feed it with the required inputs e.g. raw material, power etc. and remove the product as it is made.
Reliability is the probability that at any point in time, it will operate correctly for a further specified length of time. For example, a pump has been running well for 3 months; what is the probability that it will run for 6 more months, 12 more months etc.
The two are related; availability depends on reliability.

PaulB says (incorrectly) that the two are never the same. He ignores a class of items called non-repairable, which are replaced with a new (or as good as new or AGAN) item when a failure takes place. Think of light bulbs, ball bearings, smoke/gas/fire detectors, emergency shutdown valves etc., and you will recognize what I mean. In all these cases, once we know it has failed, we will simply replace it with a new or AGAN item. In this specific case, the reliability and availability are numerically the same.

But for the majority of items, which are repairable, his comment is perfectly true.
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Terry

Here's my 2 cents on it.

Reliability is all to do with the probability that the system will function correctly and is usually measured by the MTBF.

Availability is the percentage of time that the system is capable of functioning correctly. This is usually measured by calculating uptime divided by the total hours for a given period (i.e. 8760 hours in a year)

You can always dig into the data to differentiate between planned and unplanned downtime to get a clearer picture of causes. When MTBF and availability are reviewed together it can give a better analysis of what is going on. For example, you could have a low MTBF but if the downtime is only a few minutes each time the failure occurs then the your availability could still be be very high.

If the system was switched off during a shutdown to allow maintenance or repair etc of other equipment / systems and is successfully started up again when the shutdown is over was it available? I think so but interested to see if others agree.
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
It's easy to forget the intent is to monitor the effectiveness of the maintenance effort. If you're lucky enough to have the data readily available, both availability and reliability can be tracked over time as a measure maintenance value. If the two do not track together, the relevant factor will be downtime. As an example, suppose maintenance becomes more proficient at repairing a recurrent failure, reducing downtime. Availability will improve but reliability will not be improved. Certainly, there is value to the improved maintenance efforts even though the root cause of the failures has not been corrected, and the reliability remains unchanged.
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Availability is the probability of having a piece of equipment available to work at any moment within a certain time span (long enough to comprise several episodes of failure and repair).

Reliability is the probability of a piece of equipment not failing until a certain moment in time (mission).

Availability depends on Reliability and Maintenability (time to recover following a failure) but the opposite is not true. On the other hand, Reliability and Maintenability are independent of each other. You can have a machine with high availability although its reliability is low if it takes little time to repair or still you can have a machine with low availability despite its reliability is high in case the times to repair are long.

The attached spreadsheet is a Monte Carlo simulation model that allows you to simulate 1,000 episodes of failures and repairs (moments in columns E and F) of a critical component pertaining to a piece of equipment. The renewal of components is supposed to be a homogeneous Poisson process, that is, the failure behaviour of each component is exactly the same as the one that preceded it. The failure behaviour is according to a Weibull distribution (the three parameters are in cells C4 to C6) and the time to repair follows a LogNormal distribution (the two parameters are in cells E4 and E5). The mission is entered in cell C9. In cells O4 through X103, 1,000 moments of need of the equipment are generated at random between 0 and the latest simulated event (cell F2013).

After each iteration (by pressing the key function F9), you can observe:

 The reliability that results in cell H10. If you compare this to the theoretical value in cell H8, you notice that they are quite close – the former tends to the later as the number of iterations increase or the number of repetitions increase by calculating the expected value of cell H10;
 The availability (as a probability) that results in cell J10. If you compare this to the theoretical value in cell J8, you notice that they are quite close – the former tends to the later as the number of iterations increase or the number of repetitions increase by calculating the expected value of cell J10;

In cell J11 you have the availability that also resulted from the simulation but as a natural frequency instead.

In conclusion, the Monte-Carlo simulation model presented here proves my definitions of Reliability and Availability described above.

Regards,

Rui
 
 
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Let me just rephrase your question

1st Question : If the equipment's Availability is 100%, is the equipment said to be at its peak & reliable ?

Answer : NO, The equipment may be 100% AVAILABLE but have a low UTILIZATION or have not been utilized at all, hence availability is
not a measure of reliability but only when the machine is used. You drive your car to work and then you park it in your building, your car is 100% available but it was only utilize for a couple of hours, 1 hour going to your work and 1 hour going home

If the equipment's Availability and Utilization is 100%, is the equipment said to be at its peak and reliable ?

Answer : NO, The equipment may be 100% Utilized and Available but machine may suffer from errors and speed loss. Example a machine
speed need to be reduce from 1500 uph to 1200 uph so as not to deliver problems in quality.

In laymans terms, you might be 100% utilizing your car and using it 24 hours of driving, but you cannot speed up since if you do then you will hear a lot of unwanted sound in your engine, transmission, tie rod bearing etc and you know soon that it will break down if you continue to travel at the speed you want.

If the equipment's OEE is 90 % or more, is the equipment said to be at its peak & reliable ?

Answer : NOT EXACTLY, the real objective of OEE is not about achieving 85% or more but about satisfying the 3 components, in the case below
the equipment is suffering from a large rate of defects.

OEE = 100% (Utilization) x 97% (Efficiency) x 93% (Yield) = 90.21%

In this case the machine is failing

Hope this helps,

Rolly Angeles
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Vee,

You quote :

Availability is simply the time an item is able to operate (operating time+standby time) as a ratio to the time in service.

I think you are speaking about utilization and not availability, these 2 are different in my own perspective and from what I am teaching.

The machine may have been available but not utilized. But the machine can only be utilized if its available.

Rolly Angeles
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Josh,

In your article, it states MTBF or MTTF are these two the same or they are different ?

Appreciate any feedback from this,

In my opinion and understanding these two Mean indicators are entirely different but related in a way with each other, I'll explain later.

Rolly Angeles
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Rolly,
I am not sure I follow your query, as I think we are saying exactly the same thing. You say,
quote:
You drive your car to work and then you park it in your building, your car is 100% available but it was only utilize for a couple of hours, 1 hour going to your work and 1 hour going home

How does this differ from
Availability is simply the time an item is able to operate (operating time+standby time) as a ratio to the time in service
As you correctly point out, Utilization is the ratio of operating time (read - driving for 2 hours) to total time, and availability is as I defined and you illustrated (read - driving + parked time).
As far as MTTF is concerned, it is only applicable when the reoair is to As Good As New (AGAN) condition. It is always applicable to non-repairable items, and for those cases of repairable items done to AGAN standards. Some people (incorrectly) bring in the issue of repair time, which is NOT the criterion to use. The point is that AGAN repairs bring the reliability back to 100% at the start of the new run, while other repairs do not bring it to 100%, so the item starts off its new run with a lower reliability.
MTBF, on the other hand is applicable to repairable items (I gave some examples in my earlier post).
The formula used for computing MTTF and MTBF is the same, but they are applicable in different situations.
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Vee,

I know your good but I got u this time, MTBF and MTTF are not the same, I'll explain later. The answer is in your reply. These 2 are different in my own understanding and humble opinion.
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
I thought availability and utilization were two concepts long mastered by us all but after reading these last few posts, I myself became a little bit confused. Would you please point me out any desagreement in the following example:

One piece of equipment is available to work for typically 40 hours per week, that is, its capacity is 40 hours/week. Suppose two situations:

1. At one particular week it was loaded with 36 standard hours. During the course of the week the equipment failed while it was working and it took 2 hours to repair. Because (40 – 36) > 2, the work could all be accomplished and it took 36 standard hours or 37.5 actual hours (efficiency = 37.5/36 = 96%). In this instance, one may say that the availability was (40 – 2)/40 x 100 = 95% and the utilization was 37.5/40 x 100 = 93.75%;
2. At another week the equipment was loaded with 38 standard hours. During the course of the week the equipment failed while it was working and it took 8 hours to repair. Because (40 – 38) < 8, the work cannot all be accomplished and it took (38 – 8) + (40 – 38) = 32 standard hours or 30.476 actual hours (efficiency = 30.476/32 = 105%). In this instance, one may say that the availability was (40 – 8)/40 x 100 = 80% and the utilization was 30.476/40 x 100 = 76.19%.

Do you agree?

Regards,

Rui
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
MTTF means Mean Time To Failure or "mean or average life" of a component regardless it is replaced or not immediately after a functional failure.

MTBF means Mean Time Between Failures and is the same as "the average time between two successive failures", therefore comprising the time to repair every failure, that is MTBF = MTTF + MTTR (Mean Time To Recover, Replace or Repair).

Suppose 3 failures of a component in a row (failure 1, failure 2 and failure 3) extending over a time span where event 0 is the origin of time and T is today. At moment 0 the component was working properly, at moment 1 failed and it was replaced by a new one, the same happened at moments 2 and 3. Today (moment T) the component is still working fine.

Call 1´, 2´ and 3´ the moments when the replaced components started again their work. Suppose that events (failures) 1, 2 and 3 took 1, 2 and 3 calendar hours to recover, that is, moments 1 and 1´ are 1 hour distant in time, 2 and 2´, 2 hours and 3 and 3´, 3 hours. Suppose furthermore that events 0 and 1 are 500 hours distant in terms of running time, 1´ and 2 are 700 hours and 2´ and 3, 600 hours.

In this instance, and from a management point of view, MTTF is equal to (500 + 700 + 600)/3 = 600 hours, MTTR = (1 + 2 + 3)/3 = 2 hours and, finally, MTBF = 600 + 2 = 602 hours. The most important issue here is to track how this indicator is behaving over the time (increasing, which is good, or decreasing).

Please note that, from an engineering perspective, the first interval (500 hours) and the last one (600 hours) should be censored.

If the system is not repairable, it doesn't make sense to address MTBF, of course.

In order to avoid confusion, I always refer a system as "being repairable in service" (the most frequent circumstance) or "not being repairable in service" (the case of a missile or a satellite after having been launched or still a parallel (redundant) arrangement of batteries located in a remote place where you go only from time to time on a regular basis). The missile and the satellite are "repairable" while they stay at the stores but they are not as soon as they are "in service". The batteries are not repairable in service, that is, while the maintenance person in charge is away, if one unit fails (the remnant batteries split the load), it will keep this way until somebody comes, notices and corrects the failed situation.

Regards,

Rui
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Rui, in a plant environment, I think instruments are normally not repairable so replacement is a common policy eg maybe transmitters, thermocouples, pressure gauges, sensors, etc. However, worn out trims of control valves of punctured diafragam of their actuators can be replaced.
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Rolly, If I remember correctly from what I read, in most cases, MTBF is different from MTTF but there is a special case where MTBF is approximately equal to MTTF. Maybe Vee is pointing out this special case where MTBF~MTTF.
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
I think there is an IMechE publication addressing this subject of repairable and non-repairable items quite in detail. I don't have the tittle offhands now.
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
I saw Paul Barringer has registered but not commenting?
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
quote:
Originally posted by Josh:
Rui, in a plant environment, instruments are normally not repairable so replacement is a common policy eg transmitters. Hoever, control valve trims can be replaced.


Yes Josh, I quite agree. I correct myself: Even if that kind of devices are not repaired and are simply scrapped and replaced by new ones, it still takes time to do it. Therefore a TTR (Time To Replace) always applies after a failure, meaning that indicators MTTF, MTTR and MTBF are still possible to control as time goes by.

I think the term "repairable" is not the most appropriate and should be abandoned as in the end, every system can be "repaired" or "recovered". I would suggest "recoverable" instead. The only distinction, in my view, is to be able to do it while in service or not as I exampled in my last post.

Regards,

Rui
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
"Recoverable" is more general in scope and would embrace both the terms "repairable" and "replaceable". Misunderstandings would be almost impossible.

What do you think?
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
The IMechE document is The reliability of mechanical systems by J Davidson under IMechE Guides for the Process Industries ISBN 0 85298 675 0.
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Rui, Rolly,

Rui, your statement
quote:
MTBF means Mean Time Between Failures and is the same as "the average time between two successive failures", therefore comprising the time to repair every failure, that is MTBF = MTTF + MTTR (Mean Time To Recover, Replace or Repair).
and
...the missile and the satellite are "repairable" while they stay at the stores but they are not as soon as they are "in service". The batteries are not repairable in service, that is, while the maintenance person in charge is away,

is not in line with the authorities quoted below
1. The IMechE document "The reliability of mechanical systems by J Davidson under IMechE Guides for the Process Industries ISBN 0 85298 888 8 (ISBN differs from the one quoted by Josh), on page 34, says, Quote
Similarly there is a difference between the MTTF of a non-repairableitem and the MTTF of a repairableitem (emphasis mine).
The term repairable has a technical meaning and is not the same as in common English; items such as ball bearings and light bulbs or microchips are termed non-repairable. Also all items subject to hidden failures. Your examples of the battery and the missile do not meet the technical definition of 'repairable'.
2. The book "Reliability Maintenance and Logistic Support by Kumar,Crocker, Knezevic and Al-Haram, Kluwer, ISBN 0 412 84240 8" says on page 76, quote
MTTF is.....a measure of reliability of non-repairableitems... (emphasis by authors. On page 85, they say, quote
MTBF is.... a reliability measure for repairablesystems (emphasis mine).
On page 87, they say, quote
The value of MTBF is equal to MTTF if after each repairthe system is as good as new.

Two points;
a) MTTR has nothing whatsoever to do with reliability; it only affects availability. So it does not affect MTBF, see item (b0 below.
b) MTBF is not MTTF+ MTTR as incorrectly stated in your post.

In computing MTBF, we do only take the time available in service, excluding any downtime for repair.
In computing MTTF, replacement time is usually negligible, so calendar time is considered as operating time. This is arguable, but is acceptable in practical terms.

Rolly, you said
MTBF and MTTF are not the same,
I agree entirely with you; however their numerical valuesare computed using the same formula.
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Thanks Josh for the information. I looked through the IMechE site and found:

Davidson J and Hunsley C (Editors), The Reliability of Mechanical Systems, 2nd Ed, J Wiley Publishing, IMechE, London 1994, ISBN 0 85298 881 8.

This is a guide for practicing engineers. It provides an introduction to the application of the methods and techniques for assessing the reliability of mechanical components, equipment, plant systems and subsystems.

The following subjects are discussed:

 The philosophy and concepts of reliability engineering
 Analysis of in service reliability experience
 A basic approach to reliability assessment for mechanical process systems
 Techniques for process plant reliability assessment
 Collection and processing of reliability data
 Case studies.

The subjects sound quite appealing to me. Is this the book that you meant Josh?

The title of one of the subjects called particularly my attention "Analysis of in service reliability experience". Davidson also makes use of the term "in service" as I do.

Despite I don't have the book, I have got many others on Reliability and Maintainability and they all refer to "repairable" and "non repairable" systems the usual way that causes confusion. That is why I proposed the term "recoverable" instead. Anyway, here is an example which illustrates what I mean.

Suppose a system composed by two elements (1) and (2). The duty element is at the present moment number (1) and the stand-by element the number (2). It could be a duty pump and a stand-by pump. Suppose that they are alike in what refers reliability and that any one of them fails randomly every 200 hours in average and still that the mean time to repair is 10 hours. Now suppose two different situations:

1. The system can be serviced at any time, meaning that as soon the duty element (1) fails, the stand-by one (2) takes over and the former is immediately taken to the repair shop to be repaired. As soon as number (1) is fully recovered it is put back into place and becomes the stand-by element in turn (a 50-50% arrangement). The system would only fail if the element (2) failed while element (1) was being serviced. This is what is usually known by a repairable system. I would rather call it "repairable (while) in service";
2. The system cannot be serviced at any time, meaning that as soon the duty element (1) fails, the stand-by one (2) takes over and the former stays indefinitely in a failed state. As soon as number (2) also fails, the system fails and only then both elements will receive service (the mean repair time will be then 2 x 10 = hours). This is what is usually known by a non-repairable system. I would rather call it "not repairable (while) in service";

To assess the performance of the two situations, you create a model of the system and use Monte-Carlo simulation to see what will happen if the model is run for a long time (long enough to accommodate a few hundreds of failures).

I run the model for a while and found the following results:

Repairable system (while in service):

Reliability for a mission of 100 hours: 0,9598;
MTTF = 4,610 hours;
Availability = 99.88% (which coincides with the theoretical value).

Non-repairable system (while in service):

Reliability for a mission of 100 hours: 0,9081 (which coincides with the theoretical value);
MTTF = 400 hours (which coincides with the theoretical value);
Availability = 95.21%.

Please note the difference in what MTTF (huge) and availability are concerned.

In conclusion, the two situations are far different.

Regards,

Rui
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Vee,

Thanks for your observations.

The difference between MTTF and MTBF seems more like a semantic problem to me. I am aware that the way I interpret those two terms is not in line with the authorities referred by you. I know of a lot more authorities that adopt the very same definition but I also know others who adopt the same definition as mine. I can't see any reasonable reason to maintain such a cumbersome definition: meaning the same but applying to different systems in what maintainability is concerned. It gives rise to confusions and I face them frequently coming from practitioners. I feel quite comfortable since long with my definition (the same as many others) and I have already explained how I easily distinguish between a so-called repairable system (repairable in service) and a non-repairable system (not repairable in service). This is the way I teach my students, write in my books and presentations and advise practitioners – no contests so far! Smiler. Everything is clear and there is no room for confusions.

Regards,

Rui
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Rui,
quote:
I also know others who adopt the same definition as mine. I can't see any reasonable reason to maintain such a cumbersome definition:

Cumbersome or otherwise, the equation MTBF = MTTF + MTTR is not correct, as far as I know. MTTR has absolutely nothing to do with reliability, it is a completely indepemndant parameter. I hope we can agree on this at least.
Perhaps you can quote the authorities who use your definition, for my reference.
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Vee, do you consider an industrial air compressor as a repairable or non-repairable system?

Rui
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Rui,
I am answering using the technical definition of 'repairable' not the common English meaning.
Most of the compressor is repairable, but not, e.g., its safety relief valve, because that has a hidden failure mode (fail to lift on demand)
Now you have to tell me why you ask this question without addressing my earlier question.
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Vee,

I briefly reviewed some of the English literature on Reliability that I have and found:

Time To Failure (TTF), means "time interval between two successive failures" and is an acronym which is historically applied to components which are not repairable, such as lamps and bearings. Time Between Failures (TBF), on the other hand, means exactly the same but is only applied to components which are repairable.

A side note: What to call a fun blade which is considered non-repairable if I decide, for some reason, to restore it by adding some eroded material by using a suitable and available technology? I remember once in a merchant ship a few years ago during an emergency, having had to restore the outer ring of a ball bearing mounted on a gear box as we hadn't any spares available aboard. Should I have called the bearing a repairable or a non-repairable item at the time? That depends on the circumstance, doesn't it? So why to keep this cumbersome definition which is the cause of so much confusion?

See on this purpose the image attached of an Excel application authored by Paul Barringer where he, himself, uses MTTF for the components and also for the compressor. Shouldn't he had used the term MTBF instead? As you said, the components (with a few exceptions) and the compressor are repairable.

TTF is translated into Portuguese as "tempo de bom funcionamento" or "operating useful time" and TBF as "tempo entre duas falhas consecutivas" or "time between two successive failures". TBF, the way it is translated, comprehends necessarily the time to repair (recover or replace) TTR. For me, the (historic) convention doesn't hold but rather what the words mean. In plain Portuguese TBF = TTF + TTR.

In result of our conversation, I am thinking seriously of replacing the two English acronyms TTF and TBF (the ones that have different meanings depending on whether you are speaking "technical" or plain English) in all I write from now on, by two others with real meaning in the Portuguese idiom. Thanks for the stimuli.

Regards,

Rui
 
 
 
Photos (1)
Air_compressor
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Dear RUI, VEE and all,

The discussion on this thread is getting deeper and deeper. First I would like to clarify my point on MTBF and MTTF.


In reality, machines do not fail as most of us usually try to say. Hey guys, this machine is failed go get me a mechanic. What is right is that a component, part or spare on a machine had failed. And so this is where I believe that repairable (MTBF) and non-repairable (MTTF) had been derived to distinguish one indicator from another. Well of course Im disregarding in this thread decommisioning the equipment !!!

It all started out in one of the company's I am teaching since, I am accustomed in using MTBF all the time, and another consultant who also teaching in that same industry was using MTTF all the time and people got confused if they are the same or different and so I need to make that distinction or otherwise one of us will loose our job of teaching.

If we speak about the machine as a whole then the correct term to use will be MTBF since machines can be repaired, right ? Now if a bulb, bearing, seal, capacitor, had failed and it cannot be repaired but replace then we speak about MTTF. But most of us still are accustomed in using MTBF of the bearing, MTBF of the light bulb and so on.

Vee, I have to go with RUI that when you consider the MTBF of the component, this is the mean time between failure, and we need to consider the repair time that took place to be able to get the machine running once again. While when we speak about MTTF, which is the mean time to fail then we speak about the actual time when the part had reached its last second to operate. Hence, I will go for the conclusion that MTBF = MTTF + MTTR.

I'll speak about availablity & utilization later on and problem with industries that there is no standard formula on this most specially when your equipment had not run or is not loaded.


Cheers,

Rolly Angeles
 
 
 
Files (1)
Like Like (0 likes)
PermalinkView Printer Friendly Format
Rolly and others,
Let us start at the beginning. When we repair any component, we can bring it to a condition where its reliability at that point is 100%. We call this condition As Good As New or AGAN. As you know, with use, the component reliability falls, untill it fails. Now, repairs done in the field are invariably to a lower standard, partly because of access, lack of skills, special equipment or tools, lighting, dust etc. In such cases the component starts its new lease of life, not at 100%, only but some value lower than this, as if it had already been in service for some time. In extreme cases, the spare component is not available, but to keep the machine running, we may clean and dress up the old component, and bring it to a state it was in just before its failure. In this case, we call the rapair type As Bad As Old or ABAO. You do not expect the component to last more than a few days or weeks, but it gives you breathing time to get a new spare.

For all items that start each new run in AGAN condition, we compute MTTF.
For all items that start some of the runs in non-AGAN condition, the measure is MTBF.

In both cases, the formula we use is exactly the same, the difference is ONLY the fact that in the second case all the runs do not start with AGAN parts, i.e they did not have a reliability of 100% at the start of each run. THAT is the logic to use.

Coming to your complex machine, you are absolutely right. When a machine fails, in fact one of its components has failed. If we replace it with new ones each time, we can measure the component's MTTF. But the machine has, at that stage, many other worn components, whose reliability is less than 100%. So when we replace one or two components, the machine as a whole is still far from AGAN
when we restart it. So the applicable measure for the machine itself is MTBF.

For both MTTF and MTBF, we use the formula:
(Cumulative time available to operate)/(Cumulative number of failures), where
Time available to operate= Operating time+standby time
thus time to repair is ALWAYS excluded from this formula.

Time to repair is an independant variable, that has NO effect on reliability. Since MTBF and MTTF are strictly reliability measures, time to repair is not an issue when we compute them. It matters only when we compute availability.

Rui, as a point of clarification, we use the term 'repairable' to denote any component that does not come back to AGAN condition at the start of each run. Items that have to be replaced with new or AGAN components are termed 'non-repairable'. It does not mean they cannot be repaired, only that they are always repaired to AGAN condition. As one well versed in Reliability Engineering, this definition may appeal to you, as it means the same in English, Portugese or other languages.

For most of us practitioners, this distinction between MTTF and MTBF is not very important. For all practical purposes there is no difference, and not worth losing sleep on.
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Rui, that's the book albeit ifenret ISBN no.

Just curious, have you done reliability modelling and simulation for industrial plants before?
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Yes Josh, I use simulation often to model and anticipate the behaviour of industrial systems.

Regards,

Rui
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Vee,

The process that you refer is still more complicated and known by "minimal repair process" which, as you know, deals with the aging of a system which is maintained by replacing or restoring only a small percentage of its components. This will leave the system in approximately the same state (age) it was in just prior to each failure. To model this point process, we define an intensity function as the rate of change of the expected number of failures with respect to time which is also referred to as the renewal rate, failure intensity, peril rate, or the Rate Of oCurrece of Failures (ROCOF). This intensity function, which is an unconditional probability of failure within a certain interval, is often (again) confused with the hazard rate function, which is a conditional probability of failure within a certain interval given that the unit has survived to the beginning of the interval. The hazard rate function is a relative rate pertaining only to the first failure, whereas the intensity function is an absolute rate of failure for repairable systems.

One more cause of confusion...

In this case you case, you don't have a single MTBF but rather an interval MTBF which diminishes as the equipment ages.

Another cause of confusion...

Of course that MTBF has nothing to do with reliability but rather to maintainability. MTBF only influences availability. I agree.

This activity of ours is definitively a complicated one... Smiler

Regards,

Rui
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Thought I'd share a few other points of view on MTBF

Attached is an article from RELIABILITY®Magazine on Equations of RCM that contains a definition of MTBF

Also a link to wikipedia and what they say along with a link to an article cited in their backup information
http://en.wikipedia.org/wiki/MTTF

http://www.vicr.com/documents/quality/Rel_MTBF.pdf

I think this points to one of the main problems we have in our profession, too many definitions for the same terms. How can we all talk about something when we always have to define the term because we've modified it to fit our particular company's environment, usually to make someone look better than they really are.

I think that OEE and availability should only be measured over a one year period of time (except for looking for improvements and trends) using 24 hours 7 days a week because that's the way we run our businesses.

Just becausse you're not running 24/7 now for whatever reasons doesn't matter. OEE and availability should provide info on how much capacity you have and how much it could be improved or utilized if changes were made.
 
 
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Joe,
Thanks for the two references.
The wikipedia one says, and I quote
Reliability is quantified as MTBF (Mean Time Between Failures) for repairable product
and MTTF (Mean Time To Failure) for non-repairable product
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Rui,
quote:
The process that you refer is still more complicated

I regret I have to disagree with you. One can make anything as complicated as one wants, but I have outlined some basic principles, namely,
If an item starts every run in as good as new or AGAN condition, we compute MTTF.
If an item starts some runs with less than aGAN, we compute MTBF.
For both we use the same formula, i.e. time available to operate/number of failure events.
MTTR has absolutely nothing to do with reliability and hence MTBF.
Items that start each run with AGAN condition are termed non-repairable (even if they can in practice be repaired)
Items that start some runs with less than AGAN are called repairable.

I am not sure what is complicated in these statements. We dont need advanced maths to explain any of these to people.
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Rui,
We should normally use the terms Failure Rate and MTBF in the constant hazard rate context(see Davidson, The Reliability of Mechanical Systems, ISBN 08529 8818). In all other cases Rate of Occurance of Failure is preferable (see Hoyland A, M. Rausand, System Reliability Theory, Wiley, ISBN 0471 593974.
Following this practive. MTBF does not vary with age.
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
Terry:

Wow! Lots of messages back and forth on simple subjects! Thanks for keeping me posted.

For the differences between MTBF & MTTF see: http://www.barringer1.com/TWMS.htm#MTBF_MTTF.

You can find tons of definitions and examples in MIL-HDBK-338 at http://www.barringer1.com/mil.htm

or specifically at http://www.barringer1.com/mil_files/MIL-HDBK-338.pdf

Most folks GET IT concerning availability. Unfortunately they quickly start to believe availability is THE answer to everything. Fewer folks GET IT concerning reliability.

To help people understand UNreliability (and it's complement, reliability) I'm thinking about offering a no-charge device that will Velcro over an automobile's master computer (on cars with a left side driver it's on the passenger side inside the cabin and near your ankle).

The GET IT device has a random clock. Once a day the clock will disable the computer for up to a 15 second interval. The "repair" can be accomplished by simply opening the hood and slamming it shut hard enough to "fix" the automobile and reset the clock for the next day. The short repair time will not significantly detract from availability of the automobile. But very quickly the driver will love to hate the repairs!

Perhaps the random failure of the automobile's computer will give the driver an appreciation of UNreliability and it's complement reliability. The key to reliability is to prevent failures. You cannot repair yourself to happiness.

I've published a couple of articles to my website that you may find of interest. See posted papers 35 & 36 at http://www.barringer1.com/Papers.htm . You'll like the SAP data in paper 35 and the corrosion data in paper 36.

----

Paul Barringer
Barringer & Associates, Inc.
Phone: 281-852-6810
FAX: 281-852-3749
Email: hpaul@barringer1.com
Web: http://www.barringer1.com
 
Like Like (0 likes)
PermalinkView Printer Friendly Format
 
Post Reply