Probability Distributions in Reliability
Probability distributions form the mathematical foundation for describing and predicting the failure behavior of electronic components and systems. These statistical models characterize how failure times are distributed across a population, enabling engineers to estimate reliability metrics, plan testing programs, and make informed design decisions. Selecting the appropriate distribution for a given application is essential for accurate reliability analysis.
Different failure mechanisms produce different patterns of failure times, and each probability distribution captures specific patterns. The exponential distribution describes random failures with constant failure rate. The Weibull distribution flexibly models infant mortality, random failure, and wear-out behavior. The lognormal distribution suits degradation and fatigue failures. Understanding these distributions and their applications equips reliability engineers to analyze data effectively and draw valid conclusions about product reliability.
Fundamental Reliability Functions
Before examining specific distributions, understanding the fundamental functions used to characterize reliability provides necessary foundation.
Probability Density Function
The PDF describes the distribution of failure times:
- Definition: f(t) represents the probability density of failure at time t
- Interpretation: f(t)dt gives probability of failure in small interval (t, t+dt)
- Normalization: Integral of f(t) from 0 to infinity equals 1
- Non-negative: f(t) greater than or equal to 0 for all t
- Shape: Shape of PDF reveals failure time distribution pattern
The PDF shows where failures are concentrated in time, revealing underlying failure behavior.
Cumulative Distribution Function
The CDF gives cumulative failure probability:
- Definition: F(t) = P(T less than or equal to t), probability of failure by time t
- Relationship: F(t) equals integral of f(x) from 0 to t
- Boundaries: F(0) = 0 and F(infinity) = 1
- Monotonicity: F(t) is non-decreasing; failures accumulate over time
- Unreliability: F(t) is also called the unreliability function
The CDF directly answers the question of what fraction of a population will have failed by a given time.
Reliability Function
The reliability function gives survival probability:
- Definition: R(t) = P(T > t), probability of surviving beyond time t
- Relationship: R(t) = 1 - F(t)
- Boundaries: R(0) = 1 (all units start operational) and R(infinity) = 0 (all eventually fail)
- Decreasing: R(t) decreases as time increases; reliability declines with age
- Key metric: R(t) is often the primary reliability specification
The reliability function provides the probability that a unit will still be operational at any given time.
Hazard Function
The hazard function describes instantaneous failure rate:
- Definition: h(t) = f(t)/R(t), conditional failure rate at time t given survival to t
- Interpretation: h(t)dt gives probability of failure in next dt given survival to t
- Units: Expressed in failures per unit time (e.g., failures per hour)
- Shape significance: Increasing, constant, or decreasing hazard indicates wear-out, random, or infant mortality
- Cumulative hazard: H(t) = integral of h(x) from 0 to t; R(t) = exp(-H(t))
The hazard function reveals how failure risk changes with age, indicating underlying failure mechanisms.
The Exponential Distribution
The exponential distribution is the simplest and most widely used distribution in reliability engineering, characterized by a constant failure rate.
Distribution Functions
Key functions for the exponential distribution:
- PDF: f(t) = lambda * exp(-lambda*t) for t >= 0
- CDF: F(t) = 1 - exp(-lambda*t)
- Reliability: R(t) = exp(-lambda*t)
- Hazard: h(t) = lambda (constant)
- Parameter: lambda is the failure rate (failures per unit time)
The single parameter lambda completely characterizes the exponential distribution.
Key Properties
The exponential distribution has distinctive properties:
- Constant failure rate: The probability of failure in the next time interval is independent of age
- Memoryless property: P(T > t+s | T > t) = P(T > s); past survival does not affect future
- Mean: MTTF = 1/lambda
- Standard deviation: Also equals 1/lambda; coefficient of variation equals 1
- Median: t_50 = ln(2)/lambda approximately equals 0.693/lambda
The memoryless property means equipment is as good as new at any age, which may or may not reflect reality.
Applications and Limitations
When to use and when to avoid the exponential distribution:
- Random failures: Appropriate when failures result from random external events or overstress
- Complex systems: Superposition of many failure mechanisms may produce approximately constant rate
- Useful life region: Valid during the constant failure rate portion of bathtub curve
- Not for wear-out: Inappropriate when failure rate increases with age
- Not for infant mortality: Inappropriate when failure rate decreases with age
The exponential distribution is often assumed for convenience but should be validated against actual data.
Reliability Calculations
Common calculations using the exponential distribution:
- Reliability at time t: R(t) = exp(-lambda*t) = exp(-t/MTTF)
- Failure probability: F(t) = 1 - exp(-lambda*t)
- Time for given reliability: t = -ln(R)/lambda
- MTTF from test: MTTF = total operating time / number of failures
- Confidence bounds: Chi-square based bounds for failure rate and MTTF
The mathematical simplicity of exponential calculations makes it attractive for preliminary analysis.
The Weibull Distribution
The Weibull distribution is the most versatile distribution in reliability engineering, capable of modeling decreasing, constant, and increasing failure rates.
Distribution Functions
Key functions for the two-parameter Weibull distribution:
- PDF: f(t) = (beta/eta) * (t/eta)^(beta-1) * exp(-(t/eta)^beta)
- CDF: F(t) = 1 - exp(-(t/eta)^beta)
- Reliability: R(t) = exp(-(t/eta)^beta)
- Hazard: h(t) = (beta/eta) * (t/eta)^(beta-1)
- Parameters: beta (shape) and eta (scale, characteristic life)
The two parameters provide flexibility to fit diverse failure patterns.
Shape Parameter Interpretation
The beta parameter determines failure rate behavior:
- Beta less than 1: Decreasing failure rate; indicates infant mortality
- Beta equals 1: Constant failure rate; reduces to exponential distribution
- Beta greater than 1: Increasing failure rate; indicates wear-out
- Beta equals 2: Linear increasing hazard; Rayleigh distribution
- Beta equals 3.4: Approximates normal distribution shape
The shape parameter directly indicates whether the failure mode is infant mortality, random, or wear-out.
Scale Parameter and Characteristic Life
The eta parameter determines when failures occur:
- Definition: Eta is the time at which 63.2% of population has failed
- Reliability at eta: R(eta) = exp(-1) approximately equals 0.368
- Scale effect: Changing eta stretches or compresses the time axis
- Mean relationship: Mean = eta * Gamma(1 + 1/beta)
- B10 life: B10 = eta * (-ln(0.90))^(1/beta)
The characteristic life provides a reference point for comparing reliability across products.
Three-Parameter Weibull
Adding a location parameter extends flexibility:
- Third parameter: Gamma represents minimum life before failures can occur
- Modified functions: Replace t with (t-gamma) in all formulas
- Threshold effect: No failures occur before time gamma
- Physical interpretation: Represents guaranteed minimum life
- Estimation challenge: Three parameters are harder to estimate reliably
Use the three-parameter Weibull when physical reasoning suggests a failure-free period.
Weibull Plotting
Graphical analysis reveals distribution fit and parameters:
- Weibull paper: Log-log plot linearizes Weibull CDF
- Y-axis: ln(ln(1/(1-F(t)))) or equivalent Weibull scale
- X-axis: ln(t) or logarithmic time scale
- Slope: Slope of fitted line equals beta
- Intercept: Eta read from time axis at F = 63.2%
Weibull plotting provides visual assessment of fit and quick parameter estimation.
The Lognormal Distribution
The lognormal distribution models failures resulting from multiplicative degradation processes and is particularly useful for semiconductor and fatigue applications.
Distribution Functions
Key functions for the lognormal distribution:
- PDF: f(t) = (1/(t*sigma*sqrt(2*pi))) * exp(-(ln(t)-mu)^2/(2*sigma^2))
- CDF: F(t) = Phi((ln(t)-mu)/sigma), where Phi is standard normal CDF
- Reliability: R(t) = 1 - Phi((ln(t)-mu)/sigma)
- Hazard: h(t) = f(t)/R(t); initially increases then decreases
- Parameters: mu (log-mean) and sigma (log-standard deviation)
The parameters describe the distribution of ln(t), which is normally distributed.
Key Properties
Distinctive characteristics of the lognormal distribution:
- Right skewed: Long tail toward longer times; mean exceeds median
- Median: t_50 = exp(mu); the median equals the geometric mean
- Mean: Mean = exp(mu + sigma^2/2)
- Mode: Mode = exp(mu - sigma^2)
- Hazard shape: Hazard function increases then decreases; not monotonic
The non-monotonic hazard distinguishes the lognormal from Weibull for wear-out applications.
Physical Basis
The lognormal arises from multiplicative processes:
- Degradation processes: Cumulative damage from many small multiplicative effects
- Fatigue: Crack growth governed by multiplicative increments
- Electromigration: Time to failure follows lognormal in many studies
- Dielectric breakdown: Oxide breakdown often fits lognormal distribution
- Central limit theorem: Product of many random variables tends toward lognormal
When failure results from accumulated multiplicative effects, lognormal is theoretically justified.
Comparison with Weibull
Understanding when to use lognormal versus Weibull:
- Hazard behavior: Weibull hazard is monotonic; lognormal hazard peaks and decreases
- Tail behavior: Lognormal has heavier right tail than most Weibull shapes
- Parameter interpretation: Weibull parameters more directly interpretable
- Extrapolation: Both can give very different predictions when extrapolating
- Mechanism basis: Choose based on physical failure mechanism when possible
For short extrapolations, Weibull and lognormal often fit data equally well; mechanism knowledge guides selection.
The Normal Distribution
The normal (Gaussian) distribution has limited direct application in reliability but appears in certain contexts and underpins many statistical methods.
Distribution Functions
Key functions for the normal distribution:
- PDF: f(t) = (1/(sigma*sqrt(2*pi))) * exp(-(t-mu)^2/(2*sigma^2))
- CDF: F(t) = Phi((t-mu)/sigma)
- Reliability: R(t) = 1 - Phi((t-mu)/sigma)
- Hazard: h(t) = f(t)/R(t); increases without bound
- Parameters: mu (mean) and sigma (standard deviation)
The symmetric bell shape is familiar but not typical of reliability applications.
Limitations for Reliability
The normal distribution has significant limitations:
- Negative values: Normal allows negative failure times, which are physically impossible
- Symmetric: Failure time distributions are typically right-skewed, not symmetric
- Unbounded hazard: Hazard increases without bound, which may not be realistic
- Limited flexibility: Fixed shape provides no flexibility unlike Weibull
- Approximation only: May approximate tight wear-out distributions but rarely best choice
Use normal distribution cautiously and only when justified by data or physical reasoning.
Valid Applications
Contexts where normal distribution applies:
- Wear-out with tight spread: When wear-out occurs in narrow time range relative to mean
- Parameter variation: Component parameter distributions often normal
- Degradation amount: Amount of degradation at fixed time may be normal
- Log-transformed data: If ln(t) is normal, t is lognormal
- Statistical inference: Underpins confidence intervals and hypothesis tests
Normal distribution remains important for statistical methods even when failure times are not normal.
Other Distributions
Additional distributions address specific reliability applications beyond the primary four.
Gamma Distribution
The gamma distribution generalizes the exponential:
- Parameters: Shape parameter k and rate parameter lambda (or scale theta = 1/lambda)
- Special cases: Exponential when k=1; Erlang when k is integer
- Sum of exponentials: Sum of k exponential times follows Erlang distribution
- Hazard: Increasing, constant, or decreasing depending on k
- Applications: Queuing models, system with stages of failure
The gamma distribution is mathematically related to the Weibull but has different properties.
Inverse Gaussian Distribution
Models first passage time to a threshold:
- Physical model: Time for degradation to reach failure threshold with drift and diffusion
- Parameters: Mean mu and shape lambda
- Hazard: Initially increases rapidly then approaches constant
- Applications: Degradation modeling, wear processes
- Relationship: Related to Wiener process with drift
Inverse Gaussian has theoretical basis for degradation-to-threshold failure models.
Birnbaum-Saunders Distribution
Another degradation-based distribution:
- Origin: Derived from fatigue crack growth under cyclic loading
- Parameters: Shape alpha and scale beta
- Relationship: Related to normal distribution through transformation
- Hazard: Initially increasing, eventually approaches constant
- Applications: Material fatigue, crack propagation
Birnbaum-Saunders provides physically-motivated alternative to lognormal for fatigue applications.
Mixed Distributions
Mixtures model populations with multiple failure modes:
- Concept: Overall distribution is weighted combination of component distributions
- Defective subpopulation: Mix of early-failure defectives and normal units
- Multiple mechanisms: Different failure modes with different distributions
- Parameter estimation: More complex; may require EM algorithm or numerical methods
- Identification: Curvature on Weibull plots may indicate mixed population
Mixed distributions capture real-world situations where multiple distinct failure populations exist.
Distribution Selection and Validation
Choosing the right distribution and validating the choice is critical for accurate reliability analysis.
Selection Criteria
Factors guiding distribution selection:
- Physical mechanism: Select distribution consistent with known failure physics
- Failure rate pattern: Choose distribution matching observed hazard behavior
- Data fit: Select distribution that best fits available failure data
- Extrapolation needs: Consider extrapolation behavior if predicting beyond data range
- Precedent: Use distributions proven for similar applications
Physical understanding should guide selection when possible; statistical fit alone can be misleading.
Goodness of Fit Tests
Statistical tests evaluate distribution fit:
- Chi-square test: Compare observed and expected frequencies in intervals
- Kolmogorov-Smirnov test: Maximum difference between empirical and theoretical CDFs
- Anderson-Darling test: Weighted K-S test emphasizing tail fit
- Correlation coefficient: Correlation on probability plot indicates fit quality
- Likelihood ratio: Compare likelihoods of competing distributions
Formal tests quantify fit quality but should be complemented by graphical assessment.
Graphical Methods
Visual assessment reveals fit characteristics:
- Probability plots: Data should follow straight line on appropriate probability paper
- Hazard plots: Plot estimated hazard against theoretical hazard function
- Q-Q plots: Quantile-quantile plots compare data and theoretical quantiles
- Residual plots: Examine residuals for systematic patterns
- Multiple distributions: Compare same data on different probability papers
Graphical methods reveal patterns and outliers that numerical tests may miss.
Sensitivity Analysis
Evaluate impact of distribution choice:
- Compare predictions: Calculate reliability metrics with different distributions
- Extrapolation sensitivity: Assess how far predictions diverge beyond data range
- Parameter uncertainty: Propagate parameter uncertainty through predictions
- Consequence assessment: Determine if distribution choice matters for decisions
- Conservative choice: When uncertain, choose distribution giving conservative predictions
If different reasonable distributions give similar predictions, distribution choice may not be critical.
Summary
Probability distributions provide the mathematical framework for describing how failure times are distributed across product populations. The exponential distribution with its constant failure rate offers simplicity and is appropriate for random failures during useful life. The Weibull distribution provides flexibility to model infant mortality, random failure, and wear-out behavior through its shape parameter. The lognormal distribution suits degradation and fatigue mechanisms arising from multiplicative processes. The normal distribution has limited direct reliability application but underpins many statistical methods.
Each distribution has characteristic properties that make it suitable for specific applications. Understanding these properties enables proper distribution selection based on physical failure mechanisms, data characteristics, and analysis requirements. The hazard function is particularly informative: constant hazard suggests exponential, decreasing hazard indicates infant mortality with Weibull beta less than one, and increasing hazard indicates wear-out with Weibull beta greater than one or potentially lognormal.
Distribution selection should be guided by physical understanding when possible, supported by statistical fit assessment. Graphical methods complement formal goodness-of-fit tests by revealing patterns and anomalies. Sensitivity analysis determines whether distribution choice significantly impacts conclusions. When properly selected and validated, probability distributions enable accurate reliability prediction, informed design decisions, and effective reliability demonstration testing.