1*4d495c6eSApple OSS Distributions#!/usr/bin/env python3 2*4d495c6eSApple OSS Distributions 3*4d495c6eSApple OSS Distributionsfrom fractions import Fraction 4*4d495c6eSApple OSS Distributionsfrom math import ceil 5*4d495c6eSApple OSS Distributionsfrom math import comb 6*4d495c6eSApple OSS Distributions 7*4d495c6eSApple OSS Distributions 8*4d495c6eSApple OSS Distributions# The inverse of 2, i.e. 2^-1. To be used as a base in exponentiations 9*4d495c6eSApple OSS Distributions# representing probabilities. 10*4d495c6eSApple OSS DistributionsINV2 = Fraction(1, 2) 11*4d495c6eSApple OSS Distributions 12*4d495c6eSApple OSS Distributions 13*4d495c6eSApple OSS Distributions# The probability of a false positive health test failure expressed as 14*4d495c6eSApple OSS Distributions# the negative logarithm of the *actual* probability. In simpler 15*4d495c6eSApple OSS Distributions# terms, the actual probability is: 16*4d495c6eSApple OSS Distributions# 17*4d495c6eSApple OSS Distributions# INV2 ** A 18*4d495c6eSApple OSS Distributions# 19*4d495c6eSApple OSS Distributions# It is simpler to keep this representation when computing the bound 20*4d495c6eSApple OSS Distributions# of the Repetition Count Test (below). 21*4d495c6eSApple OSS DistributionsA = 40 22*4d495c6eSApple OSS Distributions 23*4d495c6eSApple OSS Distributions 24*4d495c6eSApple OSS Distributions# The estimated min-entropy per sample in bits. Min-entropy is the 25*4d495c6eSApple OSS Distributions# negative logarithm of the probability of the *most likely* outcome. 26*4d495c6eSApple OSS Distributions# 27*4d495c6eSApple OSS Distributions# We consider this estimate to be conservative. 28*4d495c6eSApple OSS DistributionsH = 1 29*4d495c6eSApple OSS Distributions 30*4d495c6eSApple OSS Distributions 31*4d495c6eSApple OSS Distributions# The probability of the most likely outcome occurring in a given 32*4d495c6eSApple OSS Distributions# sample. This derives from the definition of min-entropy (see above). 33*4d495c6eSApple OSS DistributionsP = INV2 ** H 34*4d495c6eSApple OSS Distributions 35*4d495c6eSApple OSS Distributions 36*4d495c6eSApple OSS Distributions# 4.4.1 Repetition Count Test 37*4d495c6eSApple OSS Distributions# 38*4d495c6eSApple OSS Distributions# The Repetition Count Test (RCT) detects catastrophic failures in the 39*4d495c6eSApple OSS Distributions# noise source when it becomes "stuck" generating a single value over 40*4d495c6eSApple OSS Distributions# many consecutive samples. 41*4d495c6eSApple OSS Distributions# 42*4d495c6eSApple OSS Distributions# The probability of generating C consecutive identical samples is: 43*4d495c6eSApple OSS Distributions# 44*4d495c6eSApple OSS Distributions# P^(C-1) 45*4d495c6eSApple OSS Distributions# 46*4d495c6eSApple OSS Distributions# Or equivalently: 47*4d495c6eSApple OSS Distributions# 48*4d495c6eSApple OSS Distributions# 2^(-H * (C-1)) 49*4d495c6eSApple OSS Distributions# 50*4d495c6eSApple OSS Distributions# To keep this under our rate of acceptable false positives, we need 51*4d495c6eSApple OSS Distributions# to satisfy this inequality: 52*4d495c6eSApple OSS Distributions# 53*4d495c6eSApple OSS Distributions# 2^-A >= 2^(-H * (C-1)) 54*4d495c6eSApple OSS Distributions# 55*4d495c6eSApple OSS Distributions# Taking the logarithm of both sides, we have: 56*4d495c6eSApple OSS Distributions# 57*4d495c6eSApple OSS Distributions# -A >= -H * (C-1) 58*4d495c6eSApple OSS Distributions# 59*4d495c6eSApple OSS Distributions# Solving for C, we have: 60*4d495c6eSApple OSS Distributions# 61*4d495c6eSApple OSS Distributions# (A / H) + 1 >= C 62*4d495c6eSApple OSS Distributionsdef repetition_count_bound(): 63*4d495c6eSApple OSS Distributions return 1 + ceil(Fraction(A, H)) 64*4d495c6eSApple OSS Distributions 65*4d495c6eSApple OSS Distributions 66*4d495c6eSApple OSS Distributions# 4.4.2 Adaptive Proportion Test 67*4d495c6eSApple OSS Distributions# 68*4d495c6eSApple OSS Distributions# The Adaptive Proportion Test (APT) tries to detect more subtle noise 69*4d495c6eSApple OSS Distributions# source failures causing certain values to occur with unexpected 70*4d495c6eSApple OSS Distributions# frequency. It does this by taking a sample from the noise source and 71*4d495c6eSApple OSS Distributions# counting how many times the same sample occurs within a fixed-size 72*4d495c6eSApple OSS Distributions# window. 73*4d495c6eSApple OSS Distributions 74*4d495c6eSApple OSS Distributions 75*4d495c6eSApple OSS Distributions# The size of the window for non-binary alphabets for the APT. 76*4d495c6eSApple OSS DistributionsW = 512 77*4d495c6eSApple OSS Distributions 78*4d495c6eSApple OSS Distributions 79*4d495c6eSApple OSS Distributions# The probability mass function measuring the probability of exactly k 80*4d495c6eSApple OSS Distributions# occurrences of a given value within the observation window of size 81*4d495c6eSApple OSS Distributions# W. We use the probability of the most likely event (as above). 82*4d495c6eSApple OSS Distributions# 83*4d495c6eSApple OSS Distributions# There are three terms: 84*4d495c6eSApple OSS Distributions# 85*4d495c6eSApple OSS Distributions# 1. The binomial coefficient of k, i.e. W-choose-k. Simply, how many 86*4d495c6eSApple OSS Distributions# ways are there to get exactly k outcomes given W chances. 87*4d495c6eSApple OSS Distributions# 88*4d495c6eSApple OSS Distributions# 2. The probability of each of those k events occurring. 89*4d495c6eSApple OSS Distributions# 90*4d495c6eSApple OSS Distributions# 3. The probability that the other W-k events have some other 91*4d495c6eSApple OSS Distributions# outcome. 92*4d495c6eSApple OSS Distributionsdef pmf(k): 93*4d495c6eSApple OSS Distributions return comb(W, k) * P**k * (1 - P)**(W-k) 94*4d495c6eSApple OSS Distributions 95*4d495c6eSApple OSS Distributions 96*4d495c6eSApple OSS Distributions# The sum of probabilties of all possible counts of occurrences is 1. 97*4d495c6eSApple OSS Distributionsassert sum(map(pmf, range(W+1))) == 1 98*4d495c6eSApple OSS Distributions 99*4d495c6eSApple OSS Distributions 100*4d495c6eSApple OSS Distributions# We want to find the minimal count of occurrences such that the 101*4d495c6eSApple OSS Distributions# cumulative probability of seeing *at least* that count of 102*4d495c6eSApple OSS Distributions# occurrences (but possibly more) is no more than our false 103*4d495c6eSApple OSS Distributions# positive threshold. 104*4d495c6eSApple OSS Distributionsdef adaptive_proportion_bound(): 105*4d495c6eSApple OSS Distributions # The list of probabilities for each of the possible counts of 106*4d495c6eSApple OSS Distributions # occurrences. 107*4d495c6eSApple OSS Distributions probs = [pmf(x) for x in range(W+1)] 108*4d495c6eSApple OSS Distributions 109*4d495c6eSApple OSS Distributions # The list of cumulative distributions for each of the possible 110*4d495c6eSApple OSS Distributions # counts of occurrences. 111*4d495c6eSApple OSS Distributions # 112*4d495c6eSApple OSS Distributions # Whereas probs is a list of probabilities of *exactly* k 113*4d495c6eSApple OSS Distributions # occurrences, this is a list of probabilities of *k or more* 114*4d495c6eSApple OSS Distributions # occurrences. 115*4d495c6eSApple OSS Distributions # 116*4d495c6eSApple OSS Distributions # These are just sums of probabilities across a range of counts. 117*4d495c6eSApple OSS Distributions dists = [sum(probs[x:]) for x in range(W+1)] 118*4d495c6eSApple OSS Distributions 119*4d495c6eSApple OSS Distributions # Because we have constructed dists as an ordered list of 120*4d495c6eSApple OSS Distributions # cumulative probabilities, we can simply return the index of the 121*4d495c6eSApple OSS Distributions # first value that is below our threshold. 122*4d495c6eSApple OSS Distributions for i, d in enumerate(dists): 123*4d495c6eSApple OSS Distributions if d <= INV2**A: 124*4d495c6eSApple OSS Distributions return i 125*4d495c6eSApple OSS Distributions 126*4d495c6eSApple OSS Distributions 127*4d495c6eSApple OSS Distributionsdef main(): 128*4d495c6eSApple OSS Distributions print('Estimated min-entropy:', H) 129*4d495c6eSApple OSS Distributions print('False positive rate: 2^-{}'.format(A)) 130*4d495c6eSApple OSS Distributions print('Repetition Count Test bound:', repetition_count_bound()) 131*4d495c6eSApple OSS Distributions print('Adaptive Proportion Test bound:', adaptive_proportion_bound()) 132*4d495c6eSApple OSS Distributions 133*4d495c6eSApple OSS Distributions 134*4d495c6eSApple OSS Distributionsif __name__ == '__main__': 135*4d495c6eSApple OSS Distributions main() 136