|
small sample statistics
greetings,
i made the following measurements:
4.10, 3.98, 4.86, 4.39, 4.57, 4.44
i am being asked "what is the error." thery want to know what the +/- is on my numbers. i have averaged them (4.39) and said +/- 2sd. i am tracking changes - so i will make these same measurements under a different condition and want to know if they are statistically different. but at the new condition i will only have two values.
how can i answer this? and does it make a difference if i discard the first two results (procedural error).
thanks.
eng-tips forums is member supported.
what "error" are you trying to define: measurement error, material variability, process variability, ?? if you have 2 sets of data and want to determine whether the data sets are from the same population or are statistically different, then you can use a t-test. however, with this few of data points in each set (4 and 2), there will have to be a very large difference in the data set means for the difference to be significant. futher, the estimate of the population standard deviation with only 4 or 6 measurements is is not going to be very good, so to quote an "error" as +/-nsd is not likely to be useful.
calculate an average of all measurements (=4,39). then calculate (xi-xavg) for each measurement, sum the differences(-0,29;-0,41;0,47;0;0,18;0,05) and divide by the number of masurements(=6) to get 0,233.this is called (according to terminology in my books)"the average absolute error of measurement".divide it by the average(4,39)and multiply by 100 and this is relative percent error of measurement.
there is another way, namelly to estimate the inerval where the average will be found with a certain probability:calculate the average(=4,39) and stdev(=0,3187),
number of degrees of freedom n-1 (=5) and from tables find out the value of student`s t for double sided alfa=0,01(=3,36)and calculate the term t*stdev/sqrt(n-1)(=3,36*0,3187/2,36)=0,478.
so your average value will be 98% likely in the range 4,39-0,478...4,39+0,478.
you can find the student`s t in excel:
choose confidence limits cl[%]
calculate alfa=(100-cl)/100
call tinv(alfa, n-1)
m777182
thanks m777182. that looks good.
for swc.... here's the story. i have a single value at many (~50) conditions. i've been asked to repeat one condition ~10 times to "determine the scatter." soon i will have 12 values at condition x. the confidence limits and student't t-test is where i was leaning.
it is not practical to test each condition 10 times. but in this case it is reasonable to assume the multiple tests at condition x will be representative of all tests(literature confirms this).
the "scatter" you determine at condition x can be assumed to represent the scatter at the other 49 condtions, provided that the test measurement error is not a function of the conditions and the variability of the process is not a function of the conditions. both are assumptions that you have to validate based on your knowledge of the measurement method and the process that you are measuring.
i think that your problem calls for an approach that is known as experimental design. it will lead you to the best solution with the least expencive(=time consuming) sequence of steps and with the hihgest statistical confidence.
m777182 |
|