Thanks for the response, Khebab -- this is basically what I was asking. Do points prior to 50 Gb ever get used in the estimation. I guess I am just curious to how long you have to wait to choose a "linear" portion of the profile. Also, how effective is this if the peak hasn't occurred yet?

For instance, what if you started estimating with the first two observations and then updating your estimates as new observations came in. You would get wildly varing answers. Then, eventually, you would have to decide where to quit using early points to capture the linear part of the profile. In the graph above, if the estimation was done using the points between 40 Gb and 70 Gb, we would have estimated the cumulative Gb to be around 85.

In the end, I think this is a decent way to model existing data, but may be poor in making predictions of any accuracy. I just think we have to be careful on how this is presented -- especially when addressing scientifically-minded non-believers.

"Do points prior to 50 Gb ever get used in the estimation"

Look at this mash-up of Khebab's SA data and the USA data from my post above. The data points show more fluctuation for SA below 50 Gb, but they both show that curious hyperbolic curvature indicated by the solid blue line.

As I said in the post, this has to do with the use of power-law discovery as opposed to the exponential-law; the latter gives a perfectly flat HL.

I usually discard the first points because low cumulative production values (Q) will boost small fluctuations in production (P). I usually take P/Q<10% as a cut-off value. Because of integration, the noise on P does not affect Q after a while and fluctuations in P are dampened as Q increases.