Application of the Dispersive Discovery Model
Posted by Khebab on November 27, 2007 - 11:00am
Topic: Supply/Production
Tags: Dispersive discovery model, lower 48, m. king hubbert, Shock model [list all tags]
This is a guest post by WebHubbleTelescope.
Sometimes I get a bit freaked out by the ferocity and doggedness of the so-called global warming deniers. The latest outpost for these contrarians, climateaudit.org, shows lots of activity, with much gnashing over numbers and statistics, with the end result that they get a science blog of the year award (a 1st place tie actually). Fortunately, the blog remains open to commenting from both sides of the GW argument, which if nothing else makes it a worthy candidate for some type of award. Even though I don't agree with the nitpicking approach of the auditors, they do provide peer review for strengthening one's arguments and theories. I can only hope that this post on oil depletion modeling attracts the same ferocity from the peak oil deniers out there. Unfortunately, we don't have a complementary "oil depletion audit" site organized yet (though Stuart and Khebab, et al, seem to be working on it--see yesterday's post), so we have to rely on the devil's advocates on TOD to attempt to rip my Application of the Dispersive Discovery Model to shreds. Not required, but see my previous post Finding Needles in a Haystack to prime your pump.
![]() |
A Premise In Regards To The Pyramid
I start with one potentially contentious statement: roughly summarized
as "big oil reserves are NOT necessarily found first".
I find this rather important in terms of the recent work that Khebab and Stuart
have posted. As Khebab notes "almost half of the
world production is coming from less than 3% of the total number of
oilfields". So the intriguing notion remains, when
do these big oil finds occur, and can our rather limited understanding
of the discovery dynamics provide the Black Swan moment1 that the PO denialists
hope for? For a moment, putting on the hat of a denier, one
could argue that we have no knowledge as to whether we have found all
the super-giants, and that a number of these potentially remain,
silently lurking in the shadows and just waiting to get discovered.
From the looks of it, the USGS has some statistical confidence that
these exist and can make a substantial contribution to future reserves.
Khebab has done interesting work in duplicating the USGS results
with the potential for large outliers -- occurring primarily from the
large variance in field sizes provided by the log-normal field size distribution
empirically observed.
But this goes against some of the arguments I have seen on TOD
which revolve around some intuitive notions and conventional wisdom of
always finding big things first. Much like the impossibility of
ignoring the elephant in the room, the logical person would infer that
of course we would find big things first. This argument has
perhaps tangential scientific principles behind it, mainly in
mathematical strategies for dealing with what physicists call
scattering cross-sections and the like. Scientifically based or not, I
think people basically latch on to this idea without much thought.
But I have still have problems with the conventional
contention, primarily in understanding what would cause us to uniformly
find big oil fields first. On the one hand, and in historic terms,
early oil prospectors had no way of seeing everything under the earth;
after all, you can only discover what you can see (another bit of
conventional wisdom). So this would imply as we probe deeper
and cast a wider net, we still have a significant chance of discovering
large oil deposits. After all, the mantle of the earth remains a rather
large volume.
On the the same hand, the data does not convincingly back up
the early discovery model. Khebab's comment section noted the work of Robelius.
Mr. Robelius dedicated graduate thesis work to tabulating the portion
of discoveries due to super-giants and it does in fact appear to skew
to earlier years than the overall discovery data. However, nothing
about the numbers of giant oil fields
found appears skewed about the peak as shown in Figure 2
below:

Figure 2: Robelius data w/ASPO total superimposed
![]() |
| Figure 3: Discovery data of unknown origins |
![]() |
As is typical of discovery data, I do spot some
inconsistencies in the chart as well. I superimposed a chart provide by Gail of total discoveries due to ASPO
on top of the Robelius data and it appears we have an inversion or two
(giants > total in the 1920's and 1930's). Another graph from
unknown origins (Figure 3) has the same 62% number
that Robelius quotes for big oil contribution. Note that the number of
giants before 1930 probably all gets lumped at 1930. It still
looks inconclusive whether a substantial number of giants occurred
earlier or whether we can attach any statistical significance to the
distribution.
The controversial "BOE" discovery data provided by Shell
offers up other supporting evidence for a more uniform distribution of
big finds. As one can see in Figure 4 due to some
clearly marked big discoveries in the spikes at 1970 and 1990, the
overall discovery ordering looks a bit more stationary.
Unfortunately, I have a feeling that the big finds marked come about
from unconventional sources. Thus, you begin to understand the
more-or-less truthful BOE="barrel of oil equivalent" in small lettering
on the y-axis (wink, wink). And I really don't understand what their
"Stochastic simulation" amounts to -- a simple moving average perhaps?
--- Shell Oil apparently doesn't have to disclose their methods (wink,
wink, wink).
Given the rather inconclusive evidence, I contend that I can
make a good conservative assumption that the size of discoveries
remains a stationary property of any oil discovery model. This has some
benefits in that the conservative nature will suppress the pessimistic
range of predictions, leading to a best-case estimate for the
future. Cornucopians say that we will still find big
reservoirs of oil somewhere. Pessimists say that historically we have
always found the big ones early.
In general, the premise assumes no bias in terms of when we
find big oil, in other words we have equal probability of finding a big
one at any one time.
Two Peas to the Pod
For my model of oil depletion I intentionally separate the Discovery
Model from the Production Model. This differs from the
unitarians who claim that a single equation, albeit a heuristic one
such as the Logistic, can effectively model the dynamics of
oil depletion. From my point-of-view, the discovery process remains
orthogonal to the subsequent extraction/production process, and that
the discovery dynamics acts as a completely independent stimulus to
drive the production model. I contend that the two convolved
together give us a complete picture of the global oil depletion process.
![]() |
As for the Production Model, I continue to stand by the Oil Shock Model as a valid pairing to the Dispersive Discovery model. The Shock Model will take as a forcing function basically any discovery data, including real data or, more importantly, a model of discovery. The latter allows us to make the critical step in using the Shock Model for predictive purposes. Without the extrapolated discovery data that a model will provide, the Shock Model peters out with an abrupt end to forcing data, which usually ends up at present time (with no reserve growth factor included).
As for the main premise behind the Shock Model, think in terms
of rates acting on volumes of found material. To 1st-order,
the depletion of a valuable commodity scales proportionately to the
amount of that commodity on hand. Because of the stages that oil goes
through as it starts from a fallow, just-discovered reservoir, one can
apply the Markov-rate law through each of the stages. The Oil
Shock Model essential acts as a 4th-order low
pass filter and removes much of the fluctuations introduced by a noisy
discovery process (see next section). The "Shock" portion
comes about from perturbations applied to the last stage of extraction,
which we can use to model instantaneous socio-political
events. I know the basic idea behind the Oil Shock Model has
at least some ancestry; take a look at "compartmental models" for
similar concepts, although I don't think anyone has seriously applied
it to fossil fuel production and nothing yet AFAIK in terms of the
"shock" portion (Khebab has since applied it to a hybrid model).
Dispersive Discovery and Noise
![]() |
The shape of the curve that Jerry found due to Hubbert has the characteristic of a cumulative dispersive swept region in which we remove the time dependent growth term, retaining the strictly linear mapping needed for the histogram, see the n=1 term in Figure 7 below:

Figure 7: Order n=1 gives the cumulative swept volume mapped linearly to time
For the solution, we get:
wheredD/dh = c * (1-exp(-k/h)*(1+k/h))
h denotes the cumulative depth.I did a quickie overlay with a scaled dispersive profile, which shows the same general shape (Figure 8).

Figure 8: Hubbert data mapping delta discoveries to cumulative drilled footage
The
k term has significance in terms of an
effective URR as I described in the dispersive
discovery model post. I eyeballed the scaling as k=0.7e9 and c=250, so I
get 175 instead of the 172 that Hubbert got.To expand in a bit more detail, the basic parts of the derivation that we can substantiate involve the L-bar calculation in the equations in Figure 9 below (originally from):

Figure 9: Derivation of the Dispersed Discovery Model
The key terms include lambda, which indicates cumulative footage, and the L-bar, which denotes an average cross-section for discovery for that particular cumulative footage. This represents Stage-1 of the calculation -- which I never verified with data before -- while the last lines labeled "Linear Growth" and "Parabolic Growth" provide examples of modeling the Stage-2 temporal evolution.
Since the results come out naturally in terms of cumulative discovery, it helps to integrate Hubbert's yearly discovery curves. So Figure 10 below shows the cumulative fit paired with the yearly (the former is an integral of the latter):
![]() |
![]() |
I did a least-squares fit to the curve that I eyeballed initially and the discovery asymptote increased from my estimated 175 to 177. I've found that generally accepted values for this USA discovery URR ranges up to 195 billion barrels in the 30 years since Hubbert published this data. This, in my opinion, indicates that the model has potential for good predictive power.
![]() |
![]() |
![]() |
Although a bit unwieldy, one can linearize the dispersive discovery curves, similar to what the TOD oil analysts do with Hubbert Linearization. In Figure 13, although it swings wildly initially, I can easily see the linear agreement, with a correlation coefficient very nearly one and a near zero extrapolated y-intercept. (note that the naive exponential that Hubbert used in Figure 11 for NG overshoots the fit to better match the asymptote but still falls short of the alternative model's asymptote, and which also fits the bulk of the data points much better)
![]() |
![]() |
Every bit of data tends to corroborate that the dispersive discovery model works quite effectively in both providing an understanding on how we actually make discoveries in a reserve growth fashion and in mathematically describing the real data.
So at a subjective level, you can see that the cumulative ultimately shows the model's strengths, both from the perspective of the generally good fit for a 2-parameter model (asymptotic value + cross section efficiency of discovery), but also in terms of the creeping reserve growth which does not flatten out as quickly as the exponential does. This slow apparent reserve growth matches empirical-reality remarkably well. In contrast, the quality of Hubbert's exponential fit appears way off when plotted in the cumulative discovery profile, only crossing at a few points and reaching an asymptote well before the dispersive model does.
But what also intrigued me is the origin of noise in the discovery data and how the effects of super fields would affect the model. You can see the noise in the cumulative plots from Hubbert above (see Figures 6 & 11 even though these also have a heavy histogram filter applied) and also particularly in the discovery charts from Laherrere in Figure 14 below.

Figure 14: Unfiltered discovery data from Laherrere
If you consider that the number of significant oil discoveries runs in the thousands according to The Pyramid (Figure 1), you would think that noise would abate substantially and the law of large numbers would start to take over. Alas, that does not happen and large fluctuations persist, primarily because of the large variance characteristic of a log-normal size distribution. See Khebab's post for some extra insight into how to apply the log-normal, and also for what I see as a fatal flaw in the USGS interpretation that the log-normal distribution necessarily leads to a huge uncertainty in cumulative discovery in the end. From everything I have experimented with, the fluctuations do average out in the cumulative sense, if you have a dispersive model underlying the analysis, of which the USGS unfortunately leave out.
The following pseudo-code maps out the Monte Carlo algorithm I
used to generate statistics (this uses the standard trick for
inverting
an exponential distribution and a more detailed one for
inverting the Erf() which results from the
cumulative Log-Normal distribution). This algorithm draws on
the initial premise that fluctuations in discovering is basically a
stationary process, and remains the same over the duration of
discovery.
![]() |
Basic algorithmic steps:1 for Count in 1..Num_Paths loop
Lambda (Count) := -Log (Rand);
end loop;
2 while H < Depth loop
H := H + 1.0;
Discovered := 0;
3 for Count in 1 .. Num_Paths loop
4 if H * Lambda(Count) < L0 then
5 LogN := exp(Sigma*Inv(Rand))/exp(Sigma*Sigma/2.0);
6 Discovered := Discovered + Lambda(Count) * LogN;
end if;
end loop;
7 -- Print H + Discovered/Depth or Cumulative Discoveries
end loop;
- Generate a dispersed set of paths that consist of random lengths normalized to a unitary mean.
- Start increasing the mean depth until we reach some
artificial experimental limit (much larger than L0).
- Sample each path within the set.
- Check if the scaled dispersed depth is less than the estimated maximum depth or volume for reservoirs, L0.
- Generate a log-normal size proportional to the
dimensionless dispersive variable Lambda
- Accumulate the discoveries per depth
- If you want to accumulate over all depths, you will get
something that looks like Figure 15.
The series of MC experiments in Figures 16-22 apply various size sampling distributions to the Dispersive Discovery Monte Carlo algorithm4. For both a uniform size distribution and exponential damped size distribution, the noise remains small for sample sets of 10,000 dispersive paths. However, by adding a log-normal size distribution with a large variance (log-sigma=3), the severe fluctuations become apparent for both the cumulative depth dynamics and particularly for the yearly discovery dynamics. This, I think, really explains why Laherrere and other oil-depletion analysts like to put the running average on the discovery profiles. I say, leave the noise in there, as i contend that it tells us a lot about the statistics of discovery.

Figure 16: Dispersive Discovery Model mapped into Hubbert-style cumulative efficiency. The Monte Carlo simulation in this case is only used to verify the closed-form solution as a uniform size distribution adds the minimal amount of noise, which is sample size limited only.

Figure 17: Dispersive Discovery Model with Log-Normal size distribution. This shows increased noise for the same sample size of N=10000.

Figure 18: Same as Fig. 17, using a different random number seed

Figure 19: Dispersive Discovery Model assuming uniform size distribution

Figure 20: Dispersive Discovery Model assuming log-normal size distribution

Figure 21: Dispersive Discovery Model assuming log-normal size distribution. Note that sample path size increased by a factor of 100 from Figure 20. This reduces the fluctuation noise considerably.

Figure 22: Dispersive Discovery Model assuming exponentially damped size distribution. The exponential has a much narrower variance than the log-normal.
I figure instead of filtering the data via moving averages, it might make more sense to combine discovery data from different sources and use that as a noise reduction/averaging technique. Ideally I would also like to use a cumulative but that suffers a bit from not having any pre-1900 discovery data.

Figure 23: Discovery Data plotted with minimal filtering

Figure 24: Discovery Data with a 3-year moving average
Application of the Dispersive
Discovery + Oil Shock Model to Global Production
In Figure 2, I overlaid a Dispersive Discovery fit to the data. In this section of the post, I explain the rational for the parameter selection and point out a flaw in my original assumption when I first tried to fit the Oil Shock Model a couple of years ago.
Jean Laherrere of ASPO France last year presented a paper entitled "Uncertainty on data and forecasts". A TOD commenter had pointed out the following figures from Pp.58 and 59:

Figure 25: World Crude Discovery Data

Figure 26: World Crude Discovery Data
I eventually put two and two together and realized that the NGL portion of the data really had little to do with typical crude oil discoveries; as finding oil only occasionally coincides with natural gas discoveries. Khebab has duly noted this as he always references the Shock Oil model with the caption "Crude Oil + NGL". Taking the hint, I refit the shock model to better represent the lower peak of crude-only production data. This essentially scales back the peak by about 10% as shown in the second figure above. I claim a mixture of ignorance and sloppy thinking for overlooking this rather clear error.
So I restarted with the assumption that the discoveries comprised only crude oil, and any NGL would come from separate natural gas discoveries. This meant that that I could use the same discovery model on discovery data, but needed to reduce the overcompensation on extraction rate to remove the "phantom" NGL production that crept into the oil shock production profile. This essentially will defer the peak because of the decreased extractive force on the discovered reserves.
I fit the discovery plot by Laherrere to the dispersive discovery model with a cumulative limit of 2800 GB and a cubic-quadratic rate of 0.01 (i.e n=6 for the power-law). This gives the blue line in Figure 27 below.

Figure 27: Discovery Data + Shock Model for World Crude
For the oil shock production model, I used {fallow,construction,maturation} rates of {0.167,0.125,0.1} to establish the stochastic latency between discovery and production. I tuned to match the shocks via the following extraction rate profile:

Figure 28: Shock profile associated with Fig.27
As a bottom-line, this estimate fits in between the original oil shock profile that I produced a couple of years ago and the more recent oil shock model that used a model of the perhaps more optimistic Shell discovery data from earlier this year. I now have confidence that the discovery data by Shell, which Khebab had crucially observed had the cryptic small print scale "boe" (i.e. barrels of oil equivalent), should probably better represent the total Crude Oil + NGL production profile. Thus, we have the following set of models that I alternately take blame for (the original mismatched model) and now dare to take credit for (the latter two).
Original Model(peak=2003)
< No NGL(peak=2008)
< Shell data of BOE(peak=2010)
I still find it endlessly fascinating how the peak position of the models do not show the huge sensitivity to changes that one would expect with the large differences in the underlying URR. When it comes down to it, shifts of a few years don't mean much in the greater scheme of things. However, how we conserve and transition on the backside will make all the difference in the world.
Production as Discovery?
In the comments section to the dispersive oil discovery model post, Khebab applied the equation to USA data. As the model should scale from global down to distinct regions, these kinds of analyses provide a good test to the validity of the model.
In particular, Khebab concentrated on the data near the peak position to ostensibly try to figure out the potential effects of reserve growth on reported discoveries. He generated a very interesting preliminary result which deserves careful consideration (if Khebab does not pursue this further, I definitely will). In any case, it definitely got me going to investigate data from some fresh perspectives. For one, I believe that the Dispersive Discovery model will prove useful for understanding reserve growth on individual reservoirs, as the uncertainty in explored volume plays in much the same way as it does on a larger scale. In fact I originally proposed a dispersion analysis on a much smaller scale (calling it Apparent Reserve Growth) before I applied it to USA and global discoveries.
As another example, after grinding away for awhile on the available USA production and discovery data, I noticed that over the larger range of USA discoveries, i.e. inferring from production back to 1859, the general profile for yearly discoveries would not affect the production profile that much on a semi-log plot. The shock model extraction model to first order shifts the discovery curve and broadens/scales the peak shape a bit -- something fairly well understood if you consider that the shock model acts like a phase-shifting IIR filter. So on a whim, and figuring that we may have a good empirical result, I tried fitting the USA production data to the dispersive discovery model, bypassing the shock model response.
I used the USA production data from EIA which extends back to 1859 and to the first recorded production out of Titusville, PA of 2000 barrels (see for historical time-line). I plotted this in Figure 29 on a semi-log plot to cover the substantial dynamic range in the data.

Figure 29: USA Production mapped as a pure Discovery Model
This curve used the n=6 equation, an initial t_0 of 1838, a value for k of 0.0000215 (in units of 1000 barrels to match EIA), and a Dd of 260 GB.
D(t) = kt6*(1-exp(-Dd/kt6))The peak appears right around 1971. I essentially set P(t) = dD(t)/dt as the model curve.
dD(t)/dt = 6kt5*(1-exp(-Dd/kt6)*(1+Dd/kt6))
![]() |
Stuart Staniford of TOD originally tried to fit the USA curve on a semi-log plot, and had some arguable success with a Gaussian fit. Over the dynamic range, it fit much better than a logistic, but unfortunately did not nail the peak position and didn't appear to predict future production. The gaussian also did not make much sense apart from some hand-wavy central limit theorem considerations.
Even before Staniford, King Hubbert gave the semi-log fit a try and perhaps mistakenly saw an exponential increase in production from a portion of the curve -- something that I would consider a coincidental flat part in the power-law growth curve.

Figure 31: World Crude Discovery Data
Conclusions
The Dispersive Discovery model shows promise at describing:- Oil and NG discoveries as a function of cumulative depth.
- Oil discoveries as a function of time through a power-law
growth term.
- Together with a Log-Normal size distribution, the
statistical fluctuations in discoveries. We can easily represent the
closed-form solution in terms of a Monte Carlo algorithm.
- Together with the Oil Shock Model, global crude oil production.
- Over a wide dynamic range, the overall production shape.
Look at USA production in historical terms for a good example.
- Reserve growth of individual reservoirs.
References
1 "The Black Swan: The Impact of the Highly Improbable" by Nassim Nicholas Taleb. The discovery of a black swan occurred in Australia, which no one had really explored up to that point. The idea that huge numbers of large oil reservoirs could still get discovered presents an optical illusion of sorts. The unlikely possibility of a huge new find hasn't as much to do with intuition, as to do with the fact that we have probed much of the potential volume. And the maximum number number of finds occur at the peak of the dispersively swept volume. So the possibility of finding a Black Swan becomes more and more remote after we explore everything on earth.
2 These same charts show up in an obscure Fishing Facts article dated 1976, where the magazine's editor decided to adapt the figures from a Senate committee hearing that Hubbert was invited to testify to.
Fig. 5 Average discoveries of crude oil per loot lor each 100 million feet of exploratory drilling in the U.S. 48 states and adjacent continental shelves. Adapted by Howard L. Baumann of Fishing Facts Magazine from Hubbert 1971 report to U.S. Senate Committee. "U.S. Energy Resources, A Review As Of 1972." Part I, A Background Paper prepared at the request of Henry M. Jackson, Chairman: Committee on Interior and Insular Affairs, United States Senate, June 1974.Like I said, this stuff is within arm's reach and has been, in fact, staring at us in the face for years.
Fig.6 Estimation of ultimate total crude oil production for the U.S. 48 states and adjacent continental shelves; by comparing actual discovery rates of crude oil per foot of exploratory drilling against the cumulative total footage of exploratory drilling. A comparison is also shown with the U.S. Geol. Survey (Zapp Hypothesis) estimate.
3 I found a few references which said "The United States has proved gas reserves estimated (as of January 2005) at about 192 trillion cubic feet (tcf)" and from NETL this:
U.S. natural gas produced to date (990 Tcf) and proved reserves currently being targeted by producers (183 Tcf) are just the tip of resources in place. Vast technically recoverable resources exist -- estimated at 1,400 trillion cubic feet -- yet most are currently too expensive to produce. This category includes deep gas, tight gas in low permeability sandstone formations, coal bed natural gas, and gas shales. In addition, methane hydrates represent enormous future potential, estimated at 200,000 trillion cubic feet.This together with the following reference indicate the current estimate of NG reserves lies between 1173 and 1190 TCF (Terra Cubic Foot = 1012 ft3).
How much Natural Gas is there? Depletion Risk and Supply Security Modelling
US NG Technically Recoverable Resources US NG Resources
(EIA, 1/1/2000, Trillion ft3) (NPC, 1/1/1999, Trillion ft3)
--------------------------------------- -----------------------------
Non associated undiscovered gas 247.71 Old fields 305
Inferred reserves 232.70 New fields 847
Unconventional gas recovery 369.59 Unconventional 428
Associated-dissolved gas 140.89
Alaskan gas 32.32 Alaskan gas (old fields) 32
Proved reserves 167.41 Proved reserves 167
Total Natural Gas 1190.62 Total Natural Gas 1779
4 I have an alternative MC algorithm here that takes a different approach and shortcuts a step.

















http://science.reddit.com/info/61jei/comments/
thanks for your support.
Maths vs Politics
I'll repeat a statement made a few times before. If you want to use maths to predict/understand oil production/discoveries etc. then you need to explicitly 'reverse out' the impact of political and economic decisions.
Its no good trying to model areas such as new discoveries if you haven't taken into account the times when countries and oil companies "couldn't be bothered" to search exhaustively since they already had more reserves than they knew what to do with.
Equally if there is a recession and share price is troubled, oil companies will reduce the cost base and limit their exploration.
And then there are the technology shocks which introduce step changes into the system.
Trying to match maths to this noisy data with all these effects still in place artificially limits what you can understand. We KNOW when the recessions were, when the shocks occurred, etc. - so its possible to make allowance for them. Model on the cleaned up data, then add back in the real world effects and you can generate a tool that you can use to predict the 'perfect' world performance, and then add in your expected or observed real world events as they happen - essentially playing 'what if' scenarios.
Frankly, I look a little askew at a model that claims to accurately model the real world data without the taking account of the real world noise. It doesn't pass the sniff test.
Is it really that important?
We are talking an approximation here.
Let's assume the real world shocks are roughly evenly spread out in time, then the noise stays there, but does not affect the overall conclusion that much.
Also, whether the shocks have a relative big enough scale to matter, remains to me at least unproven.
I think the model has a lot of potential and cleaning the real world data would also have it's consequences, just as WHT points out. There have already been other methods using various ways of trying to average out the noise, perhaps introducing additional skew to the data for future extrapolation.
Modelling it with noise intact at least starts with a different premise and the results are encouraging.
However, in an ideal scenario, I think your point about trying to achieve 'perfect world' scenario is a good one.
BTW, great work WHT. Even a person like me who's getting back to physics/maths after at 15+ hiatus could follow the main gist of the article much of the time. That is quite remarkable communication and explanation powers you have!
Well, if you take the global oil production curve and work out the offsets, translations and stretching necessary to turn that shape into a smooth bell curve unconstrained consumption theory predicts - you can readily see how the effects I describe can have a key impact on production data. Its not a big step to suggest the effect is equivalent or greater in other areas.
So yes, I'd say its important.
Your points may have validity which might be checked by doing a sensitivity analysis, but one needs to start with a model, and the simplier the better. With a model in hand, one can introduce factors as you mention, and see how the model results deviate from the data. You can't reject the model as insignificant when you don't know how it responds to the input you suggest.
My suggestion is rather the reverse.
Take out of the base data the effect of known factors, then model the now simpler data. No mathematical equation will match the discontinuous actions of politics and economics - but take them out and you have a chance.
Once you have the model, add the effects back in.
I understand your point. The step changes I think are very deterministic and happen at the last stage of the process, which is why I call them shocks. Everything else about the model is stochastic (one could argue about the power-law growth term, but everything has to have a driving force). So if I could divine what the extractive step changes are, preferably from some real world source, like corporate records or OPEC dictates, I certainly would be more satisfied. Apart from that, having a reasonable set of dimensional parameters that derive from some simple physical models helps to fill in the rest of the puzzle.
But surely you first need to know what will happen if politics is ignored? In this case the politics is a response to the inexorable mathematics of the situation, so you need to establish the mathematically "perfect" situation first.
The politics then changes the situation, but to understand what the politics will do, we need to first understand what happens without politics.
And don't forget that it's possible to go back to mature fields and use techniques like streamline simulation to figure out where to drill production and injection wells to recover more oil... Prof. Akhil Datta-Gupta of Texas A&M University predicts that using this method there are 200 billion barrels that are economically recoverable from mature fields in the Continental US alone. He's recently published a book which includes a CDROM with streamline simulation software (and perhaps source code...):
(from http://www.rigzone.com/store/product.asp?p_id=1842)
Streamline Simulation
Author: Akhil Datta-Gupta and Michael J. King
Format: Softcover
Pages: 394
ISBN: 978-1-55563-111-6
Publisher: Society of Petroleum Engineers
Year Published: 2007
Item Number: 100-1842
Availability: In Stock $149.00
Streamline Simulation: Theory and Practice provides a systematic exposition of current streamline simulation technology—its foundations, historical precedents, applications, field studies, and limitations. This textbook emphasizes the unique features of streamline technology that in many ways complement conventional finite-difference simulation. The book should appeal to a broad audience in petroleum engineering and hydrogeology; it has been designed for use by undergraduate and graduate students, current practitioners, educators, and researchers. Included in the book is a CD with a working streamline simulator and exercises to provide the reader with hands-on experience with the technology.
Contents: Introduction and Overview • Basic Governing Equations • Streamlines, Streamtubes, Streamfunctions, and Simulation • Applications: Field Studies and Case Histories • Transport Along Streamlines • Spatial Discretization and Material Balance • Timestepping and Transverse Fluxes • Streamline Tracing in Complex Cell Geometries • Advanced Topics: Fluid Flow in Fractured Reservoirs and Compositional Simulation • Streamline-Based History Matching and the Integration of Production Data
"...so you need to establish the mathematically "perfect" situation first."
“Poli” a Latin word meaning “many”.
Politics is the lubrication of the "maths" model.
Nothing happens w/o politics.
To take politics out is to say "why do we need algebra. We'll never use it."
garyp, This is a great argument you make.
Yet I think you forget that any lapses in exhaustive searches by well-fed companies are more than taken up by hungry newcomers on the scene. Where there is money to be made, people will contribute to the gold rush. And greed is the essential driving force, which has to first-order never been known to abate in the history of mankind. Even at the lowest point in the value of oil, it would still be worth a fortune for the fortunate prospector.
I also think I have taken into account noise and uncertainty by the dispersive model itself. And the oil shock model adds another level of uncertainty, up to the point that shocks are used to model the step changes you talk about.
So it is in fact a bottom-up stochastic model, but then perturbed by top-down deterministic considerations.
OK.
The word "noise" is being thrown around here.
Noise IS Chaos Theory.
Data Mining using Fractals and Power Laws.
The model, therefore, only mimics power law behavior.
"...we do know that, overall, it will fit within the mathematical pattern of the power law distribution.
The interesting thing is that this mathematical relationship is found in many other seemingly unrelated parts of our world. For example, internet use has been found to fit power law distributions. There is a small number of websites that attract an extremely large number of hits (e.g. Microsoft, Google, eBay). Next there is a medium number of websites with a medium level of hits and finally, literally many millions of sites, like my own, that only attract a few hits.
The number of species and how abundant each one is in a given area of land also fits a power law. There will be a few species that are very abundant, a medium sized number of medium abundance and many species that are not very abundant."
http://complexity.orcon.net.nz/powerlaw.html
In other words, we know the largest fields have been found.
There are no others that can be accessed.
http://www.cis.fiu.edu/events/lecture55.php
The link above to:
Data Mining using Fractals and Power Laws.
"This supports the old adage that money makes money and the rich grow richer, and the poor grow poorer. The longer the interactions continue and the more people who join in, the more striking will be the difference between rich and poor. This also links to the principle of Chaos Theory, that such systems are very dependent on initial conditions. A small advantage at the beginning is far more likely to result in a high ranking than a small advantage later on, when other agents have already gained significant advantages.
Same with oil fields.
Wow, you make applied math yummy.
Why this bunfight over global warming? The real crisis is peak oil and peak energy. We probably don't have enough fossil fuels left to meet the IPCC's low emisions scenario.
If we do have enough FF's to fry ourselves I will sleep easier, as we would avoid the train wreck we are about to hit in the next few years.
Let's stay on topic for dispersive discovery model. The PO/AGW link has been beaten to death in various drumbeat threads.
I take responsibility for placing the bait at the top of the post.
GW becomes real to 5 million folks in Atlanta,
wondering where their water comes from, in 70 days.
And there are four cities in CA ahead of Atlanta
in the "BCS Drought Bowl".
Actually looks like the weather pattern is looking a bit better for Atlanta now. Their a long way from getting out of drought status, but they have been getting some decent rains of late, with more in the forecast.
The latest front brought .66 inches.
Until they get hurricane-type totals, the drought
continues.
70 Days.
I have very little sympathy.
As someone who has lived with water restrictions for 20 years in Florida, I am stunned by the stupidity and selfishness of Georgia and Atlanta's public leadership. To know that there is a problem and willfully ignore it because of a faith based upon the availability of resources at some point in the future is, at best, reckless.
Further, a complete lack of growth planning in the Atlanta area is only making the problems worse. Seems that the "free market" cannot solve all problems.
Of course, Georgia's answer is to screw north Florida and Alabama so they can continue to water their lawns on a daily basis.
Oh, and prayer.
"...a complete lack of growth planning in the Atlanta area is only making the problems worse."
I've been saying that since 1980.
When I was traveling from Miami up to Atlanta and Jacksonville
to Pensacola.
I saw Atlanta destroying it's watershed.
There's really nothing left now. Except 5 million people.
Folks talk like Phoenix and LV are walking extinctions.
I think Atlanta's just as badnow.
I do feel for the entire Chattahootchie watershed.
http://www.noaanews.noaa.gov/stories2007/images/temperaturemap111507.jpg
Latest forecast. We've left the Holocene. We're now in the Eemian Interglacial.
Sincerely,
James
WHT - I really feel like a good argument - but unfortunately just don't have time as I'm trying to get some work finnished before the Century draws to a close. However, since you are inviting a debate, lets start here:
I think this statement is wrong and needs to be qualified with a geographic / geologic scale. If you look at at any basin, I'm pretty sure you'll see that the majority of the giants are front loaded. Certainly the case in the North Sea with Ekofisk, Statfjord, Brent, Forties et al., all discovered early in the cycle.
There is a reason for this, and that is large oil companies go out looking for large fields - elephants no less.
So if the statement you make is true for the Earth, that is because we have sequentially explored the World's basins - resulting in giant discoveries being spread throughout the 20th century. The point now being that we are fast running out of new basins to explore. So I think you will find that the vast majority of giants were found in the 20th Century - even though much of their oil will be produced in the 21st.
Figure 26 is a classic - and thought provoking for the peak now brigade - which I almost joined.
Let's say you are correct, and he is making a poor assumption.
Now let's say we have a doubter about your position.
You can thus say, "Alright, let's look at a production model that doesn't assume the big fields are nearly all discovered early."
Doesn't seem to change the basic extraction rate over time!
If it can be shown that a minimal set of assumptions, all of which can be backed by hard evidence and clear logic, lead to the same conclusion the general case is easier to understand.
(Though perhaps the rate of decline is slowed, which is important).
What then are the minimal assumptions that go into any model of rising extraction, a peak, then decline?
I think I tried to make the same point in the post. By not assuming that big oil discoveries are always found first, you get a very conservative and unbiased model, which if anything will defer discoveries to the future and thus provide a more optimistic estimate of decline. But even with this extra optimism, in the greater scheme of things, it doesn't help that much.
To interpret and answer the last question, I think the minimal assumptions are that the human search function is monotonically increasing in rate (i.e. people try harder and harder over time), a finite volume is searched, and that a dispersion in rates takes over to demonstrate the decline as the various searches overlap the extents of the finite volume.
That would be entirely correct, but fact remains that for the last 15 years, oil research investments have been very very low, for there were no economic value in such. Thus, we can say what you said, but because in a simple model, Oil price should rise steadily si