IPv4 Address Space Utilization
NOTE: The text on this page is still getting developed
and may be a bit hard-to-follow in some places.
That should, hopefully, be corrected shortly.
Additional text and graphs about the individual address classes
will also be added.
Feel free to send questions or suggestions to the email
address on the bottom of this page.
IPv4 Address Space Utilization
This page summarizes the results of a statistical analysis
on assignment of IPv4 addresses over the last several years
as reported by the
American Registry for Internet Numbers.
The viewpoint of this analaysis is from the perspective of ARIN who
administers blocks of IPv4 addresses to other registries,
Internet Service Providers (ISPs), etc.
The address/mask pairs are listed along with a date when the block
of addresses was directed from ARIN. There's no attempt to determine
what proportion of addresses within a block are currently in use
(see BGP System usage in Related Inforation):
the goal is to determine in what sort of time frame ARIN becomes
affected by limitations of the 32-bit address space.
The address dump used includes overlapping address ranges, usually
with one address block being a subset of another. This can be
interpreted as a second-order delegation of addresses within the
larger block or as the larger block having been surrendered by
its original owner at some point in the past and the smaller blocks
representing the redeployment of the addresses to other owners.
Since the dump does not indicate which scenerio affects any given
address region, I am assuming the former in all cases. This would
have the effect of overestimating the historic values, reducing the
overall growth rate towards the current levels.
The math
I'm fitting the data into a logistic distribution,
a popular model for measuring life-and-death processes.
The formula is of the form
Pt = C / (1 + e A*t + B)
where Pt is the population size at time `t'
and A, B and C are the coefficients which are estimated
using non-linear regression. The C parameter also
indicates where the population reaches a steady-state.
The raw population numbers were scaled down into percentages
of the total available numbering space to make the values
easier to relate to. The model, however,
does not enforce a hard limit at 100% since what
I'm trying to gauge is the total demand for IPv4 address
space if the supply were infinite.
One point that must be emphasized is that these are
trends and not
predictions.
Past performance does not guarantee future results.
Before extrapolating these lines out into the future,
one must recognize that these curves are
estimated by looking at historic data and
are in fact only modeling IPv4 address usage within the same
areas that they have traditionally been associated.
The implicit assumption is that there is not some form of
paradigm shift that either dramatically increases or eliminates
the demand for new addresses in the near future.
One aspect of the current paradigm that bears special
consideration are the policies that ISPs
are following in reguard to delegating
portions of IPv4 address space to their subscribers.
If they were to become less concerned about
conserving IPv4 address space,
there would likely be greater demand in the short-term
and the curves would be raised proportionally.
The pictures
The "Address Space Delegation" graph represents the best fit
against the ARIN's report for address space usage.
As of 2 September 1998, ARIN reports that 42.97% of
the IPv4 address space as having been at least delegated to other
providers.
The estimated curve suggests that the total demand levels
off at just over 43.79% of the address space.
This estimate needs to be qualified, however, since the
model has been underestimating the actual data for
the last several months now
As more data becomes available over the next few months
and exerts a greater influence upon the rest of the distribution,
the trend line will be pulled upwards.
This "Variance" graph show the 95% confidence interval
on the estimate of the limit that the delegation line approaches.
This is to determine how consistant the estimate
for the size of the resulting population
­ the `C' parameter of the model ­
has been as new data becomes available.
This illustrates the qualification in the previous paragraph:
the last point is the confidence interval on the estimate
based on the currently available data (43.79 +/- 0.90%);
the previous points in the graph
are the confidence intervals on the regressions
when I drop the data after the corresponding date.
By way of example, I drop all of the network numbers that are
listed as having been delegated between 1 August and 2 September
then run the regressions again to reach the value at the next-to-last
point.
If that is an accurate representation on what the state of the
world was on 31 Jul 1998, the estimate would have been
43.58 (+/-0.90)% of the total address space.
One will note that the estimates for maximium address space
utilization have risen about 8% over the last 3 years.
If one were to argue for extrapolating this over time as well,
the resulting statement would be that
"in the year 2019, the trend line will suggest that we will
eventually run out of IPv4 addresses".
So what happened...
to the predictions trend lines that suggested
that we were running out of IPv4 addresses?
Some possible factors, in no particular order:
- The older trend lines were simply fitted to the curve too soon;
that is, before the underlying curve had started to flatten out.
When that happens, the data that's available at that time shows
only an increasing rate of change and more closely resembles an
exponential distribution.
The real leveling-off point can't be reliably estimated until the
rate of change has started to drop.
So what caused the rate of change to start to drop off?
This could have been either a natural effect
('there never was a problem')
or could have been a result of ARIN's and the ISP's
more stringent address allocation policies
('there was a problem but we saw it in time and addressed it',
so to speak). I like to believe the latter.
- In the "new technology" arena: it's possible that the use of
Network Address Translators (NAT) are masking the existance
of private internets
(ref: RFC-1918)
that would have otherwise been taking up a greater portion
of the public IPv4 address space.
The difficulty, however, is that there is no way of ascertaining
with any degree of certainty how much of an impact that this has had
since the addresses that are on front end of NAT boxes look exactly
like the rest. The number of addresses that the block truly
represents can't be determined without cooperation of those nodes.
It's difficult to believe that this has made a significant
contribution up to this point, though, since NAT boxes seem to be
relatively recent technology and not likely to have been widely
deployed yet. The time that the data starts to flatten out
seems to predate the availability of these devices.
- The estimates on IPv4 address space utilization
in the past don't match up with the levels reported by
InterNIC when it was acting in the capacity currently served by ARIN.
By way of example: the current dump
suggests that 48.25% of the 128/16 ("class B") space was delegated
in August 1993. At that time, their reports indicated that 56.40%
was delegated: only slightly less than the level that is currently
being reported (58.23%).
Their data reporting methodology has changed considerably between
then and now; it's possible that there may be bugs in either the old
or the new approaches.
Related information
Frank T Solensky
<solensky@TopLayer.com>
Last updated: 12 March 1999