Fredric Messing
Computer Sciences Corporation
10000 Aerospace Road, Lanham-Seabrook, Maryland 20706, U.S.A.
Fax: 301-552-3272
E-Mail: Fred Messing/SSD/CSC@CSCmail.CSC.com
ABSTRACT. An analytical formulation is derived to predict the success of scheduling activities on discrete multiple resource time lines using sequential approaches. Success is defined in terms of the probability of scheduling a single activity and the number and cumulative duration of scheduled activities. The results are extended to include scheduling activities with flexible start times. The principal assumption is that the activity start times are randomly distributed over the available time in the time line.
1. INTRODUCTION
This analysis addresses a type of scheduling problem frequently referred to as activity scheduling. Each activity is assumed to use one of a set of equivalent resources. Each resource can be used to perform only one activity at any time. The activities are independent and have no predecessor relationships. Each activity has a specified duration and start time, although the possibility that the start time is flexible is also considered.
Scheduling becomes challenging when not all of the activities can be scheduled because of conflicting demand for the resources. The question then becomes: which activities get scheduled and at what times, and which do not get scheduled?
Ideally, an objective should be defined that can be used to identify the optimal schedule and select a scheduling approach that achieves or nearly achieves an optimal schedule as defined by that objective. A typical objective might consist of maximizing the sum of values assigned to each scheduled activity. If the value were the same for each activity, then maximizing the value is equivalent to maximizing the number of scheduled activities. If the value were proportional to the duration of the activity, then maximizing the value is equivalent to maximizing the total time scheduled.
Because optimally solving such problems is complex, most approaches do not attempt to achieve optimality directly and have resorted to a sequential scheduling approach (see Figure 1). A sequential scheduling approach typically begins by using heuristically determined metrics to order the activities by priority. It then considers each activity in priority order and attempts to find an available time for using one of the resources. Assuming such a time exists, a second heuristic approach determines the start time. If previously scheduled activities conflict with all possible start times, the activity is not scheduled. In either case, the next activity on the list is then considered for scheduling.
The principal method for evaluating scheduling approaches is to determine the extent to which the objective is met. Evaluation is typically accomplished by establishing benchmark problems and generating test schedules. The evaluation criteria generally include the fraction of activities and the fraction of activity time that gets scheduled. This paper provides an analytical technique for predicting scheduling success in these terms.
A closely related issue is the fraction of available resource time that gets scheduled. Providing resources is generally costly, and, before spending money to provide additional resource, managers want to be sure that the existing resources are being used efficiently to perform the specified activities.
The first part of this paper presents the derivation of the probability of a single additional activity being successfully scheduled on either a single or multiple resources. An iterative process is then defined to determine the overall probability of successfully scheduling any number of activities.
Next the benefits to improved scheduling success from start-time flexibility are included. Finally, the distribution of gaps remaining in the time line is discussed.
2. SINGLE-ACTIVITY SCHEDULING SUCCESS
Scheduling success probability is derived by considering an attempt to schedule a single additional activity on a resource time line that contains a number of activities already scheduled. Given a single, discrete, resource time line (see Figure 2) with n randomly scheduled activities at start times and with durations
(1)
such that
(2)
consider a new activity with duration
and random start time
. The new activity will not be scheduled successfully
if its start time conflicts with any previously scheduled activity
or if a previously
scheduled activity has a start time that conflicts with the new
activity
.
Since
is uncorrelated with any previously scheduled activity,
the probability that it will not conflict with a
previously scheduled activity is given by the fraction of the
time line remaining unscheduled
(3)
where
= the length of the time line
and the remaining time is given by
(4)
The probability that no previously scheduled activity has a start
time that conflicts with the new activity is determined by considering
a compressed time line of length
(see Figure 3), generated
by removing the scheduled activity durations from the available
time line. The previously scheduled activities appear with zero
duration randomly distributed throughout the time line. For each
of the n previously scheduled activities, the probability
that start time
will not
conflict with the new activity
is given by
(5)
under the assumption that
and
(6)
to avoid any effects from the ends of the time line. Combining
the probability that
is not in conflict with any previously
scheduled activity, with the probability that none of the
are in conflict with the new activity, results in the probability
that the new activity can be scheduled
(7)
In the limit that n becomes large while
remains fixed
(8)
where
(9)
(10)
If each activity can be scheduled on any of m equivalent
unconstrained resources, then
becomes the probability
that the new activity can be scheduled on each of the m
resources. The probability of successfully scheduling an additional
activity on any of the m resources is then
(11)
Figure 4 shows the probability of successfully scheduling a new
activity of average duration as a function of the already scheduled
load. For small values of L, the probability decreases
as
. For larger values of L, the exponential
causes the probability to fall more rapidly. For a single resource,
by the time L has reached 30 percent, the scheduling success
for a new activity has fallen to 46 percent. Four equivalent resources
are required to keep the single-activity scheduling success above
50 percent for 50 percent loading.
3. INTEGRATED SCHEDULING SUCCESS
Scheduling success integrated over all activities is determined
by applying
iteratively. Consider N activities
with random start times to be scheduled on m equivalent
resources. The first m activities can each be scheduled
successfully without conflict, one activity on each resource. Therefore, the number of activities and the
scheduled load are
(12)
(13)
For activities j=m+1,N, the steps are
1. Compute the probability
to schedule activity j,
given n(j-1) previously scheduled activities out of j-1
attempts
2. Update the number of activities scheduled out of
j attempts
n(j)=n(j-1)+
3. Compute the scheduled load
(14)
The average scheduling success can then be computed as the ratio of the number of scheduled activities to the number of attempted activities
(15)
or as the ratio of the scheduled time to the attempted time
(16)
If all of the activities are of the same duration, then the two measures are identical. Figure 5 illustrates the integrated scheduling success when all activities are of the same duration. If the demand is for 50 percent of the available time on 4 resources, 90 percent of the activities will be scheduled successfully. With 2 resources, 79 percent will be scheduled successfully. If the demand is for 70 percent of the resources, the success for 4 resources drops to 79 percent and it drops to 69 percent for 2 resources.
The activity start times for some scheduling problems are not
fixed. Rather, they have flexibility
such that they can be scheduled
to start at any time between
and
+
.
The benefit of the start-time
flexibility can be determined by considering an activity (see
Figure 6) with start time
that conflicts with a previously scheduled
activity of duration
, scheduled to start at time
. If
,
then the conflict with activity i can be resolved by adjusting
the new activity start time to
. The probability that this resolution is possible
is given by
(17)
for
(18)
For larger values of
, the probability of resolving the conflict
is 100 percent. The average probability for resolving a conflict
with the new event start time is given by the product of the probability
that the new activity conflicts with the previously scheduled
activity i, multiplied by the probability that a conflict with
activity i can be resolved, summed over all previously scheduled
activities
(19)
where f is the normalized flexibility
(20)
The probability of successfully scheduling the new activity is
determined by multiplying Lf by the probability that another
activity will have been scheduled in conflict with the adjusted
activity and by adding the result to the previously determined
value of
(21)
Figure 7 shows the probability for successfully scheduling a new activity of flexibility f = 1 of average duration as a function of the already scheduled load. Because of the flexibility, this data shows significant improvement in scheduling success when compared with Figure 4. For a single resource scheduled at 30 percent of available time, the scheduling success for a new activity increases from 46 percent to 65 percent. With 4 equivalent resources, 50 percent scheduling success can be maintained up to a demand of 65 percent of available resources.
The improvement in integrated scheduling success is illustrated in Figure 8. Increasing flexibility from f= 0 to f = 0.5 increases the scheduling success by approximately 5 percent for high levels of demand. Increasing flexibility to f = 1 increases scheduling success by an additional 5 percent.
As previously indicated, the exponential term in
dominates
the linear term for larger values of scheduled load L.
The impact of the exponential term can be partially reduced by
first scheduling the larger duration activities and then scheduling
the shorter duration activities.
Figure 9 illustrates scheduling success when the demand above 40 percent is divided into twice the number of activities, each activity of half the duration of the activities below 40 percent. This success is compared to scheduling success when all activities have the same duration. With the reduced-duration activities, scheduling success is increased by approximately 5 percent. In practical applications, durations of unschedulable activities can be reduced to improve scheduling success.
6. HIGHLY FLEXIBLE START TIMES
As the flexibility of start times increases, the probability of
successfully scheduling an activity increases. For values of
,
conflicts of the new activity start time
with a previously scheduled
activity can always be resolved (see Figure 6) by delaying the
new activity to start at the end of the conflicting activity.
The linear term in the scheduling probability is eliminated, and
the single resource success probability becomes
(22)
This probability is actually a lower bound. The new activity is
also schedulable if the start time
of a previously scheduled
activity conflicts with the new activity and if
.
For values of
(for an average-duration
activity
, this value corresponds approximately
to
), the success probability increases further,
as illustrated in Figure 10.
If the length
of the first
gap following the start time
of the new activity is less than
the duration of the new activity,
, then the new
activity will not be schedulable in that gap. This probability
is given by
(23)
If the new activity is not schedulable in the first gap, then its start time can be delayed to the end of the next previously scheduled activity. The probability is identical that the gap following this activity is also too small in which to schedule the new activity. Consequently, the probability that the new activity will be schedulable in one of these two gaps is given by
(24)
As
increases further, scheduling success continues to increase.
Figure 11 illustrates the significant gain in scheduling success
that accrues from this added flexibility for an average-duration
activity. It also illustrates that, even with this amount of flexibility,
it is difficult to successfully schedule beyond a demand of 60
percent of the resources.
This conclusion depends on the assumption that scheduled activities are placed randomly within their schedulable start-time flexibility. Techniques exist for selecting start times to optimize resource use. These techniques effectively reduce the size of small gaps and increase the size of large gaps, thereby improving scheduling success.
The difficulty in scheduling more than 60 percent of the resources results from both the time line being heavily scheduled and the remaining gaps being too small for additional activities to be scheduled. This problem can be understood by determining the distribution of gap sizes.
The probability for an individual gap to have a length greater
than value g can be determined by selecting the end,
,
of any scheduled activity and by considering the probability that
no other activity is scheduled within gap g from this point. This
probability is precisely the same probability derived earlier
for scheduling an activity with a non-conflicting start time
(25)
The distribution of gap sizes (see Figure 12) is given by the probability of finding a gap between g and g+dg
(26)
As the resource becomes more heavily scheduled, the remaining gaps become significantly smaller than the average activity duration, making them unusable for scheduling average-duration activities. The amount of time T(g) remaining in gaps larger than g is given by
(27)
(28)
The ratios of T(g) to
and
are given
by
(29)
and
(30)
which, in the case g=<d>, go to
(31)
(32)
Figure 13 illustrates the fraction of the time line remaining in gaps larger than the average activity duration as a function of scheduled load L. For example, when 50 percent of the time line has been scheduled, only 37 percent of it consists of gaps larger than an average-duration activity. When L reaches 70 percent, only 10 percent of the time line consists of gaps larger than an average-duration activity. The remaining 20 percent is in gaps that cannot be used to schedule activities of average or larger duration. The time in these gaps can be recovered by adjusting activity start times to reduce the size of small gaps and increase the size of large gaps.
8. CONCLUSION
This paper provides formulae to compute success for scheduling individual activities with start-time and duration flexibility on single or multiple resources. An iterative technique is presented for determining scheduling success integrated over all schedulable activities. It demonstrates the significant increase in scheduling success that can be achieved when scheduling flexible activities. It also provides a distribution of sizes of gaps remaining in the time line and demonstrates the dramatic decrease in time remaining in large gaps as scheduled time increases.
This analytical formulation can be used to calculate realistic estimates of scheduling success without actually developing schedules. Such estimates can be used for capacity planning or predicting scheduling success for varying combinations of activities and resources.