OBABILISTIC AWARD STRATEGY FOR CONTRACT NET

PROTOCOL IN MASSIVELY MULTI-AGENT SYSTEMS

Toshiharu Sugawara

∗

Computer Science and Engineering, Waseda University, Tokyo 169-8555, Japan

Toshio Hirotsu

Computer and Information Sciences, Hosei University, Tokyo 184-8584, Japan

Kensuke Fukuda

National Institute of Informatics, Tokyo 101-8430, Japan

Keywords:

Task and resource allocation, Load-balancing, Massively multiagent systems, Contract net protocol.

Abstract:

We propose a probabilistic award selection strategy for a contract net protocol (CNP) in massively multi-agent

systems (MMASs) for effective task allocations. Recent Internet and sensor network applications require so-

phisticated multi-agent system technologies to enable the large amounts of software and computing resources

to be effectively used. Improving the overall performance of MMASs in which thousands of agents work

concurrently requires a new negotiation strategy for appropriately allocating tasks to agents. Our proposed

method probabilistically selects the awardee in CNP based on the statistical difference between bid values for

subtasks that have different costs. We explain how our proposed method can signiﬁcantly improve the overall

performance of MMASs.

1 INTRODUCTION

Recent advances in many domains, such as the In-

ternet, sensor networks, and grid computing, (Foster,

2002), have increased the need for technologies for

massively multi-agent systems (MMASs), in which

thousands of agents interact with one another. In par-

ticular, a technology is needed for allocating tasks

generated in real time to appropriate agents in accor-

dance with their skills and abilities so that the abili-

ties/resources of all agents are maximally used. Task

allocation has thus attracted a great deal of attention

in multi-agent systems for the purpose of obtaining

efﬁcient and high-quality services.

A number of negotiation protocols have been pro-

posed for task allocation and the contract net pro-

tocol (CNP)(Smith, 1980) has especially been im-

∗

This

research was supported in part by Kayamori Foun-

dation of Information Science Advancement and Grant-in-

Aid for Scientiﬁc Research from the Japan Society for the

Promotion of Science.

plemented in various applications (Sandholm, 1993;

Weyns et al., 2006). In CNP, an agent plays one

of two roles: managers are responsible for allocat-

ing tasks and monitoring processes and contractors

are responsible for executing the allocated tasks. A

manager agent makes a task known to the contractor

agents in the announcement phase, and they then bid

for the task on the basis of certain values (such as cost,

duration, or payment) in the bid phase. The manager

awards the contractor (or awardee) who made the best

bid in the award phase. There have been a number of

studies in which the performance and characteristics

of CNP have been investigated (e.g. (Gu and Ishida,

1996)). However, most have assumed CNP in small-

scale, less busy environments.

Unfortunately, the performance of CNP in an

MMAS is poorly understood. Clarifying this is im-

portant because interference among agents occurs

with this kind of negotiation protocol if many man-

agers have tasks to allocate. In naive CNP, a contrac-

tor agent responds to task announcements one by one,

165

Sugawara T., Hiortsu T. and Fukuda K. (2010).

PROBABILISTIC AWARD STRATEGY FOR CONTRACT NET PROTOCOL IN MASSIVELY MULTI-AGENT SYSTEMS.

In Proceedings of the 2nd International Conference on Agents and Artiﬁcial Intelligence - Agents, pages 165-171

DOI: 10.5220/0002712201650171

 SciTePress

but because many managers announce tasks simulta-

neously in a busy MMAS, the managers may have to

wait a long time to receive a sufﬁcient number of bids:

This signiﬁcantly reduce the performance of the entire

system. In the original conception of CNP (Smith,

1980), the use of multiple bids was proposed to con-

currently handle many announcements. If a contrac-

tor is awarded multiple bids simultaneously, however,

it may not be able to provide the quality or perfor-

mance guaranteed in the bids. In fact, more highly

capable contractor agents tend to be selected by many

managers. Additionally, the task structure, meaning

a task consisting of a number of different subtasks,

makes this situation more complex.

In this paper, we propose the award strategy,

called the adaptive probabilistic award strategy, to

improve the overall performance of MMAS. The ﬁrst

key idea of the proposed strategy is the probabilis-

tic selection of awardee according to the task loads

of the system. However, the task loads of the system

are hardly given to each agent. Thus the second idea

is that manager agents estimate the task loads using

statistical data (more precisely, the difference in the

standard deviation – SD) of bid values for different

subtasks, where we assume that a task consists of a

number of subtasks that have different costs.

This paper is organized as follows. First, we will

discuss the model of CNP, which has been slightly

modiﬁed for MMASs to avoid long waits and to re-

duce the number of messages, the simulation environ-

ment and the issues addressed in this paper. Then, we

clarify how some degree of ﬂuctuation can improve

the overall performance even if tasks have structures

and, by taking advantage of this effect, we propose

the adaptive probabilistic award strategy. Finally, we

experimentally show how our proposed method can

signiﬁcantly improve overall performance.

2 MODEL AND ISSUES

2.1 Model of CNP for Massively MASs

Let A = {1,...,n} be a set of agents, T be a task,

and F = { f

,.. ., f

} be the set of skills, or func-

tions that agents can perform. We assume that task

T consists of subtasks, t

,.. .,t

, (therefore, we de-

note T = {t

,.. .,t

}) and that subtask t(∈ T ) requires

s(t)-th skill, f

s(t)

, to perform it, where 1 ≤ s(t) ≤ d.

A subtask is denoted by lower-case letter t and is

simply called a task unless this creates confusion.

Agent i is expressed as a tuple, (α

), where

= (a

,.. ., a

) is the set of the agent’s capabilities

corresponds to the l-th skill, f

, and a

≥ 0; a

= 0

indicates agent i does not have skill f

), L

is the loca-

tion of i, and Q

is the queue where the agent’s tasks

are stored, waiting to be executed one by one. Set

(⊂ A) is i’s scope, i.e., the set of agents that i knows.

The metric between agents, δ(i, j), is based on their

locations, L

and L

, and is used to deﬁne the commu-

nication time (or delay) of messages between i and

Subtask t has an associated cost, γ(t), which is the

cost to complete it. Subtask t can be done by i in

dγ(t)/a

s(t)

e unit times, where dxe denotes the ceiling

function. This time is also called the execution time

of t by i. Task T is completed when all its subtasks

are completed. The cost of T is deﬁned as γ(T ) =

∑

t∈T

γ(t).

In every unit time, tl(≥ 0) tasks on average are

generated according to a Poisson distribution and ran-

domly assigned to different managers. Parameter tl is

called the task load and denotes tl tasks per unit time,

or simply tl T/t.

For CNP, we deﬁned M = {m

}(⊂ A) as the set

of managers, who allocate tasks, and C = {c

}(⊂ A)

as the set of contractors, who execute the allocated

tasks. Let us assume that |A| is large (on the order of

thousands), therefore |M | and |C | are also large, and

that the agents are distributed widely, like servers on

the Internet.

2.2 Task Allocations for MMAS

In our experiments, we used the CNP modiﬁed for use

in an MMAS to reduce the number of messages and

to prevent long waits for a response. In this CNP, (1)

multiple bids and regret and no-bid messages are al-

lowed, and (2) manager m announces subtasks in T to

restricted contractors that are selected from its scope,

, on the basis of an announcement strategy. Regret

messages are sent in the award phase to contractors

who have not been awarded the contract; no-bid mes-

sages are sent to managers by contractors who have

decided not to bid on an announced task. These mes-

sages prevent long waits for bids and award messages

(e.g., (Sandholm, 1993; Xu and Weigand, 2001)).

When manager m receives task T , it immediately

initiates the modiﬁed CNP to allocate each task

t(∈

T ) to an appropriate contractor agent. It ﬁrst sends

announcement messages to the contractors selected

from its scope in accordance with the announcement

strategy. Each contractor receiving the announcement

sends back a bid message with a certain value called

the bid value. The bid values in general might include

parameters such as the price for executing the task, the

quality of the result, or a combination of these values.

Because we assume that agents are rational in terms

ICAART 2010 - 2nd International Conference on Agents and Artificial Intelligence

166

of efﬁciency, their bid values contain the guaranteed

times for completing the task. Thus, the bid value

of contractor c is dγ(

t)/a

e +

∑

t∈Q

dγ(t)/a

s(t)

e + β,

where β is the time required to complete the task cur-

rently being executed. With multiple bidding, c might

have a number of outstanding bids. These bids are not

considered because it is uncertain whether they will

be accepted. Finally, manager m selects a contrac-

tor, the awardee, on the basis of the award selection

strategy, and sends the awardee a message with the

announced task. The awardee is usually the one that

make the best bid (here, the lowest value).

2.3 Issue

We assume that manager agents can observe, for each

subtask, the completion time, which is the elapsed

time from the time the award message was sent to

the time the message indicating that the task has been

completed was received. The completion time thus

includes the communication time in both directions,

the queue time, and the execution time. The overall

efﬁciency of an MMAS is deﬁned as the average com-

pletion time observed for all managers; as this value

is used to evaluate the system’s performance, it is re-

ferred to as the overall performance. The issue we

addressed here was to investigate the overall perfor-

mance of an MMAS under a number of award strate-

gies and to improve it by combining the advantages

of a number of award strategies.

From the viewpoint of the overall performance of

a MMAS, we have already tackled this issue and in-

vestigated the performance of an MMAS, especially

its overall efﬁciency, when tasks were allocated using

CNP with a variety of manager-side controls in the an-

nouncement and award phases, under the assumption

that all agents were cooperative and rational in terms

of efﬁciency (therefore, their bid values contained

estimated times for completing the task)(Sugawara

et al., 2008a; Sugawara et al., 2008b). This assump-

tion is reasonable because timely responses are al-

ways of great concern in interactive and realtime ser-

vices. We then found that by introducing a small ﬂuc-

tuation in the award phase of CNP according to the

task loads the overall performance could considerably

be improved. However, because they assumed that a

task had no structure, i.e., a task consisted of a single

subtask, their model could only be applied to limited

applications.

Thus, we aim to extend this approach to more gen-

eral tasks that have a certain task structure. The ex-

tension of the method proposed in (Sugawara et al.,

2008a; Sugawara et al., 2008b) is not trivial. The key

information to apply their method is the overall task

load in the environment and this was estimated from

queue lengths that were directly announced by local

contractors. Queue lengths cannot, however, correctly

indicate the workload in general if tasks consist of a

number of different subtasks, because the queued sub-

tasks do not involve the same costs.

2.4 Simulation Environment

We set |C | = 500 and |M | = 10,000 in our simu-

lation. We assumed that the contractor agents were

servers running on the Internet, providing services

requested by manager agents, which correspond to

clients in users’ sides. The agents were randomly

placed on a 150 × 150 grid with a torus topology. The

Manhattan distance between agents i and j was intro-

duced as the metric. The communication time ranged

from 1 to 14 (in ticks, the unit of time in the simula-

tion), in proportion to the value of δ(i, j).

For simplicity, let us ﬁrst assume that T = {t

}

where γ(t

) = 2500 and γ(t

) = 500. Contractor c

was assigned different capabilities so that the values

of γ(t

)/a

∈ C ) were uniformly distributed over

the range 20–100. Since γ(t

) = 2500, the values of

ranged from 25 to 125. We also assumed that man-

ager agents could not do the tasks (a

= a

= 0) (so

they had to assign the tasks to agents who could), and

that a

= a

; this means that a high-performance PC

can execute any task effectively (if the functions are

deﬁned).

The results presented here are the mean values

from nine independent trials. In these trials, the max-

imal numbers of being executed T every tick, which

were derived from the cumulative capabilities of all

contractors

∑

c∈C

, ranged from 8.15 to 8.30 T/t,

with an average of 8.25 T/t. This is the theoretical

upper limit, meaning that if the task allocation was

ideal, they could execute 8.25 tasks every tick.

When contractor c was awarded a task, it imme-

diately executed it if it had no other tasks. If c was

already executing a task, the new task was stored in

. The tasks in the queue were executed in turn. The

queue length can be ﬁnite or inﬁnite, and we assumed

that it was inﬁnite in this paper.

Manager m’s scope, S

, consists of the nearest

50 or more contractors according to this distance.

More precisely, for integer n > 0, let S

(n) = {c ∈

If the queue is ﬁnite, a number of tasks are dropped

when agents are busy, but this enables agents to recover

from the busy state. However, we aims at investigating the

overall performance by appropriate task allocations (load-

balancing), we@ daringly assume that the queue length is

inﬁnite. Note that the characteristics described in this paper

are almost identical to those when this is ﬁnite.

PROBABILISTIC AWARD STRATEGY FOR CONTRACT NET PROTOCOL IN MASSIVELY MULTI-AGENT

SYSTEMS

167

-40

-30

-20

-10

0 10000 20000 30000 40000 50000 60000 70000 80000 90000 100000 110000 120000 130000 140000 150000 16000

VA S

PAS

time (ticks)

Improvement ratio (%)

0.1

0.5 1 2 3 3.5 4 4.5 5 5.5 6 6.5 7 7.5 8 9 9 8 7.5 7 6.5 6 5.5 5 4.5 4 3.5 3 2 1 0.5 0.1

Figure 1: Ratios of completion times under PAS

(k=3 or 6) and VAS.

C|δ(m, c) ≤ n}. Then, it follows that S

(n) ⊂ S

(n +

1). S

is deﬁned as the smallest S

(n), such that

(n)| ≥ 50. Then, we ﬁxed the announcement strat-

egy in which m announced tasks to only 20 contrac-

tors

who were randomly selected from S

3 EFFECT OF PROBABILISTIC

AWARDING

(Sugawara et al., 2008a) reported that some degree of

ﬂuctuation in the award phase could improve overall

performance when a task had no structure. Our ob-

jective of the ﬁrst experiment to verify this effect oc-

curred when a task consisted of a number of subtasks.

After a task is announced, manager m would re-

ceive bids from a number of contractors, {c

,.. ., c

We denote the bid value from contractor c

as b(c

In naive CNP, m selects the contractor who submitted

the best (lowest) bid. In our ﬁrst award strategy, the

awardee is selected according to the following proba-

bility:

Pr(c

) =

1/b(c

)

∑

j=1

1/b(c

)

. (1)

This probabilistic award selection strategy is denoted

by PAS

. Variable k is called the ﬂuctuation factor.

The larger the k, the smaller the degree of ﬂuctua-

tion; PAS

and PAS

∞

correspond to “random selec-

tion” and “no randomness.” Therefore, PAS

∞

is the

award strategy in the naive CNP.

We evaluated the overall performance for task

load, tl, gradually increasing it from 0.1 (idle) to 9

(extremely busy, over the cumulative capabilities) ev-

ery 5-K ticks and then returning to 0.1. The total du-

ration was 160-K ticks. We plotted the improvement

ratios from PAS

to PAS

∞

(= CNP), which was calcu-

The overall performance varied depending on this num-

ber and was mostly optimal when it was 20. The details

have been reported in (Sugawara et al., 2007).

lated using

CNP

(PAS

) =

℘(PAS

∞

) −℘(PAS

)

℘(PAS

∞

)

× 100, (2)

where ℘(str) indicates the overall performance when

award selection strategy str is used.

The results are plotted in the curves labeled

“PAS

” and “PAS

” in Fig. 1. The task loads over

time are also given in this ﬁgure. The gradated

“javelin” above the graph also illustrates the varied

task loads. The gray area in Fig. 1 indicates the pe-

riod in which the task load exceeds the cumulative

capabilities. These curves indicate that (1) when tl

is low (very few multiple awards occur) or tl is quite

large (over the theoretical limit of cumulative capabil-

ity), PAS

∞

is better than the others (PAS

may worsen

this by 35%), but (2) otherwise PAS

(k = 3 or 6) can

improve the overall efﬁciency by as much as 25%.

This also clearly demonstrates the appropriate degree

of ﬂuctuation depends on the task load, tl. That is,

when tl ≤ 4, PAS

is better than PAS

but vice versa

when 4 < tl ≤ 7.5. Note that the center of the curves

in Fig. 1 have shifted slightly to the left because of

the effect of the delayed execution of tasks queuing

during the overload situation.

4 PROPOSED STRATEGY

4.1 Adaptive Probabilistic Awarding

The experiments discussed in the previous section in-

dicate that the ﬂuctuation factor should adaptively be

controlled according to the system’s task loads to uti-

lize the capabilities of a MMAS. However, it is impos-

sible to assess the system’s task load, because this is

a kind of non-local information. Instead, (Sugawara

et al., 2008a) estimated the task load of the MMAS

from the queue length of contractors. However, this

cannot be simply applied to our case, because if the

queue is long but the costs of queuing tasks are small,

agents cannot conclude whether the system is busy.

ICAART 2010 - 2nd International Conference on Agents and Artificial Intelligence

168

Our idea to this issue is to estimate situations by

statistically analyzing the bid values from local con-

tractors. More precisely, we used the differences be-

tween the standard deviations (SDs) of bid values

for different tasks that had different costs. Assume

that, for announced task t, manager m received bids

whose values are B

(t) = {b

(t),b

(t),. .. }, and the

SD of B

(t) is denoted by SD

(t). Let D

(T ) be

|SD

) − SD

)|, when T = {t

}. Figure 2

shows how the average values and the standard de-

viation of D

(T ) vary over time.

0 20000 40000 60000 80000 100000 120000 140000 160000

Standard deviations of D (

)

Average values of D (

)

Figure 2: Average values and SDs of D

(T ) over time.

Comparing Figs. 1 and 2, D

(T ) can be used as

the key to determining the degree of ﬂuctuation; more

precisely, the ﬂuctuation factor k is established by us-

ing the following strategy,

k = ∞ if D

(T ) ≥ 12.0,

k = 6 if 12.0 > D

(T ) ≥ 8.8, (3)

k = 3 if D

(T ) < 8.8.

This is called the variable award strategy and is de-

noted by VAS in this paper. The aim of this strategy

is to combine the best in strategies, PAS

∞

, PAS

, and

PAS

The results of performance for VAS, I

CNP

(VAS),

are also indicated by the curve labeled “VAS” in

Fig. 1. All the curves in the ﬁgure clearly indicate

that VAS can usually provide better overall perfor-

mance than other individual strategies. The improve-

ment ratios are particularly large just before the task

load reaches the theoretical limit of MMAS and right

after the contractors overcome the overload caused by

the huge number of queuing tasks. We believe that

this characteristic is important and will be discussed

in Section 4.4.

4.2 Learning Probabilistic Awarding

Although strategy VAS can provide better perfor-

mance to an MMAS, it assumes a number of ﬁxed

threshold values as shown in Eqs. (3). We propose

that agents learn these threshold values for applying

it to other tasks as follows: First, manager m ﬁrst cal-

culates the SDs of bid values for each t

∈ T and the

maximum difference between these SDs. This is de-

noted by D

(T ). Manager m also retains the max-

imum and minimum values of D

(T ) (denoted by

maxSDdiff, and minSDdiff ), thus far. Then, m esti-

mates the current task load using maxSDdiff, minS-

Ddiff, and D

(T ). We will call the award strategy

according to this algorithm the adaptive probabilistic

awarding strategy, or AAS after this.

We will investigate whether AAS can provide the

good performance comparable to VAS for task T =

} and whether it also provides the acceptable

overall performances for other tasks that have differ-

ent task structures.

Improvement ratios I

CNP

(AAS) over time is plot-

ted in Fig. 3. The duration of this experiment was

double that of the previous experiment accomplished

by repeating it twice because agents had to learn the

maximum and minimum differences between the SDs

of bid values for individual subtasks. The changes

in task loads are also illustrated as gradated javelins.

Improvement ratios I

CNP

(PAS

) and I

CNP

(VAS) have

also been shown as benchmarks. Figure 3 indicates

that AAS can performs as efﬁciently as VAS. Note

that the performance of AAS is slightly lower than

VAS only in the beginning (from 0-K to 20-K ticks),

because the learning of threshold values T h

and T h

is not sufﬁcient.

4.3 Applying Strategy to Other Tasks

Finally, we have to show whether or not the proposed

strategy can provide the better performance for tasks

with other cost structures. The AAS strategy relies on

the difference of costs of subtasks, we examined the

case when a task has the different cost ratio, other than

2500:500. Instead, we set the sum of the costs of these

tasks to 3000, in order to standardize the theoretical

upper limit number of task executions by all agents.

Let us denote the costs of subtasks as super-

scripts. For example T

25−3−2

= {t

} means

(γ(t

),γ(t

)) = (2500, 300,200). Thus, the task

used in the previous experiments is denoted by T

25−5

The results are plotted in Fig. 4. Note that, because

we intended to compare their performance under AAS

and PAS

∞

, we ﬁxed the changes in task loads over

time in those experiments. Figure 4 indicates that

the overall performance for T

25−3−2

20−10

18−12

20−8−2

and T

15−8−5−2

under AAS are generally

better than those under PAS

∞

. Of course, VAS is not

applicable to these tasks that have different task struc-

tures.

PROBABILISTIC AWARD STRATEGY FOR CONTRACT NET PROTOCOL IN MASSIVELY MULTI-AGENT

SYSTEMS

169

-20

-15

-10

-5

0 40000 80000 120000 160000 200000 240000 280000 320000

time (ticks)

Improvement ratio (%)

VA S

PAS

AAS

Figure 3: Improvement ratios of AAS compared with PAS

and VAS over time.

-10

-5

0 40000 80000 120000 160000 200000 240000 280000 320000

15-8-5-2 20-8-2 20-10 22-8 18-12

time (ticks)

Improvement ratio (%)

TTT

Figure 4: Improvement ratios of AAS for various tasks.

4.4 Cases for Simpler Tasks

We have to discuss two simpler cases in which (1) the

costs of all subtasks are quite similar and (2) each sub-

task is allocated to one of disjoint sets of contractors.

In these cases, we believe that the simple extension of

(Sugawara et al., 2008a; Sugawara et al., 2008b) can

be applied, because the queue length reﬂects the task

loads (or more precisely, multiple awarding because

of task congestion) in these cases. It is also clear

that when a task cannot be divided into subtasks, the

method in (Sugawara et al., 2008a; Sugawara et al.,

2008b) is applicable without any extension.

sectionDISCUSSION The improvement in the pro-

posed algorithm was maximum just before the task

load reached the theoretical upper limit and right after

the contractors overcame the overload caused by the

huge number of queuing tasks. These corresponded

to situations during 50 to 75-K ticks, 105 to 120-

K ticks, 210 to 235-K ticks and 265 to 280-K ticks

in Figs. 3 and 4. We want to emphasize that this is

one of its quite important characteristics; an MMAS

should perform at its full potential near the theoretical

limit. If the task load is quite low, any task allocation

strategy can provide satisfactory services. However,

if it is extremely heavy and over the theoretical limit,

no strategy can accomplish acceptable performance.

In other situations, the system must yield maximum

performance. Our experimental results revealed that

the proposed strategy can provide excellent improve-

ments in these situations.

Note that the centers of the curves in Figs. 3 and

4 are slightly shifted to left. Moreover, if we look at

Fig. 4 more carefully, the values of improvement ra-

tios at around 100-K ticks and 260-K ticks peaked on

the upper of lower direction (but these are excessive

values). While the task load was over the cumulative

capabilities of MMAS, their queues became longer.

After the extremely busy situation, their lengths were

getting shorter and they returned to unbusy states. Of

course, the better strategy can execute tasks more ef-

ﬁciently, thus can enable all agents to return to un-

busy states a little earlier. This results in the peaks to

lower direction but their differences is not signiﬁcant

because the PAS

∞

is better when the task load is the

extremely busy and under AAS, all managers intended

to adopt the same strategy.

As the communications link-bandwidth of the In-

ternet is getting increasingly broader, the bottleneck

in network computing has shifted to nodes (hosts) in

which many tasks are done. Here, it is important that

tasks should be distributed to appropriate servers that

can complete them in the shortest time. Communica-

tion delay, on the other hand, decreases and becomes

insigniﬁcant. However, our experiments showed that

if many managers determine the contractors accord-

ing to bids from local contractors, a high-performance

contractor is likely to be selected as the best bidder;

thus, tasks are concentrated on this contractor. There-

fore, some degree of ﬂuctuation has beneﬁcial effects

on enabling this concentration to be avoided. If we

more carefully consider the reason for the concentra-

tion, it must be multiple awarding due to the small

communication delay. As the network broadens, the

delay decreases, but we cannot eliminate the latency

caused by the ﬁnite speed of light and the ﬁnite pro-

ICAART 2010 - 2nd International Conference on Agents and Artificial Intelligence

170

cessing speed of the switching fabric. Our experi-

ments suggest that this small delay signiﬁcantly af-

fects the overall performance of busy MMASs. Re-

cent research has focused on overlay networks that

reﬂect the application-layer links between agents and

ignore the physical network topology. However, this

is insufﬁcient in an MMAS to elicit agents’ capabili-

ties. It is necessary to take into account the physical

topology in designing an MAS to minimize interfer-

ence among agents.

Although the proposed method can provide bet-

ter performance than the naive CNP, it is possible that

more tailored controls can improve more for compli-

cated tasks. In addition, from Fig. 4, the duration

in which the improvement ratios I

CNP

(AAS) are large

moves to slightly less busier situations if |T | is larger.

We believe that this is caused by the increased chance

of multiple awards. Our experiments show that CNP

in an MMAS exhibits quite different features from

CNP in a small-scale MAS and thus a simple strat-

egy like the naive CNP cannot elicit the potential ca-

pability of MMASs. More research is needed for this

purpose; this paper represents the ﬁrst step toward this

direction.

5 CONCLUSIONS

We proposed a probabilistic award strategy in CNP

for a massively MAS to elicit the potential capabili-

ties of all agents. In this strategy, a manager agent (a)

announces subtasks, (b) statistically analyzes the bids

for each of these, (c) estimates the current local task

load, and (d) introduces an adaptive degree of ﬂuc-

tuation in the award phase. We then experimentally

demonstrated that this strategy provides considerably

better performance than the naive CNP.

In this paper, we focused on CNP because it is

the well-known protocol, but CNP is not the only ap-

proach to task allocation. It is necessary to investi-

gate other protocols (with some modiﬁcation) or cre-

ate a new protocol for busy MMAS. This is our future

research. Meanwhile, when there are many agents,

it is crucial to consider the inter-agent structure like

(Gaston and desJardins, 2005; Abdallah and Lesser,

2007). Thus, to investigate the relationship between

network structures and the performance of the CNP

in MMASs is also another future research issue.

REFERENCES

Abdallah, S. and Lesser, V. (2007). Multiagent Reinforce-

ment Learning and Self-Organization in a Network

of Agents. In Proceedings of the Sixth International

Joint Conference on Autonomous Agents and Multi-

Agent Systems, pages 172–179, Honolulu. IFAAMAS.

Foster, I. (2002). What is the grid? a three point checklist.

Grid Today, 1(6).

Gaston, M. E. and desJardins, M. (2005). Agent-organized

networks for dynamic team formation. In Proceed-

ings of 4th Int. Joint Conf. on Autonomous Agents and

Multiagent Systems (AAMAS2005), pages 230–237.

Gu, C. and Ishida, T. (1996). Analyzing the Social Be-

havior of Contract Net Protocol . In de Velde, W. V.

and Perram, J. W., editors, Proceedings of 7th Euro-

pean Workshop on Modelling Autonomous Agents in a

Multi-Agent World (MAAMAW 96), LNAI 1038, pages

116 – 127. Springer-Verlag.

Sandholm, T. (1993). An Implementation of the Contract

Net Protocol Based on Marginal Cost Calculations. In

Proceedings of the Eleventh National Conference on

Artiﬁcial Intelligence, pages 256–262.

Smith, R. G. (1980). The Contract Net Protocol: High-

Level Communication and Control in a Distributed

Problem Solver. IEEE Transactions on Computers,

C-29(12):1104–1113.

Sugawara, T., Hirotsu, T., Kurihara, S., and Fukuda, K.

(2007). Performance Variation Due to Interference

Among a Large Number of Self-Interested Agents. In

Proceedings of 2007 IEEE Congress on Evolutionary

Computation, pages 766–773.

Sugawara, T., Hirotsu, T., Kurihara, S., and Fukuda, K.

(2008a). Adaptive Manager-side Control Policy in

Contract Net Protocol for Massively Multi-Agent Sys-

tems. In Proceedings of 7th Int. Joint Conf. on

Autonomous Agents and Multiagent Systems (AA-

MAS2008), pages 1433–1436. IFMAS.

Sugawara, T., Hirotsu, T., Kurihara, S., and Fukuda, K.

(2008b). ”controling contract net protocol by local

observation for large-scale multi-agent systems”. In

Cooperative Information Agents XII (CIA2008), pages

206–220. LNCS 5180, Springer.

Weyns, D., Bouck

e, N., and Holvoet, T. (2006). Gradi-

ent Field-Based Task Assignment in an AGV Trans-

portation System. In Proceedings of 5th International

Joint Conference on Autonomous Agents and Multia-

gent Systems (AAMAS2006), pages 842 – 849.

Xu, L. and Weigand, H. (2001). The Evolution of the Con-

tract Net Protocol. In Proceedings of WAIM 2001, vol-

ume LNCS 2118, pages 257–264.

PROBABILISTIC AWARD STRATEGY FOR CONTRACT NET PROTOCOL IN MASSIVELY MULTI-AGENT

SYSTEMS

171