Download Report

What is Brought by R&D Cooperation?
A Global, Dynamic Analysis
Jeroen Hinloopen∗
Grega Smrkolj∗†
Florian Wagener∗
August, 2011
Preliminary and Incomplete
Abstract
We provide a detailed dynamic, global analysis for a strategic model of
R&D, which allows us to consider the R&D process prior to the production
stage. Contrary to the received literature, we do not have to assume that
initial marginal costs are below the choke price. Our global analysis yields
four distinct scenarios, depending on initial marginal cost to be below or
above the choke price and whether or not the R&D process leads to the
steady state; if not, the market is either not formed at all or R&D efforts
are progressively scaled down, implying that the technology will die out
over time while production is eventually terminated. We then compare
two different possibilities across these scenarios: firms cooperate in R&D
only or they extend this cooperation also to the product market. This
yields two main results. First, the set of initial marginal costs above the
choke price that will induce a stable equilibrium is larger if firms also
collude in the product market. Second, the set of initial marginal costs
that induces firms to abandon the technology in time is larger if firms
cannot collude in the product market. We also discuss the consequences
of these results for the rigour with which antitrust policies should be
assessed.
Keywords: Bifurcations, Cartel, Cooperation, R&D, Spillovers
JEL: D43, D92, L13, L41, O31, O38
1
Introduction
The possible trade-off between static and dynamic efficiency remains a subject of
debate. One reason for a consensus view not to have surfaced is the ambiguous
relation between competition intensity on product markets and the incentives
to invest in Research and Development (R&D) (see for a recent overview of this
debate Schmutzler, 2009). The another reason is the wide array of models that
are used to analyze the problem. There is a fundamental distinction between
static and dynamic models. Early contributions focussed on patent races (Reinganum, 1981) or on the strategic aspects of R&D investments (d’Aspremont
∗ University
of Amsterdam and Tinbergen Institute
author. University of Amsterdam, Amsterdam School of Economics,
Roetersstraat 11, 1018 WB Amsterdam, The Netherlands. E-mail address: g.smrkolj@uva.nl
† Corresponding
1
and Jacquemin, 1988). Several recent contributions analyze dynamic versions
of well-known static models (Cellini and Lambertini, 2009, Lambertini and Mantovani, 2009, Kov´
aˇc et al., 2010). In this paper we also present a dynamic model
of R&D. The distinguishing feature of our approach is that we provide a truly
global analysis.
A widely implemented policy response to the possible conflict between static
and dynamic efficiency is to allow independent firms to cooperate in R&D while
remaining competitors on the ensuing product market.1 This preserves the
benefits of competition while at the same time removes obstacles for R&D investments. Because of the public goods character of R&D, the market is believed
to typically provide too little incentives for investing in R&D. Allowing firms to
form R&D cooperatives internalizes, at least to some extent, the concomitant
positive externalities. However, as Scherer (1980) observes: “the most egregious
price fixing schemes in American history were brought about by R&D cooperatives.” Martin (1995) shows that, indeed, the formation of R&D cooperatives
makes it easier to sustain tacit collusion in the product market: “common assets create common interests, and common interests make it more likely that
firms will non-cooperatively refrain from rivalrous behavior.”2 Supported by a
prevalently declinatory position of economic theory towards market collusion,
the authorities of both the USA and EU have made market cartels explicitly
illegal.3
The competition authorities are therefore faced with some kind of a trade-off
between static and dynamic efficiency. Whereas the extension of cooperation in
R&D to that in the product market may reduce output, without it, incentives
to pursue R&D may be lower. In this paper we examine to what extent the
fear that firms cooperating in R&D may also seek to optimize their production
plans is justified.4
Recently, Hinloopen, Smrkolj, and Wagener (2011) have warned that received models, which require that marginal costs are below the choke price at
any point in time and thereby impose that investments at all time coexist with
1 National Cooperation Act passed by Congress in the USA in 1984 allows firms to cooperate
in R&D (research joint ventures are explicitly evaluated on a rule of reason basis) provided this
cooperation is not extended to product markets. The EU competition law has been even more
sympathetic to joint research projects as joint ventures are exempt from antitrust restrictions
and have often even been initiated and subsidized by different EU programs and government
bodies.
2 That cooperation in R&D may to an extent blur competition can also be gathered from
occasional press releases by corporative managers. For instance, when asked with whom they
compete in the upstream market, Andrew N. Liveris, the CEO of Dow Chemical, answered
how “...Frankly it is not very clear anymore, because I’m joint-venturing with all of them.”
(The Focus (2007)).
3 In the USA, price-fixing conspiracies are banned by the Sherman Act. Under EU competition law, market collusion is banned by Articles 85 and 86 of the Treaty of Rome. Some
exceptions like those for agreements that facilitate technological progress are provided. However, these should not completely restrict competition and very few such exemptions were
granted in practice by the European Commission.
4 Geroski (1992) argues that it is the feedback from product markets that directs research
towards profitable tracks and that, therefore, for an innovation to be commercially successful
there must be strong ties between marketing and development of the product. Jacquemin
(1988) observes that R&D cooperatives are fragile and unstable. He reasons that when there
is no cooperation in the product market, there exist a continuous fear that one partner in
R&D may be strengthened in such a way that he will become a dangerous competitor in the
product market. Preventing firms from collaborating in the product market may therefore
destabilize or prevent many valuable R&D agreements.
2
production, cannot by definition capture the very nature of industry dynamics
as this assumption is obviously violated in the early stages of the innovation
process. In this paper we extend their dynamic framework describing the R&D
investments of a monopolist to allow for two potentially competing firms. A
distinguishing feature of our analysis, compared to the received literature, then
is that we consider a much larger parameter space. In particular, we do not
rule out a priori all those cases where the initial marginal costs are above the
choke price. Static models are forced to impose the assumption of ‘low enough’
marginal costs; dynamic versions of these models maintain this assumption without further justification. However, research has started long before a prototype
sees the light; development begins long before the launch of a new product. It
is then only natural to take this period into account when setting up a dynamic
model of R&D. Indeed, the use of proper bifurcation theory allows us to model
the R&D process prior to the production stage for all possible initial technologies. In particular, it yields a bifurcation diagram that tells us for every possible
parameter combination the qualitative features of any equilibrium. Our global
analysis yields four distinct scenarios: (i) initial marginal cost are above the
choke price and the R&D process is initiated, after some time yielding production for which marginal cost continue to fall with subsequent R&D investments;
(ii) initial marginal cost are above the choke price and the R&D process is not
initiated, yielding no production at all; (iii) initial marginal cost are below the
choke price and the R&D process is initiated, directly leading to production
whereby marginal cost continues to fall over time, and (iv) initial marginal cost
are below the choke price and the initiated R&D process is progressively scaled
down, implying that the technology will die out over time while production is
eventually terminated.
We then compare two different possibilities across these scenarios; one in
which firms form a full cartel, and one in which firms collude in R&D only.
We find that the range of initial marginal costs that leads to production is
larger when firms collude in both R&D and production. Furthermore, the set
of initial marginal costs that induces firms to abandon the technology in time is
larger if firms cannot collude in the product market. The speed at which firms
abandon non-prospective technologies is larger as well if firms cannot collude in
the product market.
Our results are not without policy implications. For designing an optimal
antitrust policy it is important to understand that these policies not only affect
current markets, but also markets that are not yet visible. In particular, preventing firms from colluding in the product market reduces the number of R&D
trajectories that successfully lead to new markets. In itself this constitutes a
welfare loss.
The remainder of the paper is organized as follows. Section 2 presents the
model. Section 3 describes all possible equilibria in case firms collude in both
R&D and on the product market, and analyzes the properties of the global
equilibrium dynamics. Section 4 derives all equilibria in case firms collude in
R&D only, which are confronted with the full collusion case in Section 5. Section
6 concludes.
3
2
The model
Time t is continuous: (t ∈ [0, ∞)). There are two a priori fully symmetric firms
which both produce a homogenous good at constant marginal costs. In every
instant, market demand is:
p(t) = A − qi (t) − qj (t),
(1)
where p(t) is the market price, qi (t) is the quantity produced by firm i = {1, 2},
qj (t) is the quantity produced by its rival (i 6= j), and A is the choke price.
Each firm can reduce its marginal cost by investing in R&D. That is, we
consider process innovations which bring about reductions in production costs.
The R&D process is subject to technological spillovers.5 Firm i exerts R&D
effort ki (t) and as a consequence of these investments, its marginal cost evolves
over time as follows:
dci (t)
≡ c˙i (t) = ci (t) (−ki (t) − βkj (t) + δ) ,
dt
(2)
where kj (t) is the R&D effort exerted by its rival and where β ∈ [0, 1] measures
the degree of spillover. The parameter δ > 0 is the constant rate of decrease
in efficiency due to the ageing of technology. Both firms start with the same
technology: ci (0) = cj (0) = c(0). The cost of R&D efforts per unit of time is
for firm i given by:
Γi (ki (t)) = b(ki (t))2 ,
(3)
where b > 0 is inversely related to the cost-efficiency of the R&D process. Hence,
the R&D process exhibits decreasing returns to scale.6 The firms discount the
future with the same constant rate ρ > 0. The instantaneous profit of firm i is:
πi (qi , qj , ki , ci ) = (A − qi − qj − ci )qi − bki2 ,
(4)
yilding total discounted profit:
Z
Πi (qi , qj , ki , ci ) =
∞
πi (qi , qj , ki , ci )e−ρt dt.
(5)
0
The model has five parameters: A, β, b, δ, and ρ. The analysis can be
simplified by considering a rescaled version of the model which, as defined in
Lemma 1, carries only three parameters: β, φ, and ρ˜. That is, rescaling the
model reduces the five-dimensional parameter space into a three dimensional
one.
Lemma 1. By choosing the units of t, qi , qj , ci , cj , ki , and kj appropriately,
we can assume A = 1, b = 1, and δ = 1. This yields the following rescaled
version of the model:
π
˜i (˜
qi , q˜j , k˜i , c˜i ) = (1 − q˜i − q˜j − c˜i )˜
qi − k˜i2 ,
(6)
5 These spillovers are the result of industrial espionage, common sources of basic research
(e.g, universities), and other various ways a firm can learn about its competitor’s research
experience (e.g, by hiring competitor’s employees). Their extent depends on the strength of
intellectual property protection and on the ability of firms to protect their knowledge.
6 In assuming decreasing returns to scale, we follow the bulk of the literature. See Hinloopen, Smrkolj, and Wagener (2011) for a discussion.
4
˜ i (˜
Π
qi , q˜j , k˜i , c˜i ) =
Z
∞
˜
π
˜i (˜
qi , q˜j , k˜i , c˜i )e−ρ˜t dt˜
(7)
0
c˜˙i = c˜i 1 − k˜i + β k˜j φ ,
with
ci =
c˜i (0) = c˜0 ,
q˜i ≥ 0,
k˜i ≥ 0
ρ˜ > 0,
φ>0
c˜i ∈ [0, ∞) ∀ t˜ ∈ [0, ∞)
conversion rules defined as: qi = A˜
qi , qj = A˜
qj , ki = √Ab k˜i ,
˜
A
A˜
ci , cj = A˜
cj , πi = A2 π
˜ i , π j = A2 π
˜j , φ = δ √
, t = δt , ρ˜ = ρδ .
b
(8)
(9)
(10)
kj =
A ˜
√
k ,
b j
The rescaled version of the model introduces a new parameter φ, which
captures the efficiency of the R&D process: a higher b implies that each unit
of R&D effort costs a firm more, whereas a higher δ implies that each unit of
R&D effort decreases the marginal cost by less. Therefore, a higher (lower) φ
corresponds to a more (less) efficient R&D process. For notational convenience
we henceforth omit tildes in the notation.
3
R&D and market cooperation (Full Cartel)
In this section we assume that each firm operates its own R&D laboratory
and production facility, however, firms adopt a strictly cooperative behavior
in determining both their R&D efforts and output levels. These assumptions
amount to imposing a priori the symmetry conditions ki (t) = kj (t) = k(t) and
qi (t) = qj (t) = q(t). Equation (8) thus reads as7 :
c˙ = c(1 − (1 + β)φk).
(11)
The profit of each firm in every instant is:
π(q, k, c) = (1 − 2q − c)q − k 2 ,
yielding its total discounted profit over time:
Z ∞
Π(q, k, c) =
π(q, k, c)e−ρt dt.
(12)
(13)
0
The optimal control problem of the two firms is to find controls q ∗ and k ∗ that
maximize the profit functional Π subject to the state equation (11), the initial
condition c(0) = c0 , and two boundary conditions which must hold at all times:
q ≥ 0 and k ≥ 0.8 Note that according to (11), if c0 > 0, then c(t) > 0 for all t.
The set of all possible states at each time t is given by c ∈ [0, ∞).
7 It may seem reasonable to assume that when cooperating in this way, firms fully share
their information, that is, that the level of spillover is at its maximum (β = 1). For the sake
of the generality of a solution, we do not a priori fix the value of β. Intuitively, this way
we allow for incomplete complementarity between the results of firms’ R&D activities that
may occur despite otherwise complete information sharing (e.g., there might still be some
ex post duplication or substitutability in R&D outputs if firms maintain their laboratories
separated). This way, we more properly define β as the level of complementarity between the
R&D outputs, and not solely as the level of bare information sharing.
8 Note that due to symmetry maximizing the per-firm total profit is the same as maximizing
the two firms’ combined total profit.
5
To solve the dynamic optimization problem, we introduce the current-value
Pontryagin function9
P (c, q, k, λ) = (1 − 2q − c) q − k 2 + λc (1 − (1 + β)φk) ,
(14)
where λ is the current-value co-state variable. It measures the marginal worth
of the increment in the state c for a firm in the cartel at time t when moving
along the optimal path. As marginal cost is a ‘bad’, we expect λ(t) ≤ 0 along
optimal trajectories.
Pontryagin’s Maximum Principle states that if the pair (c∗ , q ∗ , k ∗ ) is an
optimal solution, then there exists a function λ(t) such that c∗ , q ∗ , k ∗ , and λ
satisfy the following conditions:
1.) q ∗ and k ∗ maximize the function P for each t:
P (c∗ , q ∗ , k ∗ , λ) = max 2 P (c∗ , q, k, λ),
(q,k)∈R+
(15)
2.) λ is a solution to the following co-state equation:
−
∂P
= λ˙ − ρλ
∂c
⇔
λ˙ = q + (ρ − 1 + (1 + β)φk) λ,
(16)
which is evaluated along with the equation for the marginal cost
c˙ = c(1 − (1 + β)φk)
(17)
and the initial condition c(0) = c0 .
Let Q(c, λ) and K(c, λ) solve the problem max(q,k) P (c, q, k, λ) for every
(c, λ). We define the current-value Hamilton function H(c, λ) = P (c, Q(c, λ), K(c, λ), λ).
The above necessary conditions (15)-(16) for an optimal solution are complemented by the transversality conditions:10
lim e−ρt H(c, λ) = 0
(18)
lim e−ρt λc = 0.
(19)
t→∞
and
t→∞
If problem (15) has a solution, this solution necessarily satisfies the following
Karush-Kuhn-Tucker conditions:
∂P
= 1 − 4q − c ≤ 0,
∂q
q
∂P
= 0,
∂q
(20)
∂P
∂P
= −2k − (1 + β)φλc ≤ 0,
k
= 0.
(21)
∂k
∂k
Conditions (20) imply that either i) q = 0 and c ≥ 1 or ii) q > 0 and c < 1.11
In particular,
(
(1 − c)/4 if c < 1
q∗ =
(22)
0
if c ≥ 1
9 Also
called pre-Hamilton or un-maximized Hamilton function.
necessity of (18), which allows exclusion of non-optimal trajectories, was proven by
Michel (1982). For the necessity of transversality condition (19), see Kamihigashi (2001).
11 Observe that in the original (non-rescaled) model, the analogous condition for positive
production is c(t) < A.
10 The
6
Conditions (21) imply that either i) k = 0 and
k > 0 and ∂P
∂k = 0. In particular,
(
λc
if
− (1+β)φ
∗
2
k =
0
if
∂P
∂k
≤ 0 (implying λ ≥ 0) or ii)
λ≤0
λ>0
.
(23)
The above conditions for the optimal production yields two regimes of the
state-control systems. The first one is characterized by positive production, in
the second one there is no production.
1. The system with positive production. In the region where c < 1, using
(16), (17), (22), and (23), we obtain the following optimal state and co-state
dynamics:
(
c˙ = c 1 + 21 (1 + β)2 φ2 λc
(24)
λ˙ = 1 (1 − c) + ρ − 1 − 1 (1 + β)2 φ2 λc λ,
4
2
12
where λ ≤ 0. Equation (23) depends on the co-state and state variable. Note
that the correspondence between k and λ is one-to-one if, and only if, λ ≤ 0,
and that therefore the system can be rewritten in the state-control form.
Differentiate equation (23) with respect to time to obtain the dynamic equation
(1 + β)φ ˙
dk
≡ k˙ = −
λc + λc˙ .
(25)
dt
2
Substitute λ˙ from (16) and c˙ from (17). Note that equation (23) also implies
that
2k
λ=−
.
(26)
(1 + β)φc
Substitute this expression into (25), together with the expression for the optimal
output level (q = (1 − c)/4), to obtain:
(1 + β)φ
k˙ = ρk −
c(1 − c).
8
(27)
Hence, the system with positive production (c < 1) in the state-control form
consists of the following two differential equations:
(
k˙ = ρk − (1+β)φ
c(1 − c)
8
(28)
c˙ = c (1 − (1 + β)φk)
2. The system with zero production. In the region c ≥ 1, substituting (16),
(17), and (26) into (25) and imposing q = 0, gives the following expression:
k˙ = ρk.
(29)
Hence, the state-control system with zero production consists of the following
two differential equations:
(
k˙ = ρk
(30)
c˙ = c (1 − (1 + β)φk)
12 The
differential equations in the main text are all valid for λ ≤ 0. For a complete
specification of optimal state and co-state dynamics (including the case when λ > 0), we refer
the reader to Appendix E.
7
The state-co-state analogue is:
(
c˙ = c 1 + 12 (1 + β)2 φ2 λc
λ˙ = ρ − 1 − 1 (1 + β)2 φ2 λc λ
(31)
2
The Hamilton function, obtained by substituting (23) and appropriate expressions for q from (22) into the Pontryagin function (14), is given by
(
H(c, λ) =
1
8 (1
− c)2 + 14 (1 + β)2 φ2 λ2 c2 + λc
1
4 (1
2 2 2 2
+ β) φ λ c + λc
if
c ∈ [0, 1)
if
c ∈ [1, ∞),
(32)
where λ ≤ 0. The state-control analogue is given by
H(c, k) =



1
8 (1
2
− c)2 + k k − (1+β)φ
2
k k − (1+β)φ
if
c ∈ [0, 1)
if
c ∈ [1, ∞),
(33)
where k ≥ 0.
3.1
Steady-state solutions
The steady-state solutions to (28) and (30) are obtained by imposing the stationarity conditions k˙ = 0 and c˙ = 0.
Lemma 2. The ‘no-production’ state-control system (30) has no steady-state
in the region where it is defined.
Proof. See Appendix B.
Consider now system (28). Imposing the stationarity condition k˙ = 0, we obtain
kF C =
(1 + β)φ
c(1 − c) ≥ 0
8ρ
∀c ∈ [0, 1],
(34)
where superscript FC stands for Full Cartel.13 Steady-state marginal cost follows from inserting (34) into the state dynamics (17) and imposing the stationarity condition c˙ = 0:
(1 + β)2 φ2
c˙ = c 1 −
c(1 − c) = 0.
(35)
8ρ
This yields:
cF C = 0,
where
1
V =
2
cF C =
s
1
± V,
2
(36)
32ρ
.
(37)
(1 + β)2 φ2
√
Observe that V is real if, and only if, φ ≥ 32ρ/(1+β), in which case V ∈ [0, 21 )
and cF C ∈ [0, 1). The solution to the system is summarized in Proposition 1
and is depicted in Figure 1.
13 All
1−
steady-state values have superscript FC.
8
√
Proposition 1. If φ > 32ρ/(1 + β), the state-control system with positive
production (28) has three steady states:
i) (cF C , k F C ) = (0, 0) is an unstable node,
1
is either an unstable node or an unstable foii) (cF C , k F C ) = 21 + V, (1+β)φ
cus, and
1
iii) (cF C , k F C ) = 21 − V, (1+β)φ
is a saddle-point steady state.
√
At φ = 32ρ/(1 + β), a so-called “saddle-node bifurcation” occurs as the last
two steady states coincide and form a so-called “semi-stable” steady state.
√
If φ < 32ρ/(1 + β), the state-control system with positive production has one
single steady state: the origin (cF C , k F C ) = (0, 0), which is unstable.
Proof. See Appendix C.
SN
12
3 steady
states
10
Β=0
8
Φ
Β = 0.5
Β=1
6
4
Hc, kL = H0, 0L is the only steady state
2
0
0
1
2
3
4
5
Ρ
Figure 1: Steady states of the state-control system. The
√ drawn curves are the
saddle-node bifurcation (SN) curves given by φ = 32ρ/(1 + β). They are
drawn for three fixed values of β. Passing through them from above, the two
steady states other than origin coincide, form a ”semi-stable” steady state, and
then disappear.
In the saddle-point steady state, instantaneous marginal cost, output, and profits of each firm are, respectively:
p
(1 + β)φ − (1 + β)2 φ2 − 32ρ
FC
(38)
c
=
2(1 + β)φ
p
(1 + β)φ + (1 + β)2 φ2 − 32ρ
qF C =
,
(39)
8(1 + β)φ
9
π
3.2
FC
p
(1 + β)2 φ2 − 16(1 + ρ) + (1 + β)φ (1 + β)2 φ2 − 32ρ
.
=
16(1 + β)2 φ2
(40)
Global Optimality & Industry Dynamics
(Bifurcations of optimal vector fields)
The stable manifold of the saddle-point steady state is one of the candidates
for an optimal solution. However, as the following remark shows, it may not
(always) be the true optimum.
Remark 1. The Pontryagin function of the optimization problem as given in
(14) is not jointly concave in the state and control variables. Hence, necessary
conditions for an optimum are not necessarily sufficient.
Proof. See Appendix D.
Therefore, we have to carry out a detailed bifurcation analysis14 to assess
the dependence of the solution structure on the model parameters. This yields
a complete overview of how optimal R&D investment and concomitant production respond to changes in the initial level of marginal cost and to varying
characteristics of the R&D process. To analyze global optimality, we have to
inspect the complete state-control space, which is the set of solutions (c(t), k(t))
of the state-control system considered as parameterized curves (trajectories) in
a plane. Its general form is sketched in Figure 2.
The dotted vertical line c = 1 separates the region with zero production
from the region with a positive level of production. In the left-side region, the
parabola which achieves its maximum at c = 1/2 is the locus k˙ = 0. The
1
loci representing points where c˙ = 0 are the horizontal line with height (1+β)φ
and the k-axis. Loci c˙ = 0 and k˙ = 0 intersect in the saddle-point steady
state (S2 ) and the two unstable steady states (S1 and S3 ). The black arrows
in squares indicate the direction of trajectories in each respective region. The
horizontal arrows indicate an increase or a decrease in c, whereas the vertical
ones refer to changes in k. A number of trajectories is indicated by grey arrows.
Together, the trajectories cover the entire space. However, there are only two
trajectories leading to the saddle-point steady state; these are called “stable
paths” or “stable manifolds”. They are indicated by the connected black arrows
pointing towards S2 . The lighter arrows pointing away from S2 indicate the two
unstable paths. Trajectories spiral out from S3 in a counter-clockwise motion.
The solution to the state-control system (30), which gives trajectories in the
region with zero production, is given by:
k(t)
= C1 eρt
c(t)
= C2 e
eρt (1+β)φC1
t−
ρ
(41)
,
(42)
where C1 and C2 are positive real constants.
14 The variations in the values of parameters typically lead to qualitative changes in the
solution structure (e.g., some steady states lose their stability, indifference points appear).
Such qualitative changes in the solution structure due to smooth variations in parameters are
called bifurcations. For a good introduction to bifurcation theory and precise definitions of
terms used in this section, see Grass et al. (2008). For a recent discussion, see, for instance,
Kiseleva and Wagener (2009).
10
q>0
q=0
k˙ = 0
k
L1
L2
S2
S
1
(1+β)φ
3
c˙ = 0
L3
S1
0
0.5
1
1.5
c
Figure 2: Illustrative sketch of the state-control space.
A solution to the state-control system (28), which yields trajectories in the
region with positive production, cannot be obtained analytically; we therefore
make use of geometrical-numerical techniques.15
A particular trajectory is a candidate for an optimal solution if it satisfies all
necessary conditions: those given by Pontryagin’s Maximum Principle together
with transversality conditions (18) and (19). As Figure 2 shows, there exists a
whole range of solutions (c(t), k(t)) to the state-control systems. These trajectories all satisfy the conditions of Pontryagin’s Maximum Principle. By ruling
out those trajectories that do not satisfy the transversality conditions, we are
left with the candidates for an optimal solution.
Lemma 3. The points along the c-axis in Figure 2 in the set
{(c, k) : c ∈ (0, 1), k = 0}
cannot be a part of any optimal trajectory.
Proof. See Appendix E.
Lemma 4. All trajectories along which k → ∞ and c → 0 as t → ∞ can be
ruled out as optimal solutions.
Proof. See Appendix F.
For instance, the trajectory denoted by L1 in Figure 2 is in the set of
Lemma 4.
Lemma 5. The trajectory through the point (c, k) = (1, 0) satisfies the transversality conditions (18) and (19).
15 See
also [12, 28]. All numerical calculations were done in Matlab.
11
Proof. See Appendix G.
In Figure 2, the trajectory of Lemma 5 is labelled L3 . We call this the “exit
trajectory” as it implies that the firms (eventually) exit the market. Indeed, at
some stage the exit trajectory enters the region where c ≥ 1, at which point
both R&D investment and production come to a halt.16
Notice that the stable path (denoted by L2 in Figure 2) to the saddle-point
steady state (S2 ) also satisfies the transversality conditions. Notably, along this
trajectory, both c and k converge to finite limits and, therefore, so does the
co-state variable (26).
As the following corollary to the above lemmas shows, we are left with two
candidates for an optimal solution.
Corollary 1. The set of candidates for an optimal solution consist of the stable path of the saddle-point steady state and the trajectory through the point
{(c, k) = (1, 0)}.
Proof. See Appendix H.
To find an optimal solution from the candidates, we make use of the following
lemma.
Lemma 6. If the pair (c(·), q(·), k(·)) satisfies the necessary conditions (15)(19), then
Z ∞
Π(c, q, k) =
π(q, k, c)e−ρt dt
0
1
= P (c(0), q(0), k(0), λ(0))
ρ
1
= H (c(0), k(0)) ,
ρ
(43)
where H is the Hamilton function in state-control form.
Proof. See, e.g., Grass et al. (2008), ch. 3, pp. 161-162.
Lemma 6 is useful whenever several paths satisfy the necessary conditions
and we have to compare the corresponding values of the objective (total discounted profit) function in order to determine which one of them is the optimal
solution. As the lemma states, these values are simply equal to the values of
the Hamiltonian (33) at the corresponding initial points.
Figure 3 provides an example in which the optimal solution is the stable
path leading to the saddle-point steady state. Note that the initial marginal
costs can be higher than c = 1. The optimal decision leads the firms to first
produce nothing but to invest increasingly in the reduction of their marginal
costs. When marginal costs are below one, they start producing. The level of
16 Strictly speaking, firms’ marginal costs continue to increase along the exit trajectory due
to the positive depreciation rate δ. However, we interpret any situation in which a firm stays
inactive as if it has left the market.
12
R&D investments then gradually decreases to its long-run steady-state level.17
The value of the total discounted profit function is increasing with decreasing
marginal cost. Clearly, starting with lower initial marginal costs allows each
firm in the cartel to obtain a higher present discounted value of its profit flow
(notice that the profit function is just a rescaled Hamilton function, as stated
in Lemma 6).
Note that having a phase of zero production as a part of the stable path is
possible only if the range of initial marginal costs is not bounded to be below
the choke price. In this phase instantaneous profits are negative, but total
discounted profits are positive; the firms invest in R&D because once marginal
costs are below the choke price, production becomes profitable. Any model
imposing marginal costs to be below the choke price at all times cannot capture
these dynamics.18
0.32
0.25
0.3
0.28
0.2
0.26
0.15
q
k
0.24
0.22
0.2
0.1
0.18
0.16
0.05
0.14
S2
0.12
0
0.2
0.4
0.6
0.8
1
1.2
1.4
0
1.6
0
0.5
1
1.5
2
c
t
(a)
(b)
1.6
1.4
0.32
0.1
0.3
0.09
0.28
0.08
0.26
0.07
0.24
0.06
2.5
3
3.5
4
k
c
1
0.8
0.6
Π
1.2
0.22
0.05
0.2
0.04
0.18
0.03
0.4
0.16
0.2
0
0.02
0.14
0
1
2
3
4
5
t
(c)
6
7
8
9
10
0.12
0.01
0
1
2
3
4
5
t
(d)
6
7
8
9
10
0
0
0.2
0.4
0.6
0.8
1
1.2
1.4
1.6
c
(e)
Figure 3: State-control space (a), time paths for quantity (b), marginal cost (c),
and R&D effort (d), and the correlation diagram between the total discounted
profit and marginal cost (e), respectively. All plots show curves for parameter
values (β, ρ, φ) = (1, 0.5, 4).
17 The shape of the stable path complies well to the empirical evidence of the level of
R&D activities over time. For instance, in his study of the petrochemical industry, Stobaugh
(1988) finds that the level of process innovation is rising over the first years after the product
innovation, whereas it is decreasing over time. Furthermore, he provides evidence that the
probability of the next process innovation being major innovation also decreases over time.
This relates to the concave shape of our stable path which implies that it takes increasingly
more effort over time to reduce marginal cost.
18 Related, the literature on optimal technology adoption predicts a substantial time lag
between the discovery of a new technology and its adoption (see e.g. Hoppe (2002) for an
overview). Delaying the adoption provides additional information as to the profitability of a
new technology; it is also rational if a better technology is expected to become available in the
near future (see also Doraszelski (2004)). In this literature, however, technological progress is
exogenous.
13
In the example of Figure 3, the firms optimally select an appropriate k such
that (c0 , k0 ) is located on the stable path. However, as the Arrow-Mangasarian
sufficiency conditions are not satisfied (see Remark 1), an investment policy that
follows the stable path is not necessarily optimal at all times. Indeed, it could
not even be a possible solution as the saddle-point steady state does not exist
for all parameter values (recall Proposition 1). That is, the solution structure
depends on the system’s parameters.
Figure 4 presents the bifurcation diagram, which shows the optimal R&D
investments for varying parameter values.19 Every combination of the exogenous
parameters yields a point on this diagram. It consists of five distinct regions,
each representing a particular structure of the set of optimal solutions. In
general, either there exists an upper bound of initial marginal cost below which
the firms selects the stable path towards the saddle-point steady state, or the
(new) technology is always phased out by firms (eventually) exiting the market.
Hence, if it exists, convergence to the saddle-point steady state along the stable
path is locally optimal only; it is never optimal globally.
Figure 5 differs from Figure 4 in that in the former β is fixed at 0.1 instead
of at 1. As the two figures indicate, changing β does not qualitatively change
our conclusions, yet it changes them quantitatively. Decreasing β (the degree of
spillovers), pushes the curves upward. This was expected as larger beta implies
larger complementarity of firms’ R&D efforts, which enhances the ability of the
firms to reduce investment costs in circumstances with a convex investment cost
function and so yields a higher return to their R&D.
Solutions to the optimization problem for parameters outside the shaded
region in Figure 4 are characterized by the presence of threshold values; the
stable path of the saddle-point steady state is optimal if the initial marginal
cost is below some threshold level, otherwise the exit trajectory is optimal. The
threshold point can be either an indifference point20 or a repeller. Indifference
points are initial states c = c0 for which the firm is indifferent between several
possible investment policies (i.e., the stable path and the exit trajectory). A
repeller differs from an indifference point in that at a repelling point, it is optimal
to stay at that point forever after.21
In Region I, the point of indifference is above one. Therefore, the system will
converge to the saddle-point steady state for all initial technologies c0 ∈ (0, 1].22
A typical example is depicted in Figure 6. The arrows indicate the direction of
motion along optimal trajectories. The two regions (production, no production)
are split by the dotted vertical line c = 1. For a given ρ and β, Region I is
characterized by a relatively high φ, that is, by relatively large demand and/or
high R&D efficiency. In such a favorable environment, the firms can compensate
for their early stage losses if, intitally, production is postponed as that would
yield negative mark-ups. There exists, however, a finite upper bound cˆ > 1
for the initial marginal costs c0 beyond which future profits are not enough to
19 The
bifurcation diagram in Figure 4 is drawn for the parameters of the non-rescaled model.
called Skiba points or DNSS points (see Grass et al., 2008).
21 More formally, a threshold point separates two basins of attraction: the initial values of
the marginal cost below these points constitute the basin of attraction of the saddle-point
steady-state; the initial values of the marginal cost above these points constitute the basin
of attraction to exit the market. An indifference point lies in both basins of attractions, a
repeller in neither.
22 Note that c = 0 corresponds to an unstable steady state; for this trivial value of the
0
marginal cost, the firms stay at that point forever.
20 Also
14
β=1
12
10
I
II
8
A
√
δ b6
IR
III
SN
ISN
4
IA
2
IV
V SN’
0
0
0.5
1
1.5
2
2.5
ρ/δ
3
3.5
4
4.5
5
Figure 4: Bifurcation diagram for β = 1. The uppermost curve represents
parameter values for which the indifference point is exactly at c = 1. SN
stands for the “saddle-node bifurcation curve”, SN’ for the “inessential saddlenode bifurcation curve”, IA for the “indifference-attractor bifurcation curve”,
and IR for the “indifference-repeller bifurcation curve”. ISN indicates the
“indifference-saddle-node point”, which is the point from which indifferenceattractor, indifference-repeller, saddle-node and inessential saddle-node curves
emanate. It’s coordinates are ρ˜ ≈ 2.14 and φ ≈ 4.1376. Roman numerals (IV) indicate the corresponding parameter regions discussed in the main text.
The shaded region indicates the parameter region for which firms always exit
the market. Notice that the axes are excluded from the admissible parameter
space. The curve representing indifference points at c = 1 obtains a value of
φ ≈ 1.41 for ρ˜ = 1e − 5.
compensate for short-run losses (ˆ
c ≈ 1.75 in the case considered). In that case,
the technology will never enter the market. In Figure 6, the point of indifference
(indicated by a dashed vertical line) occurs where the total profit function for
the stable path (Π1 ) in the right-hand picture obtains a value of zero (which
is the value of the total profit function for the exit trajectory Π2 in the region
c ≥ 1). As the total profit beyond this point is negative (the firms make a net
loss), the firms prefer no investment at all (giving them total profits of zero).
Starting with any initial value of the marginal cost below the indifference point,
the firms’ optimal decision is to follow the stable path. The exact position of
the indifference point is defined in Lemma 7.
Lemma 7. If an indifference point between the stable path and the exit trajectory
does exist in the region with zero production (c ≥ 1), it is given by the value
2
of the marginal cost cˆ ≥ 1 for which the point (c, k) = cˆ, (1+β)φ
lies on the
stable path. For c0 = cˆ, total discounted profits are zero for both trajectories.
15
β=0.1
12
III
IR
I
II
SN
10
8
ISN
A
√
δ b
6
IA
IV
4
SN’
V
2
0
0
0.5
1
1.5
2
2.5
3
ρ/δ
3.5
4
4.5
5
Figure 5: Bifurcation diagram for β = 0.1. The curve representing indifference
points at c = 1 obtains a value of φ ≈ 2.56 for ρ˜ = 1e − 5. The coordinates of
the ISN point are ρ˜ ≈ 2.14 and φ ≈ 7.523.
0.02
0.26
0.24
0.015
0.22
Π1
0.01
Π
k
0.2
0.18
0.005
0.16
Π2
0
0.14
0.12
−0.005
0.1
0.08
S2
0
0,5
1
1,5
1,75
2
−0.01
2,5
c
1
1,5
1.75
2
2,5
c
Figure 6: State-control space and the total profit function when the indifference
point is in the region with zero production. The plots are drawn for parameter
values (β, ρ, φ) = (1, 1, 6).
Proof. See Appendix I.
Region I is bounded from below by the curve representing those points for
which the point of indifference is exactly at c = 1. This particular situation is
depicted in Figure 7. For all initial values of the marginal cost c0 ∈ (0, 1), the
optimal-solution trajectory is the stable path of the saddle-point steady state,
while for c0 ∈ (1, ∞), it is optimal not to produce and invest anything. At c0 =
cˆ = 1, either strategy is optimal. The right-hand picture indicates that the total
profit Π1 for the stable path is zero at exactly c = 1. For comparison, the total
profit Π2 on the exit trajectory, which passes through the point (c, k) = (1, 0),
is plotted up to its conjugate point (defined later). We see that Π2 is smaller
16
than Π1 for c < 1, and it is equal to Π1 at the indifference point c = 1.
0.25
0.09
0.08
Π1
0.07
0.06
0.2
k
Π
0.05
0.04
0.03
0.15
0.02
0.01
Π2
0
S2
0.1
0.2
0.4
0.6
0.8
1
1.2
−0.01
1.4
c
0
0.2
0.4
0.6
0.8
1
1.2
c
Figure 7: State-control space and the total profit functions when the indifference
point is exactly on the boundary between the two regions (ˆ
c = 1). The plots are
drawn for parameter values (β, ρ, φ) = (1, 1, 4.7).
The stable path in the region with zero production (c ≥ 1) is characterized by
a decreasing k and an increasing c as t → −∞ (see (41)-(42) and Figure 6). From
Lemma 7 follows that the indifference point between the stable path and the exit
trajectory is exactly at c = 1 when the stable path reaches k = 2/ ((1 + β)φ)
exactly at c = 1 (see Figure 7). As investment is increasing in φ (for a given ρ and
β), we can find for every ρ and β a high enough φ such that the indifference point
is either at c = 1 or in the region with c ≥ 1. A higher φ for a given ρ and β in
Region I just moves the indifference point further beyond c = 1. Consequently,
Region I is not bounded from the right. Moreover, as k decreases along the
stable path as t → −∞, at some instant the stable path, having a value of k
at c = 1 above k = 2/ ((1 + β)φ)), reaches the point with k = 2/ ((1 + β)φ) for
some c > 1 as t → −∞. Therefore, for parameters in Region I, we always have
an indifference point at some c = cˆ ≥ 1 and, in particular, the convergence to
the saddle-point steady-state is for these parameters guaranteed if c0 ∈ (0, 1],
but not for all c0 ∈ (1, ∞).
In Region II the indifference point cˆ is below c = 1. This implies in particular that at the indifference point the monopolist produces a positive quantity.
Figure 8 illustrates this case. The indifference point (indicated by the dashed
vertical line) occurs where the two respective total profit functions (Π1 for the
stable path, Π2 for the exit trajectory) intersect, as indicated in the right-hand
picture. For the particular parameter values considered in Figure 8, the indifference point occurs at cˆ ≈ 0.59. Although it is also profitable to invest in R&D for
some initial technology level c0 ∈ (ˆ
c, 1), the technology is not promising enough
for the firms to select the stable path towards the saddle-point steady-state.
Rather, they invests in R&D at some smaller rate that retards the decay of the
technology level optimally; but eventually the technology will leave the market,
and the firms with it.23
For an even higher initial level of marginal cost (c0 ≥ 1), the firms do not
initiate any activity; the new technology is not developed at all and the firms
do not enter the market.24
23 Note that if c ∈ (ˆ
c, 1), the alternative strategy of producing but not investing anything
0
in R&D, which corresponds to ‘following’ the c-axis below the exit trajectory in Figure 8,
yields lower total discounted profits than following the exit trajectory (recall Lemma 3).
24 Conjugate points P and P indicate the last element on the respective trajectory (when
1
2
17
0.25
0.06
0.05
Π1
0.2
0.04
0.03
0.15
P1
P2
S3
k
Π
S2
0.1
0.02
0.01
P2
Π2
0
0.05
−0.01
0
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
1.1
−0.02
0.2
1.2
c
P1
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
c
Figure 8: State-control space and the total profit functions when the indifference
point is within the region with positive production. The plots are drawn for
parameter values (β, ρ, φ) = (1, 1, 3.5). The optimal-solution trajectories are
marked by thick lines. The two candidates for the optimal solution, which both
spiral out of the unstable steady state S3 , are the stable path and the exit
trajectory.
Incidently, it is not uncommon to observe continuous incremental improvements of an existing technology while it has been decided that it will be taken
off the market at some future instant.
An interesting example of a dying industry is ice harvesting, which was
destined to disappear after the emergence of machine-made ice. According
to Utterback (1994), the ice harvesters were however still making numerous
improvements in harvesting, storing and delivery processes (e.g., steam-powered
saws and conveyors were introduced), which resulted in greater volume and
lower unit cost of ice. Similarly, even as the door was closing for producers of
gas lamps after the electric light bulb had been invented by Edison, this did
not stop their continuous process innovation. Utterback reports how among
other improvements the Welsbach mantle, which increased the efficiency of gas
lighting by five times, was successfully introduced. Yet another example is the
production of dry plate photography, which was put on the road to extinction
by Kodak roll film cameras. The old technology still found ways of improving
itself (e.g., by introducing self-setting shutters, small plate cameras, celluloid
substitutes for glass). See Utterback (1994) for numerous other examples of
industry dynamics. These examples in which producers in a dying industry
still invest in their products give much support for exit trajectories that are
characterized by positive R&D efforts. As already noted in the discussion of
Region II, our model shows that the ‘exit trajectory investment schedule’ leading
up to eventually discontinuing the product gives a higher total profit than the
investment schedule that is characterized by zero investment along the way of
exiting the market. Hence, theoretical predictions conform well to real world
practice.
Region III is similar to Region II in that for all c ≥ 1 the firm does not
initiate any activity and only for some small enough c ∈ (0, 1) there is converstarting from its end) where the marginal cost is still increasing/decreasing. Considering a
particular trajectory beyond this point would give us two possible k-points for each c, of which
the one beyond the P point is suboptimal. Observe that in the plot of the profits, the value
of the total profit for the exit trajectory (starting from its end) increases all along the way to
P2 (where it obtains a cusp point), beyond which it starts decreasing.
18
gence to the saddle-point steady state. There is, however, a small qualitative
difference between the two regions. Namely, in Region III, the threshold point
is not an indifference point but a repeller. Region III is separated from Region
II by a type-2 indifference-repeller bifurcation curve (IR1 (2)), at which a repeller changes into an indifference point (see Hinloopen, Smrkolj, and Wagener
(2011)).
Region IV√is separated from Region III by a saddle-node bifurcation curve,
given as φ = 32ρ/(1 + β). This curve represents parameter combinations for
which the saddle and the unstable node S3 coincide and form a semi-stable
steady state. After the bifurcation, both the saddle and the unstable node
disappear. In such a case (see Figure 9), the only solution left is the exit
trajectory: for c0 ≥ 1 the firms never initiate any activity, while for c0 ∈ (0, 1)
the firms do invest in R&D but they will leave the market at some future instant.
This is again intuitive as in Region IV we have that φ is relatively small (for a
given ρ and β). That is, R&D is relatively inefficient and/or market demand is
relatively small. Note that in Figure 9 R&D investments are initially increasing
over time for very low values of marginal costs. These investments reduce the
speed at which the firms will leave the market; for low initial marginal cost it
turns out to be profitable to slow down this process in order to profit optimally
from low marginal costs.
0.12
0.12
0.1
0.08
k
Π
0.1
0.08
0.06
0.06
0.04
0.04
0.02
0.02
0
0
0.2
0.4
0.6
0.8
1
1.2
0
1.4
c
0
0.2
0.4
0.6
0.8
1
1.2
1.4
c
Figure 9: State-control space and the total profit function for parameters of the
shaded region. Parameter values are (β, ρ, φ) = (1, 1, 2.5). The exit trajectory
covers the whole space and it is the only solution. Observe that the total
discounted profit is positive for c < 1.
Finally, Region V resembles Region IV in that only the exit trajectory is
optimal. This is in spite of the fact that points in Region V have higher φ
values than points in Region IV with the same value of ρ and β, and that the
saddle-point steady state (S2 ) together with its stable path exists. The exit
trajectory always yields a higher discounted profit flow than the stable path.
Consequently, the saddle-point steady state is not even locally optimal. The
IA1 curve that bounds Region V from above is a so-called indifference-attractor
bifurcation curve (see Hinloopen, Smrkolj, and Wagener (2011) for details).
The curve bounding Region V from below is an inessential saddle-node bifurcation curve (SN01 ). It is similar to the SN1 curve in that two new steady
states appear when crossing it. However, it is inessential for the optimization
problem in the following sense. The bifurcation does not entail a qualitative
change of the optimal strategy, as the steady states involved in the SN01 bifurca19
tion are not associated with the optimal strategy; as above, the exit trajectory
remains the optimal trajectory. Consequently, while the state-control system
does bifurcate, the structure of the optimal solutions does not.
Together, Region IV and Region V (the shaded region) represent the parameter region for which there is never convergence to the saddle-point steady
state because it is either not optimal to approach it (Region V ) or because there
is no saddle-point steady state at all (Region IV ). Moving from the shaded region, where the firms always exit the market, by increasing φ while keeping ρ
and β fixed, we arrive at Region II, where staying in the market is possibly
an optimal solution depending on the initial level c0 of the marginal cost. The
interpretation is straightforward: staying in the market becomes an option if
R&D becomes more efficient and/or if market demand increases.
4
The R&D Cartel
We now turn to analyze the R&D cartel. We still assume that each firm
operates its own R&D laboratory and production facility. However, while
firms select their output levels non-cooperatively, they adopt a strictly cooperative behavior in determining their R&D efforts so as to maximize their joint
profits. These assumptions amount to imposing a priori the symmetry condition ki (t) = kj (t) = k(t).25 As ci (0) = cj (0) = c(0), this implies that
ci (t) = cj (t) = c(t). Equation (8) thus reads as:
c˙ = c(1 − (1 + β)φk).
(44)
The instantaneous profit of firm i is:
πi (qi , qj , k, c) = (1 − qi − qj − c)qi − k 2 ,
(45)
yielding its total discounted profit over time:
Z ∞
Πi (qi , qj , k, c) =
πi (qi , qj , k, c)e−ρt dt.
(46)
0
We look for feedback (also called closed-loop) Nash equilibria. As firms
cooperatively decide on their R&D efforts, the only competitive decisions are
that of production levels. However, as quantity variables do not appear in
the equation for the state variable (44), production feedback strategies of a
dynamic game are simply static Cournot-Nash strategies of each corresponding
one-period game.
Solving maxqi ∈R+ πi gives us standard Cournot best-response functions for
the product market:
(
1
qj < 1 − c,
2 (1 − c − qj ) if
(47)
qi (qj ) =
0
if qj ≥ 1 − c,
25 In this paper we are looking only for symmetric equilibria. However, it is perhaps possible
that joint profits could be increased if firms made unequal investments and produce unequal
amounts. Such an asymmetric strategy would raise investment costs (the R&D cost function
is convex) but might create more than offsetting benefits in the production stage. Salant and
Shaffer (1998) consider this possibility within the framework of a static model of strategic
R&D.
20
where the second part comes from the boundary condition for the non-negative
production levels qi,j ≥ 0. Solving for the Cournot-Nash production levels, we
obtain
(
1
c < 1,
N
3 (1 − c) if
(48)
q =
0
if c ≥ 1.
Consequently, the instantaneous profit of each firm is
(
1
2
2
if c < 1,
9 (1 − c) − k
π(c, k) =
2
−k
if c ≥ 1.
(49)
Then, the optimal control problem of the R&D cartel is to find control k ∗ (R&D
effort of each firm) that maximizes the total discounted joint profit of the two
firms taking into account the state equation (44), initial condition c(0) = 0, and
the boundary condition k ≥ 0 which must hold at all times. Note that according
to (44), if c0 > 0, then c(t) > 0 for all t. The set of all possible states at each
time t is given by c ∈ [0, ∞).26
To solve the dynamic optimization problem, we introduce the current-value
Pontryagin function27
(
1
2
2
c < 1,
9 (1 − c) − k + λc(1 − (1 + β)φk) if
(50)
P (c, k, λ) =
2
−k + λc(1 − (1 + β)φk)
if c ≥ 1.
where λ is the current-value co-state variable of a firm in the R&D cartel. It
measures the marginal worth of the increment in the state c for each firm in
the cartel at time t when moving along the optimal path. As marginal cost is a
‘bad’, we expect λ(t) ≤ 0 along optimal trajectories.
Pontryagin’s Maximum Principle states that if the pair (c∗ , k ∗ ) is an optimal
solution, then there exists a function λ(t) such that c∗ , k ∗ , and λ satisfy the
following conditions:
1.) k ∗ maximizes the function P for each t:
P (c∗ , k ∗ , λ) = max P (c∗ , k, λ),
k∈R+
(51)
2.) λ is a solution to the following co-state equation:
−
∂P
= λ˙ − ρλ
∂c
(52)
which is evaluated along with the equation for the marginal cost
c˙ = c(1 − (1 + β)φk)
(53)
and the initial condition c(0) = c0 .
26 In other words, firms that jointly decide on their R&D efforts know that as the result
of this cooperation, the marginal cost of both firms will be the same and that their product
market profits directly depend on it (recall (49)). Knowing this, they will select such a time
path of R&D efforts, and through them the path of their marginal cost over time, that their
joint profits, which in our symmetric case they share equally, will be the highest possible.
27 We again omit a factor of 2 for combined profits to obtain the solution expressed in
per-firm values.
21
Let k = K(c, λ) solve the problem maxk P (c, k, λ) for every (c, λ). We define the current-value Hamilton function H(c, λ) = P (c, K(c, λ), λ). The above
necessary conditions (51)-(52) for an optimal solution are complemented by the
transversality conditions:
lim e−ρt H(c, λ) = 0
(54)
lim e−ρt λc = 0.
(55)
t→∞
and
t→∞
If problem (51) has a solution, this solution necessarily satisfies the following
Karush-Kuhn-Tucker condition:
∂P
= −2k − (1 + β)φλc ≤ 0,
∂k
Conditions (56) imply that either i) k = 0 and
k > 0 and ∂P
∂k = 0. In particular,
(
− (1+β)φ
λc
if
2
k∗ =
0
if
k
∂P
∂k
∂P
= 0.
∂k
(56)
≤ 0 (implying λ ≥ 0) or ii)
λ ≤ 0,
λ > 0.
(57)
The two-part composition of the Pontryagin function leads us to differentiate
between two regimes of the state-control system. The first one is characterized
by positive production, whereas the second one corresponds to no production.
1. The system with positive production. In the region c < 1, using (52), (53),
and (57), we obtain the following optimal state and co-state dynamics:
(
c˙ = c 1 + 21 (1 + β)2 φ2 λc ,
(58)
λ˙ = 2 (1 − c) + ρ − 1 − 1 (1 + β)2 φ2 λc λ,
9
2
where λ ≤ 0. Equation (57) depends on the co-state and state variable. Note
that the correspondence between k and λ is one-to-one if, and only if, λ ≤ 0,
and that therefore the system can be rewritten in the state-control form. Differentiate equation (57) with respect to time to obtain the dynamic equation
dk
(1 + β)φ ˙
≡ k˙ = −
λc + λc˙ .
dt
2
(59)
Substitute λ˙ from (52) and c˙ from (53). Note that equation (57) also implies
that
2k
λ=−
.
(60)
(1 + β)φc
Substitute this expression into (59) to obtain:
(1 + β)φ
k˙ = ρk −
c(1 − c).
9
(61)
Hence, the system with positive production (c < 1) in the state-control form
consists of the following two differential equations:
(
k˙ = ρk − (1+β)φ
c(1 − c),
9
(62)
c˙ = c (1 − (1 + β)φk) .
22
2. The system with zero production. In the region c ≥ 1, substituting (52),
(53), and (60) into (59) gives us the following expression:
k˙ = ρk.
(63)
Hence, the state-control system with zero production consists of the following
two differential equations:
(
k˙ = ρk,
(64)
c˙ = c (1 − (1 + β)φk) .
The state-costate analogue is:
(
c˙ = c 1 + 21 (1 + β)2 φ2 λc ,
λ˙ = ρ − 1 − 1 (1 + β)2 φ2 λc λ.
(65)
2
The Hamilton function, obtained by substituting (57) into the Pontryagin function (50), is given by
(
H(c, λ) =
1
9 (1
− c)2 + 14 (1 + β)2 φ2 λ2 c2 + λc
1
4 (1
2 2 2 2
+ β) φ λ c + λc
if
c ∈ [0, 1),
if
c ∈ [1, ∞),
(66)
where λ ≤ 0. The state-control analogue is then
H(c, k) =



1
9 (1
2
− c)2 + k k − (1+β)φ
2
k k − (1+β)φ
if
c ∈ [0, 1),
if
c ∈ [1, ∞),
(67)
where k ≥ 0.28
4.1
Steady-state solutions
The steady-state solutions to (62) and (64) are obtained by imposing the stationarity conditions k˙ = 0 and c˙ = 0.
28 Our closed-loop solution differs from that of Cellini and Lambertini (2009), who consider
the case when marginal cost is always lower than the choke price. This is so because their
proof that the open-loop and closed-loop solutions coincide is flawed by the fact that in
their derivation of the closed-loop solution, players’ output choices are not properly treated
as functions of the state variable. The derivations of the authors implicitly assume that
if marginal cost within the R&D cartel changes, the opponent’s quantity does not change,
which is the violation of the feedback principle underlying the closed-loop solution. It is also
counterintuitive as firms in the R&D cartel are supposed to jointly decide on their R&D effort
taking into account that marginal cost in any period affects the Nash-equilibrium profit in the
product market. This profit is a function of the cartel’s marginal cost and reflects changes
in outputs of both players. Our calculations also show that the authors’ solution leads to
situations in which the profit of the R&D cartel can be higher than that of the full cartel.
We shall question any such a result as the full cartel can in principle always do what a lower
form of cooperation (R&D cartel) does if this brings about a higher profit. We therefore stick
to our closed-loop solution, which does not suffer from this oddity, for all future comparisons
between regimes.
23
Lemma 8. The ‘no-production’ state-control system (64) has no steady state
in the region where it is defined.
Proof. See Appendix B.
Consider now system (62). Imposing the stationarity condition k˙ = 0, we
obtain
(1 + β)φ
k RC =
c(1 − c) ≥ 0
∀c ∈ [0, 1],
(68)
9ρ
where superscript RC stands for R&D Cartel.29 Steady-state marginal cost
follows from inserting (68) into the state dynamics (53) and imposing the stationarity condition c˙ = 0:
(1 + β)2 φ2
c˙ = c 1 −
c(1 − c) = 0.
(69)
9ρ
This yields:
cRC = 0,
where
cRC =
1
± V,
2
(70)
s
36ρ
.
(71)
(1 + β)2 φ2
√
Observe that V is real if and only if φ ≥ 6 ρ/(1 + β), in which case V ∈ [0, 12 )
RC
and c
∈ [0, 1). The solution to the system is summarized in Proposition 2.
√
Proposition 2. If φ > 6 ρ/(1 + β), the state-control system with positive production (28) has three steady states:
1
V =
2
1−
i) (cRC , k RC ) = (0, 0) is an unstable node,
1
is either an unstable node or an unstable foii) (cRC , k RC ) = 21 + V, (1+β)φ
cus, and
1
iii) (cRC , k RC ) = 12 − V, (1+β)φ
is a saddle-point steady state.
√
At φ = 6 ρ/(1 + β), a so-called “saddle-node bifurcation” occurs as the last two
steady states collide and form a so-called “semi-stable” steady state.
√
If φ < 6 ρ/(1 + β), the state-control system with positive production has one
single steady state: the origin (cRC , k RC ) = (0, 0), which is unstable.
In the saddle-point steady state, instantaneous marginal cost, output, and
profit of each firm are, respectively:
p
(1 + β)φ − (1 + β)2 φ2 − 36ρ
RC
c
=
,
(72)
2(1 + β)φ
p
(1 + β)φ + (1 + β)2 φ2 − 36ρ
q RC =
,
(73)
6(1 + β)φ
p
(1 + β)2 φ2 − 18(1 + ρ) + (1 + β)φ (1 + β)2 φ2 − 36ρ
RC
π
=
.
(74)
18(1 + β)2 φ2
29 All
steady-state values have superscript RC.
24
5
Market Collusion and Incentives to Innovate:
Full Cartel and R&D cartel confronted
The graphs showing different kinds of industry dynamics, each corresponding
to one of the five distinct regions of the bifurcation diagram, are for the R&D
cartel qualitatively the same as the ones for the full cartel.30 The bifurcation
diagram for the R&D cartel is together with its full cartel counterpart depicted
in Figure 10. Observe that the bifurcation curves for the full cartel lie below
those for the R&D cartel, and this so over the entire parameter space. The
relatively lower position of the shaded region (bounded above by the saddlenode and indifference-attractor bifurcation curves) for the full cartel leads to
our first main result of the global comparisons of the two regimes.
Result 1. For a given discount rate, it takes a larger market size and/or
efficiency or R&D to make staying in a market a possibility for firms in the
R&D cartel than it does for firms in the full cartel.
This result stems from the fact that in the shaded region, the exit trajectory
is the only solution. What is especially interesting to notice is the relative
position of the curves indicating the point of indifference being exactly at c = 1.
The curve for the full cartel lying below its R&D cartel counterpart implies the
existence of the situation in which, for a given parameter combination, the full
cartel regime has an indifference point at c > 1, while in the R&D cartel regime
the indifference point is at some c < 1. This relates to our second main result.
Result 2. Whenever there is a threshold value of initial marginal cost in
both regimes, this value is for the full cartel strictly larger than for the R&D
cartel, for every point in the parameter space.
The above result implies that, for a given market market size and efficiency
of R&D, the full cartel is ex ante more likely to bring a particular technology
to a production phase and less likely to exit the market. This fact is further
discerned by Result 3 and Result 4 below.
Result 3. The set of initial marginal costs that will induce the formation
of a new market is larger if besides cooperating in R&D, firms also cooperate in
the product market.
We already noted that the point of indifference in the region with c ≥ 1 is an
important concept. It determines the highest value of the initial marginal cost
for which the firms still initiate investment. If the firms find out that the initial
marginal cost (reflecting the starting level of the technology) is higher than this
upper bound, they do not go into development and the particular technological
idea is left unrealized - the market is not formed.
Figure 11 shows that this indifference point is for the full cartel at the higher
value of the initial marginal cost (c20 ) than for the R&D cartel (c10 ). Hence,
technologies corresponding to the marginal costs between the two indifference
30 Remark
1, Lemmas 3 - 7, and Corollary 1 apply also to the R&D cartel. Corresponding
proofs are analogous to their full cartel counterparts. Any differences between the proofs are
indicated within appendices corresponding to the full cartel.
25
25
I
20
II
15
IR
A(1+β)
√
δ b
III
SN
10
ISN
IA
IV
5
V
0
0
SN’
0.5
1
1.5
2
2.5
ρ/δ
3
3.5
4
4.5
5
Figure 10: Bifurcation diagram. The bifurcation curves for the R&D cartel
(blue) are drawn together with the bifurcation curves for the full cartel (red).
points (in the interval of a nontrivial size (c10 , c20 )) are the ones that are developed
only if besides cooperating in R&D, firms also cooperate in the product market.
The intuition is as follows. The firms are prepared to incur initial losses only
if they know they will make large enough profits to compensate for these losses
once the production has been initiated. The higher the value of the initial
marginal cost, the more investments it takes to reduce the marginal cost to the
level where the production can start (below the choke price) and so the larger
the initial losses that have to be compensated for. As the firms in the full cartel
are able to extract larger profits from the product market than the firms in
the R&D cartel (which earn only competitive profits), the firms engaged in the
full cartel can compensate for larger initial losses than the firms cooperating in
R&D only.31
As Figure 11 shows, the total discounted profit of the R&D cartel is for
c0 ∈ (c10 , c20 ) zero as firms don’t optimally do anything.32 However, the total
profit of the full cartel is positive. Hence, the full collusion appears endogenously
as a voluntary agreement through which both firms expect to benefit. Moreover,
as the plots in the figure show, this agreement is beneficial also from the social
standpoint as without it, consumers, as well as firms, are left empty-handed
(total discounted profit, consumer surplus, and total surplus are all zero for the
R&D cartel when c0 ≥ c10 ).
31 The assumption that firms know the market demand before the production stage has even
started may seem strong. We could relax it by postulating that firms form expectations about
future demand. However, while adding to the complexity of the model, this would not change
our conclusions as long as these expectations are assumed to be identical between the two
regimes; and we can not think of any good reason for not assuming so.
32 If the firms started developing the product, they would in total make a loss.
26
1
0.5
0.9
0.45
0.8
0.7
0.4
Π
k
0.6
0.35
0.5
0.4
0.3
0.3
0.2
0.25
0.1
0.2
0
0.5
1
c
c10
1.5
2
c20
0
2.5
0
0.5
1
(a)
c
c1
1.5
2
1.5
2
0
2
c0
2.5
(b)
4
2.2
3.5
2
3
1.8
2.5
TS
CS
1.6
1.4
2
1.5
1.2
1
1
0.8
0.5
0.6
0
0
0.5
1
c
c10
1.5
2
c2
2.5
0
0.5
1
c
0
(c)
c10
c20
2.5
(d)
Figure 11: State-control space (a), total discounted profit (b), consumer surplus
(c), and total surplus (d), when the indifference point is in the region with zero
production. The plots are drawn for parameter values (β, ρ˜, φ) = (1, 0.1, 2.25).
Blue curves correspond to the R&D cartel, whereas the red ones correspond to
the full cartel.
Figure 12 illustrates Result 3 further by comparing the position of the indifference point for different parameter combinations. The indifference point is for
the R&D cartel always below the one for the full cartel. For a given discount
rate, increasing the market size and/or the efficiency of R&D, increases the value
of the indifference point and so broadens the range of initial marginal costs that
lead firms to select the stable path. It also increases the discrepancy between
the two regimes (∆2 cˆ > ∆1 cˆ); the more so, the lower the discount rate (and so
the higher the slope of the curves). This is intuitive. The surplus created by
R&D that can be appropriated by the firms in the future (once the production
has started) is for the full cartel larger than for the R&D cartel. And the lower
the discount rate, the higher the value of this future surplus for firms.
Figure 12 also shows that increasing the discount rate, for a given market
size and efficiency of R&D, pushes the indifference point downwards (from C1
to C2 ) as it reduces the value of the future profits. This increase in the discount
rate can in principle be offset by sufficiently increasing the market size and/or
the efficiency of R&D, as indicated in the figure.
At Figure 11 we showed that for c0 ∈ (c10 , c20 ), the technology is developed
only by the full cartel. Assuming c0 = 2 (observe that 2 ∈ (c10 , c20 )), Figure 13
shows how marginal cost and price are continuously decreasing over time to their
27
10
9
ρ/δ = 0.1
ρ/δ = 0.5
8
7
cˆ
6
5
ρ/δ = 1
∆2 cˆ
C1
4
3
2
1
2
∆1 cˆ
4
C2
6
8
10
12
14
16
A(1+β)
√
δ b
Figure 12: Dependence of the indifference point cˆ on model parameters. Curves
are drawn for three fixed values of ρ˜. Curves for the R&D cartel (dotted) lie
below the curves for the full cartel (full).
long-run steady-state levels (0.0412 and 0.5206, respectively). We also plot the
so-called Lerner Index which measures a divergence of price from marginal cost
and is traditionally used as a short-cut estimator of the monopoly power and, as
prevalently claimed, its corresponding welfare loss. Its value is increasing over
time to the ‘high’ long-run level of 0.92. This increasing discrepancy between
price and marginal cost is a consequence of the cartel increasing its mark-up
over time to compensate for investments made before the production stage as
well as to cover its current R&D expenses. This is clear from the plot of the
instantaneous profit against the total discounted profit in Figure 13.
Observe that it is only after a while that the instantaneous profit becomes
positive. Its potential high size combined with a high Lerner Index may raise
eyebrows of antitrust economists worrying about resource misallocation. We
should note two things. First, while the instantaneous profit could well be
judged high, focusing on it, one overlooks huge R&D expenditures made over
years (let alone all the mental effort). Notably, the total discounted profit is
much lower: 0.0131 in our case (see Figure 13). In fact, the latter will be closer
to zero, the closer to the indifference point the initial marginal cost is. Second, at
Figure 12 we saw how market characteristics that affect future production (and
so profitability) affect the position of the indifference point. Hence, if market
regulations reduce mark-ups (for instance, by dismantling the product market
cartel), the range of initial technologies that are developed further shrinks (c20 ↓).
It can then well happen that in our case 2 > c20 and the firms are not able to
cover the costs already incurred in R&D. One should note that in our example,
the full cartel created the market and did not just take over the preexisting
28
2
0.05
1.8
Π
1.6
0
1.4
−0.05
1.2
LI
1
π
−0.1
0.8
p
0.6
−0.15
0.4
−0.2
0.2
c
0
0
2
4
6
8
10
12
14
16
18
−0.25
20
0
2
4
6
8
10
t
t
(a)
(b)
12
14
16
18
20
Figure 13: Marginal cost, price, and Lerner Index (a); total discounted profit and
instantaneous profit (b), for the full cartel. The plots are drawn for parameter
values (β, ρ˜, φ) = (1, 0.1, 2.25) and starting point c0 = 2.
one. Had firms not been able to cooperate also in the product market, the
market would not even have been formed (recall that 2 > c10 ). Hence, the
Lerner Index appears misleading from a correct dynamic perspective as it clearly
cannot indicate ‘social inefficiencies’ in any meaningful way. As shown, (larger)
deviations of price from marginal cost are not a source of inefficiencies but a
requirement for technology to be developed and a market to be formed at all.33
Result 4. The set of initial marginal costs that induce firms to abandon the
technology in time is larger if firms cannot collude in the product market.
Result 4, corresponding to the indifference point in the region with positive
production, is illustrated in Figure 14. For all initial marginal costs in interval
(c10 , c20 ), the firms engaged in the R&D cartel gradually exit the market, whereas
the firms engaged in the full cartel continue to reduce marginal costs and stay in
the market. The revenue the firms in the R&D cartel can potentially earn in the
product market is lower than that of the full cartel. Consequently, the relatively
large expenses needed to stay in the market (to countervail depreciation) become
for the R&D cartel ‘too high’ sooner than for the full cartel. That is, the full
cartel is more resistant to strained investment circumstances.34
33 In the literature, the Lerner Index has been subject to several critiques. Pindyck (1985)
claims that characteristics of dynamic markets (e.g., technological change, innovation, learning
by doing) cause the Lerner Index to give a distorted picture of a firm’s monopoly power. The
idea that deviations of price from marginal cost may arise from the need to cover incurred
investment costs can be traced back to Lindenberg and Ross (1981):“...[The Lerner Index]
does not recognize that some of the deviation of P from MC comes from ... the need to
cover fixed costs and does not contribute to market value in excess of replacement cost” (p.
28). The argument that marginal pricing may be neither feasible nor desirable in technologydriven industries has been occasionally recognized by courts, e. g., in US v. Eastman Kodak
(1995), where the court concludes that “Kodak’s film business is subject to enormous expenses
that are not reflected in its short-run marginal costs”. See Elzinga and Mills (2011) for a
comprehensive survey of limitations and qualifications of the index. For a critical discussion
of numerous antitrust cases in business history, see Armentano (1999).
34 Strained in the sense of a relatively high depreciation rate in comparison with the market
size and efficiency of R&D.
29
0.5
1.2
0.45
0.4
1
0.35
0.8
Π
k
0.3
0.25
0.6
0.2
0.4
0.15
0.1
0.2
0.05
0
0
0.1
0.2
0.3
0.4
0.5
c10
0.6
0.7
0.8
0.9
0
1
0
0.1
0.2
0.3
0.4
c20
c
0.5
0.6
0.7
0.8
0.9
1
0.6
0.7
0.8
0.9
1
c
(a)
(b)
2.5
4.5
4
2
3.5
3
TS
CS
1.5
2.5
2
1
1.5
1
0.5
0.5
0
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
0
1
0
0.1
0.2
0.3
0.4
0.5
c
c
(c)
(d)
Figure 14: State-control space (a), total discounted profit (b), consumer surplus (c), and total surplus (d), when the indifference point is within the region with positive production. The plots are drawn for parameter values
(β, ρ˜, φ) = (1, 0.1, 2). Blue curves correspond to the R&D cartel, whereas the
red ones correspond to the full cartel. Full curves correspond to the stable path,
whereas the dotted ones to the exit trajectory. Dots indicate the saddle-point
steady state.
Result 5. When the stable path to the saddle-point steady state is optimal
in both regimes, the steady-state value of marginal cost is for the full cartel lower
than for the R&D cartel. The full cartel always invests more in R&D, be it along
the stable path or the exit trajectory.
From the fact that for every given marginal cost, the trajectory for the full
cartel is characterized by a higher R&D effort (see Figure 11, 14, and 15), the
following result follows.
Result 6. For a given initial value of marginal cost, firms in the full cartel
enter production phase earlier than firms in the R&D cartel. On the contrary,
the speed at which firms abandon non-prospective technologies is lower when
firms also collude in the product market than it is when they do not.
As Result 5 states, the full cartel always achieves a higher production efficiency than does the R&D cartel when they both follow the stable path.35
35 This
follows directly from comparing the steady-state value of the marginal cost of the
30
However, when both regimes follow the stable path, the partial cartel is socially preferred to the full cartel as the higher efficiency brought about by the
full cartel is not relatively higher enough to compensate for the inefficiency
arising by its relatively lower output, and it therefore does not translate into
either higher consumer surplus or total surplus. Likewise, when both regimes
are about to exit the market, socially the R&D cartel is preferred. While the
full cartel indeed abandon production relatively more slowly36 (Result 6 ), this
does not outweigh its relatively lower instantaneous production. Consequently,
both consumer surplus and total surplus are lower for the full cartel (see also
Figure 15).
0.1
0.14
0.09
0.12
0.08
0.1
0.07
0.08
Π
k
0.06
0.05
0.06
0.04
0.03
0.04
0.02
0.02
0.01
0
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
0
1
0
0.1
0.2
0.3
0.4
0.5
c
c
(a)
(b)
0.6
0.7
0.8
0.9
1
0.6
0.7
0.8
0.9
1
0.45
0.25
0.4
0.2
0.35
0.3
TS
CS
0.15
0.25
0.2
0.1
0.15
0.1
0.05
0.05
0
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
0
1
0
0.1
0.2
0.3
0.4
0.5
c
c
(c)
(d)
Figure 15: State-control space (a), total discounted profit (b), consumer surplus
(c), and total surplus (d), when the exit trajectory is an optimal solution. The
plots are drawn for parameter values (β, ρ˜, φ) = (1, 1, 2). Blue curves correspond
to the R&D cartel, whereas the red ones correspond to the full cartel.
However, as Figure 14 shows, the full cartel brings about a higher total
full cartel with that of the R&D cartel (equations (38) and (72), respectively). The reason for
this difference is that while both regimes indeed exert the same R&D effort in the steady state
(the level needed to countervail depreciation), the investment paths of regimes are different.
The full cartel invests more in R&D as it is able to appropriate a higher surplus resulting
from it. This result stands in stark contrast to a strand of literature which claims that a lack
of competition in the product market will through a lack of investment motivation lead firms
to rest content with old methods and be engaged in x-inefficiency (a term first introduced by
Leibenstein (1966).
36 The intuition for this is that, having a monopoly power, the full cartel has still more to
extract from the market and therefore has incentives to leave slower.
31
surplus and also consumer surplus for initial marginal costs in interval (c10 , c20 ).
Recall that for these costs, the R&D cartel eventually exits the market, while
the full cartel follows the stable path and so stays in the market. Continuous
reductions in marginal costs along the stable path by the full cartel eventually
translate into a relatively higher output and in the end also into a higher total
discounted consumer surplus and total surplus. These benefits of the full cartel,
accruing from increasing efficiency over time, diminish with increases in the
discount rate. As Figure 16 indicates, for a relatively higher discount rate, the
R&D cartel can (for some subset of (c10 , c20 )) lead to a higher total surplus when
market size and/or R&D efficiency are sufficiently small even though the firms
engaged in it are destined to exit the market. With a lower market size and
efficiency of R&D, the full cartel itself invests less along the stable path and
thereby brings about lower gains in efficiency; and this future gains are of a
lower value, the higher the discount rate.
0.1
0.2
0.08
0.06
TS
TS
0.15
0.1
0.04
0.05
0.02
0
0.6
0.65
0.7
0.75
0
0.8
0.2
0.25
0.3
0.35
0.4
0.45
c
c
(a)
(b)
0.5
0.55
0.6
0.65
Figure 16: Total surplus for the R&D cartel (blue curves) and full cartel (red
curves) for two different combinations of parameter values: (β, ρ˜, φ) = (1, 1, 4)
in the left figure and (β, ρ˜, φ) = (1, 1, 3.14) in the right figure. Full curves
correspond to the stable path, whereas the dotted ones to the exit trajectory.
Dots indicate the saddle-point steady state.
6
Discussion and concluding remarks
Great benefits of R&D cooperations proven both in theory and practice have led
policy makers not to treat such initiatives as per se violations of the antitrust
law (as it holds for price-fixing agreements) but to consider them on a rule-ofreason basis. A possible extension of cooperation in R&D to that in the product
market has been considered as something that must be prevented at all costs.
In their seminal paper, d’Aspremont and Jacquemin (1988) conclude, within
their static model of R&D, that while the amount of R&D in the case of collusion
in both R&D and product market is higher than in the case of R&D cooperation
only, the quantity produced is still lower. Petit and Tolwinski (1999), who
consider a dynamic model of R&D, conclude the same. Moreover, a higher price
under full collusion leads in their model to a reduction in a consumer surplus
and to the lowest total surplus among regimes considered. Consequently, the
32
authors conclude that “...[full collusion] is socially inferior to other forms of
industrial structures” and “...thus, antitrust legislation is important.” (p. 206).
Our global analysis shows that the received literature suffers from neglecting
the very possibility that the initial marginal cost is higher than the choke price.
By assuming that a particular product is already in the production stage and
this so irrespective of the market regime under consideration, this literature
disregards an important part of the development phase. Indeed, it assumes an
existence of a situation whose formation the true theory of R&D should explain.
The practice shows that firms do not pursue further all conceived technological
ideas at all, but rest their decision upon their (expected) profitability.37 Yet,
this profitability is affected not only by the (expected) demand for a product
but also by the current and (expected) future market structure. The more
competitive the latter, the lower the price, the lower the mark-up, and hence
the lower the profitability of the idea.
We have shown that collusion in the product market through a larger profit
enlarges the set of initial marginal costs for which the firms pursue further development of corresponding technologies. Preventing collaboration in the product
market reduces this set.38 As Hinloopen, Smrkolj, and Wagener (2011) points
out “the loss of total surplus this reduction brings about is a hidden cost of competition policies if the indifference point is above c = 1, as in that case there is
initially no production. Not pursuing the development of the technology does
consequently not surface as an immediate reduction in total surplus. Yet, these
indirect costs should be taken into account when assessing properly the trade-off
between static and dynamic efficiency.” We have also shown that the popular
Lerner Index gives a distorted picture of welfare losses as market concentration
with its ensuing future high product market prices might in fact be needed for
the market to be formed at all. The practice of antitrust prosecutions however
shows that antitrust authorities widely neglect the exposed trade-off. As reported by Armentano (1999) in his comprehensive exposition of past antitrust
cases, “...price-fixing agreements were always illegal and there was to be no rule
of reason with regard to such conspiracies” (p. 143) and “...whether prices have
actually been fixed, or whether they have been fixed at unreasonable levels, has
been immaterial [at a federal court]” (p. 147).
In this paper we have focused on two market regimes (full cartel and R&D
cartel), leaving aside, in particular, a regime where firms compete both on the
product market and in their R&D endeavors. The technical demands of its
global analysis prevent us from including it here. We think it is also interesting
to try to include shochasticity, endogenous choice of spillovers, separate investments into innovative R&D on one hand and imitative R&D on the other hand,
asymmetric R&D abilities and differentiated products in the global framework
we developed here. Furthermore, we abstracted from potential new entry into
the market. These are in our opinion some of the fruitful areas we leave for
37 Elmer Bolton, a scientist-manager at the DuPont company, one of the most innovative
corporations in American business history, was famous for saying to company’s chemists who
in his opinion lacked the awareness that the success of the company depends on its products
being commercially exploitable: “This is very interesting chemistry, but somehow I don’t hear
the tinkle of the cash register.” (Hounshell and Smith, 1988).
38 Notice that the probability of being detected by the antitrust authorities when engaged
in collusion can be accounted for as an increase in a firm’s discount rate (as it is standard in
the literature). We have shown in Figure 12 that a higher discount rate reduces the value of
marginal cost at the indifference point.
33
future research.
34
Appendices
A
Proof of Lemma 1
A rescaled variable or parameter is distinguished by a tilde: for instance, if π
denotes profit, then π
˜ denotes profit in rescaled variables. The profit function
in the original (non-rescaled) model is:
πi = (A − qi − qj − ci ) qi − bki2
Using the conversion rules given in Lemma 1, we obtain:
πi
=
(A − qi − qj − ci ) qi − bki2
=
(A − A˜
qi − A˜
qj − A˜
ci ) A˜
qi − b
= A2 (1 − q˜i − q˜j − c˜i ) q˜i − k˜i2
A
√ k˜i
b
2
= A2 π
˜i
The equation for the evolution of marginal cost over time is in original variables
given by:
c˙i (t) = ci (t) (−ki (t) − βkj (t) + δ) .
Write ci (t) = ci 1δ t˜ . Then,
dci
dt˜
=
=
dci dt
dt dt˜
c˙i
δ
= ci 1 −
kj
ki
−β
δ
δ
.
Setting ki = √Ab k˜i and kj = √Ab k˜j , and substituting them in the previous equation, we obtain:
A dci
= ci 1 − k˜i + β k˜j √ .
dt˜
δ b
A
It is now natural to introduce φ = δ√
.
b
Note that if c˜i = ci /A, then c˜˙i = c˙i /A and
c˜˙i = c˜i 1 − k˜i + β k˜j φ
˜
Observe finally that if t = δt , then e−ρ˜t˜ = e−ρt if and only if ρ˜ = ρδ .
B
Proof of Lemma 2 and Lemma 8
As ρ > 0, the equilibrium condition k˙ = ρk = 0 implies that k = 0. But then
c˙ = c (1 − (1 + β)φk) = c. This is equal to 0 if and only if c = 0. But this
cannot be as in the system with zero production we have c ≥ 1.
35
C
Proof of Proposition 1
√
Assume φ > 32ρ/(1 + β). The stability of the steady states can be analyzed
by evaluating the trace and determinant of the following Jacobian matrix:39
" ∂ c˙ ∂ c˙ # "
#
1 − (1 + β)φk −(1 + β)φc
∂c
∂k
=
JFC =
.
(75)
∂ k˙
∂ k˙
− (1+β)φ(1−2c)
ρ
8
∂c
∂k
At c = k = 0, the trace τ of the matrix J F C is given as
def
τ = tr J F C = 1 + ρ > 0,
its determinant ∆ is
def
∆ = det J F C = ρ > 0,
and its discriminant D is
def
D = τ 2 − 4∆ = (1 − ρ)2 > 0.
Hence, this steady state is an unstable node.
Evaluating the Jacobian matrix at
(1 + β)φ
c(1 − c),
k=
8ρ
1
1 1
c= +V = +
2
2 2
s
1−
32ρ
,
(1 + β)2 φ2
(76)
we obtain τ = ρ > 0 and
p
p
(1 + β)2 φ2 − 32ρ (1 + β)φ + (1 + β)2 φ2 − 32ρ
∆
=
(77)
16
1 1
= (1 + β)2 φ2 V
+ V ,
(78)
8 4
√
which is clearly positive if φ > 32ρ/(1 + β). Hence, this steady state is also
unstable. The discriminant takes the value
p
1
D = ρ(8 + ρ) − (1 + β)φ (1 + β)φ + (1 + β)2 φ2 − 32ρ ,
(79)
4
q
2ρ(8+ρ)2
which is zero for φ = φ0 = (1+β)
2 (4+ρ) , negative for φ > φ0 , and positive for
√
32ρ/(1 + β) ≤ φ < φ0 . The steady state is an unstable node if D > 0 and an
unstable focus if D < 0. In the latter case, the eigenvalues of J F C are complex
conjugates with positive real parts.40
39 Note that the trace of the respective Jacobian matrix is equal to the sum of eigenvalues,
while its determinant is equal to their product. If the real part of each eigenvalue is negative,
then the steady state is asymptotically stable. If the real part of at least one of the eigenvalues
is positive, then the steady state is unstable. More particular, if one eigenvalue is real and
positive and the other one real and negative, the steady state is said to be a saddle. In this
last situation, there are four special trajectories, the separatrices, two of which are forward
asymptotic to the saddle, while the other two are backward asymptotic to it. The union the
former separatrices with the saddle form the stable manifold of the saddle; analogously, the
union of the latter with the saddle form the unstable
manifold.
√
1
40 Note that the eigenvalues of J F C are r
2
1,2 = 2 τ ± τ − 4∆ .
36
Finally, evaluating the Jacobian matrix at
(1 + β)φ
k=
c(1 − c),
8ρ
1
1 1
c= −V = −
2
2 2
s
1−
32ρ
,
(1 + β)2 φ2
(80)
we obtain τ = ρ > 0 and
(1 + β)2 φ2 − 32ρ − (1 + β)φ
∆=
16
p
(1 + β)2 φ2 − 32ρ
.
(81)
√
If φ > 32ρ/(1+β), then ∆ < 0, and the eigenvalues are real and have opposite
sign. Therefore,
(80) is a saddle-point steady state of the system. Observe that
√
for φ = 32ρ/(1 + β), the steady states (76) and (80) coincide at c = 1/2 and
1
k = (1+β)φ
. A saddle-node bifurcation occurs at these parameter values, where
the two equilibria coincide and disappear.41
Substituting the expression for the steady-state marginal cost of the steady
states other than the origin into (34), the expression for the optimal investment
1
.
in the steady state simplifies to k F C = (1+β)φ
D
Proof of Remark 1
The Hessian matrix of the Pontryagin function with respect to control and state
variables is

 2
∂ P (·)
∂ 2 P (·)
∂ 2 P (·)


2
−4
0
−1
∂q∂k
∂q∂c

 2∂q

 P (·) ∂ 2 P (·) ∂ 2 P (·)  
2
−2
−(1 + β)φλ 
D(q,k,c)
P =  ∂∂k∂q
= 0
∂k2
∂k∂c 

∂ 2 P (·)
∂ 2 P (·)
∂ 2 P (·)
−1 −(1 + β)φλ
0
2
∂c∂q
∂c∂k
∂c
(82)
The leading principal minor of the first order is −4. The leading principal
minor of the third order, which is also the determinant, of the above matrix is
equal to 2 + 4(1 + β)2 φ2 λ2 . As the leading principal minors of odd order do
not have the same sign, the matrix is indefinite. Consequently, the Pontryagin
function is nowhere jointly concave in state and control variables. Hence, the
Arrow-Mangasarian sufficiency conditions are not satisfied.42
For the R&D cartel, the appropriate Hessian matrix of the Pontryagin function is
 2
 "
#
∂ P (·)
∂ 2 P (·)
−2
−(1 + β)φλ
2
∂k
∂k∂c
2
=
D(k,c) P =  2
(83)
2
∂ P (·)
∂ 2 P (·)
−(1 + β)φλ
9
2
∂c∂k
∂c
The determinant of the above matrix is −4/9 − (1 + β)2 φ2 λ2 , which is clearly
negative. Notice that the determinant is the leading principal minor of the
second order. As the leading principal minor of the even order is negative,
the Hessian is by definition indefinite and, therefore, the Pontryagin function
is nowhere jointly concave in state and control variables. Hence, the ArrowMangasarian sufficiency conditions are not satisfied.
41 In
Figure 2, this corresponds to the locus c˙ = 0, which is the horizontal line k =
becoming tangent to the locus k˙ = 0 at its peak.
42 For details of sufficiency conditions, see, for instance, Grass et al. (2008).
37
1
,
(1+β)φ
E
Proof of Lemma 3
To prove the lemma, we consider the state-co-state form of the solutions, given
by:

0 ≤ c < 1, λ ≤ 0
c 1 + 12 (1 + β)2 φ2 λc ,





c,
0 ≤ c < 1, λ > 0
c˙ =
(84)

c 1 + 21 (1 + β)2 φ2 λc ,
c ≥ 1, λ ≤ 0




c,
c ≥ 1, λ > 0,
λ˙ =






2
τ (1
− c) + (ρ − 1 − 12 (1 + β)2 φ2 λc)λ,
0 ≤ c < 1, λ ≤ 0
2
τ (1 − c) + (ρ − 1)λ,
1 − 21 (1 + β)2 φ2 λc)λ,
0 ≤ c < 1, λ > 0
(ρ − 1)λ,
c ≥ 1, λ > 0,
(ρ −





H(c, λ) =











1
τ (1
− c)2 + 14 (1 + β)2 φ2 λ2 c2 + λc,
1
4 (1
1
2
τ (1 − c)
2 2 2 2
c ≥ 1, λ ≤ 0
(85)
0 ≤ c < 1, λ ≤ 0
+ λc,
0 ≤ c < 1, λ > 0
+ β) φ λ c + λc,
c ≥ 1, λ ≤ 0
λc,
c ≥ 1, λ > 0,
where τ = 8 for the full cartel and τ = 9 for the R&D cartel.
Introduce the characteristic function χS of a set S by
(
1 if x ∈ S
χS (x) =
0 if x ∈
/ S.
(86)
(87)
Differential equations (84) and (85) can be rewritten more compactly as:
1
def
c˙ = c + χ(−∞,0] (λ) (1 + β)2 φ2 λc2 = F (c, λ)
2
(88)
and
1
2
def
λ˙ = (ρ − 1)λ − χ(−∞,0] (λ) (1 + β)2 φ2 cλ2 + χ[0,1) (c) (1 − c) = G(c, λ). (89)
2
τ
The state-co-state space is sketched in Figure 17. We first consider the trajectories in the region where c ≥ 1 and λ > 0, which are the solution to the following
canonical system:
(
c˙ = c
(90)
λ˙ = (ρ − 1)λ,
given by λ(t) = C1 e(ρ−1)t and c(t) = C2 et , where C1 and C2 are positive constants. As mentioned in the main text, every candidate for an optimal solution
must necessarily satisfy the transversality condition limt→∞ e−ρt λ(t)c(t) = 0.
As we now show, the trajectories in the region considered violate this condition.
Moreover, they also violate the following transversality condition:
lim e−ρt H(c, λ) = 0,
t→∞
38
(91)
q > 0,
0.6
k=0
q = 0,
k=0
q = 0,
k>0
0.4
S1
λ˙ = 0
λ
0.2
0
c˙ = 0
λ˙ = 0
S
3
−0.2
−0.4
q > 0,
0
k>0
0.5
1
1.5
c
Figure 17: Illustrative sketch of the state–co-state space (for the full cartel with
ρ < 1). The dotted vertical line c = 1 separates the region with zero production
from the region with a positive level of production, whereas the horizontal line
λ = 0 separates the region with positive investment from the region with zero
investment. The loci c˙ = 0 and λ˙ = 0 intersect in the two unstable steady states
(S1 and S3 ) and the saddle-point steady state S2 (not indicated). A number
of trajectories is indicated by black curves: the arrows point in the direction of
the flow. The thick curve indicates the stable path leading to the saddle-point
steady state.
which in this case coincides with the first transversality condition, as shown
below. It follows from Michel (1982) that condition (91)43 is also a necessary
condition, and hence that it allows exclusion of trajectories which verify the
other necessary conditions in an infinite horizon optimization problem. The
value of the Hamiltonian function evaluated along the considered trajectories is
H(c, λ) = λ(t)c(t) = C1 C2 eρt ; it follows that
lim e−ρt H(c, λ) = lim e−ρt λ(t)c(t) = C1 C2 6= 0.
t→∞
t→∞
(92)
Hence, no trajectory in the region given by the restriction c ≥ 1 and λ > 0 can
be optimal.
Consider now trajectories in the region with 0 < c < 1 and λ > 0. These
trajectories are the solution to
(
c˙ = c
(93)
λ˙ = τ2 (1 − c) + (ρ − 1)λ.
43 In
words, the above condition means that the present value of the maximum of the
Pontryagin function (the present value of the Hamiltonian) converges to zero when time goes
to infinity.
39
We show that any trajectory in this region sooner or later enters the region with
c ≥ 1 and λ > 0.
Observe that the equation c˙ = c has as a solution c(t) = C3 et , where C3 is
a positive constant. Hence, along any trajectory in this region, c is increasing.
From (85) follows that if λ = 0, then
2
λ˙ = (1 − c) > 0
τ
for all c ∈ (0, 1). Hence, trajectories in the region with 0 < c < 1 and λ > 0
cannot exit this region through the line segment {(c, λ) : c ∈ (0, 1), λ = 0}. We
now show that they also cannot exit through the point (c, λ) = (1, 0).44
Let x = (c, λ) and x0 = (c0 , λ0 ) = (1, 0). Furthermore, let F : D → R2 be a
vector function defined as
F(c, λ) = F (c, λ), G(c, λ) ,
where F is defined in (88) and G in (89).45 Its domain is D = R+ × R ⊂ R2 .
We are then looking for a solution to a 2-dimensional nonlinear autonomous
dynamical system of the form
x˙ = F(x),
x(0) = x0 .
(94)
Take 0 < r < 1. Then the set
D0 = Br (x0 ) = {x ∈ D : kx − x0 k ≤ r},
(95)
is a neighborhood of x0 . First, we show that the restriction of the functions F
and G on D0 (a compact subset of D) is Lipschitz 46 . Consider first the function
(1 + β)2 φ2 2
c .
2
and set h(λ) = χ(λ)λ. Then,
F (c, λ) = c + λχ(−∞,0] (λ)
Write for brevity χ = χ(−∞,0]
|h(λ) − h(0)| = |χ(λ)λ − 0| = |χ(λ)λ| ≤ |λ| = 1 · |λ − 0|.
(96)
Hence, h is Lipschitz on D0 . As on compact sets continuously differentiable
functions are Lipschitz, as well as sums and products of Lipschitz functions, it
follows that F is Lipschitz on D0 .
Consider now on D0 the function
G = (ρ − 1)λ − λ2 χ(−∞,0] (λ)
(1 + β)2 φ2
2(1 − c)
c+
χ[0,1) (c).
2
τ
44 If they exited through this point, they would satisfy the transversality conditions as λ
˙ =0
for λ = 0 and c ≥ 1.
45 Though continuous, functions F and G are not differentiable: F is not differentiable with
respect to λ at λ = 0, whereas G is not differentiable with respect to c at c = 1. Using
Peano’s existence theorem, the continuity of F implies that at least one solution to (94) exists.
However, continuity is not enough to guarantee uniqueness. An additional condition that
needs to be fulfilled to guarantee uniqueness of solutions, at least in some neighborhood of
x0 , is that F is locally Lipschitz in x at x0 .
46 Simply put, a function f : D → Rn is said to be Lipschitz on B ⊆ D if there exists
a constant K > 0 such that kf (x) − f (y)k ≤ Kkx − yk, for all x, y ∈ B. See Kelley and
Peterson (2010), ch. 8, and Sohrab (2003), ch. 4, for the introduction to concepts and precise
definitions of terms used in this section.
40
The first term of this expression is a linear function; the second term is the
function h introduced above times a differentiable function. That the final term
is Lipschitz at c = 1 is demonstrated in the same way as for the function h. It
follows that G is Lipschitz on D0 .
We have proved that the functions F and G are locally Lipschitz at x0 .
Consequently, so is also the vector function F. By the Picard-Lindel¨of Theorem,
the system (94) has a unique solution in a neighborhood of x0 . As this point is
on the exit trajectory, no other trajectory can pass through it, in particular no
trajectory from the region with λ > 0. As c˙ > 0 in the region with 0 < c < 1 and
λ > 0, as shown above, all trajectories in this region exit through {c = 1, λ > 0}
and enter the region with c ≥ 1 and λ > 0 as t → ∞. We have already shown
that none of these can be optimal.
Moreover, trajectories through points in the set {(c, λ) : c ∈ (0, 1), λ = 0}
satisfy λ˙ > 0; they move in the region for which λ > 0 and hence cannot
be a part of any optimal trajectory. Due to a one-to-one correspondence between λ and k as given in the first part of (23), this set corresponds to the set
{(c, k) : c ∈ (0, 1), k = 0} in the state-control representation of the solution. As
we have shown, all trajectories leading to these points violate the transversality
conditions which every optimal trajectory must necessarily satisfy.
F
Proof of Lemma 4
Assume that there is an optimal investment schedule for which c → 0 and
k → ∞ as t → ∞.
The instantaneous profit function of each firm in every instant is:
π = (1 − 2q − c)q − k 2 .
(97)
As c → 0 along the trajectories considered, they sooner or later enter the region with positive production, where we have q = (1 − c)/4. Substituting this
expression into (97), we obtain:
π=
(1 − c)2
− k2 .
8
(98)
As c → 0, (1 − c)2 /8 approaches its upper bound of 1/8. The second term in
the above equation, −k 2 , decreases beyond all bounds as k → ∞. As t → ∞,
there is therefore a time t0 such that π(t) = 0 for t = t0 , and π(t) < 0 for all
t > t0 . Changing the investment schedule to k(t) = 0 for all t ≥ t0 would yield
a higher value of total discounted profits Π, contradicting the assumption.
In the case of the R&D cartel the instantaneous profit is
π(c, k) =
(1 − c)2
− k2 .
9
(99)
As c → 0, (1 − c)2 /9 approaches its upper bound of 1/9. The second term in
the above equation −k 2 however increases beyond all bounds as k → ∞. The
rest of the argument is analogous to the one for the full cartel above.
41
G
Proof of Lemma 5
We know that the state-control system with zero production is:
˙
k(t)
= ρk(t)
c(t)
˙
= c(t) (1 − (1 + β)φk(t)) .
˙
At k = 0, this reduces to: k(t)
= 0, c(t)
˙ = c. Hence, marginal costs increase
to infinity along the exit trajectory as t → ∞. However, as λ = 0 for c ≥ 1 along
the exit trajectory, the transversality condition (19) is satisfied. Observe that
along the exit trajectory k > 0 for 0 < c < 1 and k = 0 for c ≥ 1. Consequently,
it follows from (23) that λ = 0 at c = 1. Then, from (85) we have that indeed
λ = 0 for all c ≥ 1. Increasing marginal costs along the exit trajectory sooner or
later exceed the value of 1, for which the value of H(c, λ) becomes 0 (see (86)).
Hence, the transversality condition (18) is satisfied as well.
H
Proof of Corollary 1
TO BE DONE
The result of the corollary follows directly from lemmata (3)-(5) and the
directions of trajectories in the phase diagram provided in Figure 2. It is easy
to observe that given the trajectories in the phase diagram, if we exclude trajectories defined in Lemma 3 and Lemma 4, we are left exactly with the two
trajectories defined in the corollary. As the latter both satisfy the transversality
conditions, the result of the corollary is obtained. We now prove that the directions of trajectories in different sectors of the phase diagram are indeed those
that are in the phase diagram indicated in squared brackets.
˙ For c < 1,
Consider first the equation for k.
(1 + β)φ
k˙ = ρk −
c(1 − c).
8
(100)
FC
˙
The k˙ = 0 isocline is given by k F C = (1+β)φ
8ρ c(1 − c). Then, k > 0 if k > k
and k˙ < 0 if k < k F C . For c ≥ 1, k˙ = ρk. Hence, in this case, k˙ > 0 for all
k 6= 0.
Consider now equation for c,
˙ given by
c˙ = c(1 − (1 + β)φk).
(101)
1
The c˙ = 0 isoclines are given by c = 0 and k = (1+β)φ
= K. Assume c > 0.
Then, c < 0 if k > K and c > 0 if k < K.
˙ For c < 1,
For the R&D Cartel: Consider first the equation for k.
(1 + β)φ
k˙ = ρk −
c(1 − c).
9
(102)
RC
˙
The k˙ = 0 isocline is given by k RC = (1+β)φ
9ρ c(1 − c). Then, k > 0 if k > k
RC
and k˙ < 0 if k < k . For c ≥ 1, k˙ = ρk. Hence, in this case, k˙ > 0 for all
k 6= 0.
42
Consider now equation for c,
˙ given by
c˙ = c(1 − (1 + β)φk).
The c˙ = 0 isoclines are given by c = 0 and k =
Then, c < 0 if k > K and c > 0 if k < K. Q.E.D.
I
(103)
1
(1+β)φ
= K. Assume c > 0.
Proof of Lemma 7
From (33), we know that the Hamiltonian for c ≥ 1 is given by:
2
H(c, k) = k k −
(1 + β)φ
(104)
and from Lemma 6, we know that the comparison of the total discounted profits
of each two candidate optimal paths amounts to comparing the values of the
respective Hamiltonians in the initial point of each respective path. As the
Hamiltonian in the case of zero investment and zero production is zero, the
indifference point between the stable path and the exit trajectory in the region
with zero production must be the point at which the Hamiltonian evaluated
along the stable path obtains the value of zero.
2
=0
H(c, k) = k k − (1+β)φ
⇒k=0
or k =
2
(1+β)φ
The solution structure (41)-(42) tells us that the stable path in the region
with c ≥ 1 decreases as t → −∞, assuming that the stable path covers the
respective region. Observe that the derivative of the Hamiltonian with respect
2
1
to k is ∂H(c,k)
= 2k − (1+β)φ
, which is positive for k > (1+β)φ
and negative
∂k
1
for k < (1+β)φ .
If the stable path enters the region with zero production at all, it enters this
1
; this follows directly from equation (30).
region at the point where k > (1+β)φ
The same equation implies that k is decreasing along the stable path as
t → −∞, assuming the stable path enters the zero-production region. The above
conclusions lead us to distinguish three cases. First, if the stable path crosses
2
the boundary line c = 1 at k > (1+β)φ
, then the value of the Hamiltonian is
decreasing along the stable path in the region with c ≥ 1 as t → −∞, passes zero
2
at k = (1+β)φ
and is negative afterwards. In this case, the indifference point is
2
the value of the marginal cost cˆ > 1 that corresponds to the point cˆ, (1+β)φ
on
2
the stable path. Second, if the stable path reaches c = 1 at exactly k = (1+β)φ
,
then the indifference point is c = 1. Third, for lower values of k on the stable
path at c = 1, the value of the Hamiltonian evaluated along the stable path in
the zero-production region is negative, such that the point of indifference (if at
all) must be in the region with positive production.
J
Proof of Proposition 2
√
Assume φ > 6 ρ/(1 + β). The stability of the steady-states can be analyzed by
evaluating the trace and determinant of the following Jacobian matrix:
43
"
J
RC
=
∂ c˙
∂c
∂ k˙
∂c
∂ c˙
∂k
∂ k˙
∂k
#
"
=
1 − (1 + β)φk
−(1 + β)φc
− (1+β)φ(1−2c)
9
ρ
#
.
(105)
36ρ
,
(1 + β)2 φ2
(106)
At c = k = 0, the trace τ of the matrix J RC is given as
def
τ = tr J RC = 1 + ρ > 0,
its determinant ∆ is
def
∆ = det J RC = ρ > 0,
and its discriminant D is
def
D = τ 2 − 4∆ = (1 − ρ)2 > 0.
Hence, this steady state is an unstable node.
Evaluating the Jacobian matrix at
(1 + β)φ
c(1 − c),
k=
9ρ
1
1 1
c= +V = +
2
2 2
s
1−
we obtain τ = ρ > 0 and
p
p
(1 + β)2 φ2 − 36ρ (1 + β)φ + (1 + β)2 φ2 − 36ρ
∆ =
(107)
18
1 2
= (1 + β)2 φ2 V
+ V ,
(108)
9 9
√
which is clearly positive if φ > 6 ρ/(1 + β). Hence, this steady state is also
unstable. The discriminant takes the value
p
2
(109)
D = ρ(8 + ρ) − (1 + β)φ (1 + β)φ + (1 + β)2 φ2 − 36ρ ,
9
√
3 ρ(8+ρ)2
which is zero for φ = φ0 = √
, negative for φ > φ0 , and positive for
2
2
√
6 ρ
(1+β)
(1+β) (4+ρ)
≤ φ < φ0 . The steady state is an unstable node if D > 0 and an unstable
focus if D < 0. In the latter case, the eigenvalues of J M are complex conjugates
with positive real parts.47
Finally, evaluating the Jacobian matrix at
s
(1 + β)φ
1
1 1
36ρ
k=
c(1 − c), c = − V = −
1−
,
(110)
9ρ
2
2 2
(1 + β)2 φ2
we obtain τ = ρ > 0 and
(1 + β)2 φ2 − 36ρ − (1 + β)φ
∆=
18
p
(1 + β)2 φ2 − 36ρ
.
(111)
√
If φ > 6 ρ/(1 + β), then ∆ < 0, and the eigenvalues are real and have opposite
sign. Therefore, (110) is the saddle-point steady state of the system. Observe
47 Note
that the eigenvalues of J RC are r1,2 =
44
1
2
√
τ ± τ 2 − 4∆ .
√
that for φ = 6 ρ/(1 + β), the two steady states (106) and (110) coincide at c =
1
. A saddle-node bifurcation occurs at these parameter values,
1/2 and k = (1+β)φ
where the two equilibria collide and disappear. Substituting the expression for
the steady-state marginal cost of the steady states other than the origin into
(68), the expression for the optimal investment in the steady state simplifies to
1
.
k RC = (1+β)φ
References
[1] Adams, J., Griliches, Z., 1996. Measuring science: an exploration. Proceedings of the National Academy of Sciences 93: 12664-12670.
[2] Armentano, D. T., 1999. Antitrust and Monopoly. Anatomy of a Policy
Failure. 2nd Edition. The Independent Institute.
[3] Cellini, R., Lambertini, L., 2009. Dynamic R&D with spillovers: competition vs cooperation. Journal of Economic Dynamics & Control 33, 568-582.
[4] D’Aspremont, C., Jacquemin, A., 1988. Cooperative and Noncooperative
R&D in Duopoly with Spillovers. The American Economic Review 78:
1133-1137.
[5] Elzinga, K. G., Mills, D. E., 2011. The Lerner Index of Monopoly Power:
Origins and Uses. American Economic Review: Papers & Proceedings
101(3): 558-564.
[6] Geroski, P. A.,1992. Vertical relations between firms and industrial policy.
Economic Journal 102: 138-147
[7] Grass, D., Caulkins, J. P., Feichtinger, G., Tragler G., Behrens, D. A.,
2008. Optimal Control of Nonlinear Processes. Springer.
[8] Graves, S. B., Langowitz, N. S., 1993. Innovative Productivity and Returns
to Scale in the Pharmaceutical Industry. Strategic Management Journal 14:
593-605.
[9] Hounshell, D.A., Smith, J.K., Jr., 1988. Science and Corporate Strategy:
Du Pont R&D, 1902-1980). Cambridge University Press, New York.
[10] Hinloopen, J., Smrkolj, G., Wagener, F. O. O., 2011. From Mind to Market: A Global, Dynamic Analysis of R&D. Tinbergen Institute Discussion
Paper.
[11] Jacquemin, A., 1988. Cooperative Agreements in R&D and European Antitrust Policy. European Economic Review 32: 551-560
[12] Judd, K. L., 1998. Numerical Methods in Economics. Massachusetts Institute of Technology.
[13] Kamihigashi, T., 2001. Necessity of Transversality Condition for Infinite
Horizon Problems. Econometrica 69: 995-1012.
[14] Kiseleva, T., Wagener, F. O. O., 2009. Bifurcations of optimal vector fields
in the shallow lake model. Journal of Economic Dynamics & Control.
45
[15] Leibenstein, H., 1966. Allocative Efficiency and X-Efficiency. The American
Economic Review, 56: 392-415.
[16] Lindenberg, E. B., Ross S. A., 1981. Tobins Q Ratio and Industrial Organization. Journal of Business, 54(1): 132.
[17] Martin, S., 1995.R&D joint ventures and tacit product market collusion.
European Journal of Political Economy 11: 733-741
[18] Michel, P., 1982. On the Transversality Condition in Infinite Horizon Optimal Problems. Econometrica 50: 975-985.
[19] Madsen, J. B., 2007. Are there diminishing returns to R&D? Economic
Letters 95: 161-166.
[20] Petit, M.L., Tolwinski, B., 1999. R&D cooperation or competition? European Economic Review 43: 185-208.
[21] Pindyck, R. S., 1985. The Measurement of Monopoly Power in Dynamic
Markets. Journal of Law and Economics, 28(1): 193-222.
[22] Salant, S. W, Shaffer, G., 1998. Optimal asymmetric strategies in research
joint ventures. International Journal of Industrial Organization 16, 195208.
[23] Scherer, F. M., 1980. Industrial Market Structure and Economic Performance. Chicago, Rand McNally College Publishing Co.: 430-431
[24] Schwartzman, D., 1976. Innovation in the Pharmaceutical Industry. Baltimore: John Hopkins University Press.
[25] The Focus, 2007. Competition. Vol. XI/2.
[26] Vernon, J. M. and Gusen, P., 1974. Technical Change and Firm Size: The
Pharmaceutical Industry. Review of Economics and Statistics 56: 294-302.
[27] von Hippel, E., 1988. ”The Sources of Innovation”, New York: Oxford
University Press.
[28] Wagener, F.O.O., 2003. Skiba points and heteroclinic bifurcations, with
applications to the shallow lake system. Journal of Economic Dynamics &
Control 27, 1533-1561.
46