4. Planet Packing#

Gravity is a long-range force/effect that attracts all bodies with mass to each other. If one considers two masses (\(m_1\) and \(m_2\)) on nearby orbits around a central body \((m_o;\ m_1\ll m_o\ \&\ m_2\ll m_o)\), then it is expected that each mass will perturb the other mass’ orbit causing it to either speed up or slow down a little. However, if one body is substantially more massive or close enough, then the lower mass body can be scattered to a wide/eccentric orbit or escape the central body altogether. Through numerical experiments involving scattering, it leads to a seemingly simple question:

  • How close can two (or more) planets be separated such that scattering does not occur?

This question requires a few things to be defined so that limited numerical experiments can be performed. For example, the above main question leads to four other sub-questions:

  1. How long is necessary to say that a scattering event will not occur?

  2. How do we measure the separation between orbits?

  3. Is there a scale-free way to define the previous two questions?

  4. Will our choice of initial orbital elements bias the potential outcomes? If so, how to overcome this bias?

4.1. Stellar lifetimes as a constraint on stability#

Orbital stability often goes undefined, even though it can have multiple meanings. Typically, it implies that a mass will remain on a bound orbit indefinitely (i.e., Lagrange stability; see Hayashi et al. (2023) or Barnes & Greenberg (2006)). Using indefinitely as a timescale is not practical. Scattering events can also transport a mass to a wider orbit without fully expelling the mass from the system. To address the potential for scattering events, please review updating the standard stability formulae in the previous section. Otherwise, we look to natural constraints on time to arrive at a more practical definition for stability.

One natural constraint on the timescale for stability is the lifetime of the system. To judge the lifetime of a system, we can look to astrophysics and the lifetimes of stars. Consider the stability of the Solar System, where we might define it to be stable if the planets remain on bound orbits for \({\sim}10\ {\rm Gyr}\), or the main-sequence lifetime of a typical \(\rm G2V\) star. Even this definition has problems because the planet Mercury has a some probability of undergoing an instability (e.g., Laskar & Gastineau (2009)) within the remainder of the Sun’s main-sequence lifetime.

If Mercury is expelled from the Solar System, does that make the whole system unstable?

The short answer is no because the remaining planets will adjust/exchange their angular momentum until a new equilibrium is found (see Laskar (1997)) and was worked out mathematically by Laplace in 1784. This may have occurred in the past, where the giant planets’ orbits were more compact (Quarles & Kaib (2019), Nesvorny et al. (2018), Nesvorny (2011)). The giant planets may have mutually scattered their orbits and resulted in the configuration we see today. An example simulation of this process can be found at www.billyquarles.com/latest-research.

The main-sequence lifetime of stars can be determined numerically given that we have some accurate estimates for a given star’s composition (e.g., hydrogen, helium, and metal mass fraction). The details can be found in a chapter on stellar evolution in Modern Astrophysics. The general estimate of main-sequence stellar lifetime \(\tau_{\rm ms}\) is that it is proportional to the inverse-cube of the stellar mass, \(\tau_{\rm ms} \propto M^{-3}\). Through the proportionality, we can scale other stars relative to our Sun. For example, a \(10\ M_\odot\) star’s lifetime is \(10^{-3}\) times \(10\ {\rm Gyr}\), or \(1\ {\rm Myr}\). See the lecture from Jason Kendall below for a general overview.

4.2. Scaling a planetary system using the Hill Radius#

4.2.1. From Lagrange points to the Hill radius#

In the three-body problem, the relative gravitational influence on a third body can be determined in which there are five special locations called the Lagrange points. These points define extrema in the gravitational potential, where the net gravitational force on the third body vanishes. Additionally, they correspond to high and low topological levels of pseudopotential \(U\). This pseudopotential exists within a rotated reference frame, where two massive bodies appear at fixed locations and a third body reacts to the presence of the massive primaries.

Consider two massive primaries lying along the \(x\)-axis of a reference coordinate system with the center of mass (barycenter) at the origin. Since the barycenter lies at the origin, then the center of mass can be determined using:

(4.1)#\[\begin{align} 0 &= m_o x_o + m_1 x_1, \qquad {\rm where}\ m_o > m_1;\\ a_{\rm bin} &= |x_o| + |x_1|, \\ x_o &= -\frac{m_1}{m_o+m_1}a_{\rm bin}, \\ x_o &= -\mu a_{\rm bin}, \quad {\rm and}\quad x_1 = (1-\mu)a_{\rm bin}. \end{align}\]

Two intermediate constants: the binary semimajor axis \(a_{\rm bin}\) and mass ratio \(\mu\) are introduced to simplify the notation, but they also provide a convenient means to scale the problem. The above analysis can be extended into two dimensions using the pseudopotential \(U\):

(4.2)#\[\begin{align} U(x,y) & = \frac{1-\mu}{r_o} + \frac{\mu}{r_1} + \frac{n^2}{2}\left(x^2+y^2\right), \\ r_o &= \sqrt{(x+\mu)^2 + y^2}, \\ r_1 &= \sqrt{[x-(1-\mu)]^2 + y^2}. \end{align}\]

The first two terms are simply the gravitational potential relative to each of the masses (\(m_o\ \&\ m_1\)) measured by the respective radii (\(r_o\ \&\ r_1\)). But the last term arises from the rotated coordinate system as a centrifugal potential.

Note

The coordinates in the pseudopotential are also scaled by the binary semimajor axis \(a_{\rm bin}\). In these coordinates, the mean motion \(n = 1\). If you want to express everything in non-scaled coordinates, the \(\mu\) and \((1-\mu)\) need to be replaced with \(x_o\) and \(x_1\), respectively.

Taking the partial derivatives of the pseudopotential \(U\) leads to deriving the forces within the rotated coordinate system (i.e., \(F = -\nabla U\)). Integrating the full equations of motion derived from the pseudopotential leads to the Jacobi Integral and the associated constant of motion, the Jacobi constant \(C_J\). The Jacobi constant is given mathematically as:

(4.3)#\[\begin{align} C_J = 2U - v^2, \end{align}\]

which is similar in appearance to an energy relation. It is not because we are considering the restricted problem (\(m_2\ll m_1 < m_o\)), where energy and angular momentum aren’t conserved.

Why do we bother with the rotated coordinate system? Special boundaries arise that constrain the motion of the smaller mass. These boundaries are called the zero velocity contours (ZVC), which are defined when \(v=0\) as the name suggests. As a result, the Jacobi constant for the ZVC is \(C_J = 2U\). Using the ZVC, we can illustrate the extent of gravitational influence of each mass \(m_o\) and \(m_1\).

The existence of a given contour depends on the value of the Jacobi constant, which in turn depends on the mass ratio \(\mu\) and initial semimajor axis ratio \(\rho\) through the pseudopotential. To identify a closed contour, we must choose initial values in \(\mu\) and \(\rho\) so that the contour is continuous through the inner Lagrange point \(L_1\). Fortunately, Quarles et al. (2020) provides a table for the circular restricted three body problem.

Using the Sun-Jupiter system (\(\mu \approx 0.999\)), the critical semimajor axis ratio \(\rho \approx 0.33\). The python code below illustrates the closed contour around the Sun (yellow dot) and a much smaller closed contour enclosing Jupiter (orange dot) using units of Jupiter’s semimajor axis \(a_J\). The red x symbols mark the locations of the Lagrange points. The filled gray area marks a forbidden region, where the Jacobi constant \(C_J > 2U\). For this to occur, the particle would have a complex velocity \(v\), or \(v^2<0\). If we force the velocity to be a real number, then it shifts the physical interpretation so that the particle mass is imaginary. These solutions are deemed unphysical and hence, forbidden.

Note

The lowest value in the Jacobi constant occurs for the \(L_4\) and \(L_5\) Lagrange points. This shows why Jupiter is still has asteroids (i.e., Trojans) orbiting at these locations

import numpy as np
import matplotlib.pyplot as plt
import matplotlib.cm as cm
import matplotlib.colors as colors

def pseudo_pot(x,y,mu):
	sign = 1
	if mu >= 0.5:
		sign = -1 #flip the sign to maintain orientation
	ro = np.sqrt((x+sign*mu)**2+y**2)
	r1 = np.sqrt((x-sign*(1.-mu))**2+y**2)
	U = (1.-mu)/ro + mu/r1 + (x**2+y**2)/2.
	return 2.*U 

def calc_ZVC(rho):
	CK = mu + 2.*mu*rho  + (1.-mu)/rho + 2.*mu/(1.+rho) + 2.*np.sqrt(rho*(1.-mu)) + mu*(1.-mu)
	return CK

mu = 0.999
alpha = 1 - mu
rho = 0.0329
C1 = calc_ZVC(rho)

if mu <=0.5:
	L1 = (mu/3.)**(1./3)
	L3 = -(1.+(5*mu/12.))
else:
	L1 = (alpha/3.)**(1./3)
	L3 = -(1.+(5*alpha/12.))

cmap = cm.gnuplot
my_cmap = cm.get_cmap(cmap)
norm = colors.Normalize(0,1)
cmmapable = cm.ScalarMappable(norm,my_cmap)
cmmapable.set_array(range(0,1))
my_cmap.set_under('k')

x_color='r'
ms = 3

X = np.arange(-1.5,1.5,0.01)
Y = np.arange(-1.5,1.5,0.01)
xx,yy = np.meshgrid(X,Y)
Z = pseudo_pot(xx,yy,mu) 

fig = plt.figure(1,figsize=(5,5),dpi=150)
ax = fig.add_subplot(111,aspect='equal')

ax.contourf(X,Y,Z,levels=[3,C1],cmap=cm.binary)
ax.contour(X,Y,Z,levels=[3,C1],colors='k',linestyles='solid',linewidths=2)

ax.plot(-(1-mu),0,'y.',ms=20)
ax.plot(mu,0,'.',color='orange',ms=8)
ax.plot(1-L1,0,'x',color=x_color,ms=ms)
ax.plot(1+L1,0,'x',color=x_color,ms=ms)
ax.plot(L3,0,'x',color=x_color,ms=ms)
ax.plot(0.5 - alpha,np.sqrt(3.)/2.,'x',color=x_color,ms=ms) # for $mu <= 0.5 this should be 0.5 - mu
ax.plot(0.5 - alpha,-np.sqrt(3.)/2.,'x',color=x_color,ms=ms)
		
ax.set_xlim(-1.25,1.25)
ax.set_ylim(-1.25,1.25)
ax.set_ylabel("Y $(a_J)$")
ax.set_xlabel("X $(a_J)$");
../_images/c435e0b5413beb6ec4acc09c2fe293801d01f33047dde693bcda5982ad970420.png

Below is a python code that computes the ZVC using many values of the Jacobi constant \((C_J = 3,\ 3.05,\ 3.1,\ 3.15)\) for the Earth-Moon system. Three of the Lagrange points (black dots) are collinear, while the remaining two are the triangular Lagrange points. Particles that start near \(L_4\) or \(L_5\) are bound within the lobe-shaped contours.

Interior to the gray contours lie regions where either the Earth or Moon dominate in the gravitational tug-of-war and delineate each body’s Hill Sphere. The use of ‘sphere’ isn’t mathematically correct, where the generalization of the region to three dimensions is actually an ellipsoid. This can be seen in that the region around the Moon is more squashed in the vertical compared to the horizontal extent.

import numpy as np
import matplotlib.pyplot as plt

def potential(x,y):
	ro = np.sqrt((x+mu)**2+y**2)
	r1 = np.sqrt((x-(1.-mu))**2+y**2)
	U = (1.-mu)/ro + mu/r1 + (x**2+y**2)/2.
	return 2.*U 

mu = 1./81.
xi = np.arange(-2.,2.,0.001)
yi = np.arange(-2.,2.,0.001)
xx,yy = np.meshgrid(xi,yi)
zi = potential(xx,yy)
L1 = (mu/3.)**(1./3)
L3 = -(1.+(5*mu/12.))
L4 = 0.5-mu
col = 'k'
mark = '.'
ms = 15

fig  = plt.figure(figsize=(6,6),dpi=150)
ax = fig.add_subplot(111,aspect='equal')

#ax.plot((1.-mu)*np.cos(theta),(1.-mu)*np.sin(theta),'k-',lw=2)
CS = ax.contour(xi,yi,zi,np.arange(3,3.2,0.05),cmap=plt.cm.Set1)
ax.plot([-1.25,1.35],[0,0],'k--',ms=1.5)
ax.plot(1.-L1,0,marker=mark,color=col,ms=ms)
ax.text(1.-L1,0.065,'$L_1$',fontsize=20,horizontalalignment='center')
ax.plot(1.+L1,0,marker=mark,color=col,ms=ms)
ax.text(1.+L1+0.025,0.065,'$L_2$',fontsize=20,horizontalalignment='center')
ax.plot(L3+mu,0,marker=mark,color=col,ms=ms)
ax.text(L3-0.1+mu,0.065,'$L_3$',fontsize=20,horizontalalignment='center')
ax.plot(L4,np.sqrt(3)/2.,marker=mark,color=col,ms=ms)
ax.text(L4,np.sqrt(3)/2.+0.065,'$L_4$',fontsize=20,horizontalalignment='center')
ax.plot(L4,-np.sqrt(3)/2.,marker=mark,color=col,ms=ms)
ax.text(L4,-np.sqrt(3)/2.-0.135,'$L_5$',fontsize=20,horizontalalignment='center')

ax.plot([L4,-mu],[np.sqrt(3.)/2.,0],'k--',lw=1.5)
ax.plot([L4,1.-mu],[np.sqrt(3.)/2.,0],'k--',lw=1.5)
ax.plot([L4,-mu],[-np.sqrt(3.)/2.,0],'k--',lw=1.5)
ax.plot([L4,1.-mu],[-np.sqrt(3.)/2.,0],'k--',lw=1.5)

ax.plot(-mu,0,'.',mfc='skyblue',mec='skyblue',ms=50,label='Earth')
ax.plot(1.-mu,0,'.',mfc='darkblue',mec='darkblue',ms=20,label='Moon')
#ax.plot(0,0,'kx',ms=5,label="CoM")

ax.legend(bbox_to_anchor=(1.1, .5, .3, .5),numpoints=1,fontsize=20,frameon=False)
ax.set_xlim(-1.3,1.3)
ax.set_ylim(-1.3,1.3)
ax.axis('off');
../_images/dfd59926efde1959d10c495ec7fcbfab188d1990669391abeec5bb69dafce423.png

The appearance of the contours change with the mass ratio, which can be seen below if we consider the Sun-Earth system instead. The inset panel zooms in on the region around the Earth and is scaled using the Hill radius \(R_H\). The Hill radius is defined using the mass ratio \(\mu\) and the binary separation \(a_{\rm bin}\):

(4.4)#\[\begin{align} R_H = a_{\rm bin}\left(\frac{\mu}{3}\right)^{1/3}. \end{align}\]

Note that this definition is the result of an approximation that assumes circular orbits, where an additional term \((1-e_{\rm bin})\) would be necessary if the binary orbit were elliptical. The definition effectively becomes

(4.5)#\[\begin{split}R_H &\approx q_{\rm bin}\left(\frac{\mu}{3}\right)^{1/3}, \\ q_{\rm bin} &= a_{\rm bin}(1-e_{\rm bin}).\end{split}\]
import numpy as np 
import matplotlib.pyplot as plt 
from mpl_toolkits.axes_grid1.inset_locator import inset_axes
import matplotlib.colors as cm

def Jacobi_const(x,y,xo,yo,x1,y1):
    ro = np.sqrt((x-xo)**2+(y-yo)**2) #distance from particle to the Sun
    r1 = np.sqrt((x-x1)**2+(y-y1)**2) #distance from particle to the Earth
    phi = (1.-mu)*(ro**2/2. + 1./ro) + mu*(r1**2/2. + 1./r1)
    return 2*phi

G = 4*np.pi**2
M_sun = 1 # 1 solar mass
M_E = 3.0035e-6 #mass of Earth in M_sun
mu = M_E/(M_E+M_sun)
R_H = (mu/3)**(1./3.)
x_E, y_E = (1-mu), 0
x_S, y_S = -mu, 0

r_levels = [0.3,0.6,0.9]
for i in range(1,10):
    r_levels.append(1+i*0.25*R_H)
n_lev = len(r_levels)
Z_levels = np.zeros(n_lev+1)
for r in range(0,n_lev):
    Z_levels[r] = Jacobi_const(r_levels[r],0,x_S,y_S,x_E,y_E)
Z_levels[-1] = Jacobi_const(0.5-x_E,np.sqrt(3)/2,x_S,y_S,x_E,y_E)
Z_levels.sort()
print(Z_levels)

fig = plt.figure(figsize=(5,5),dpi=150)
ax = fig.add_subplot(111, aspect='equal')
axins = inset_axes(ax, width="40%", height="40%",bbox_to_anchor=(0.25, 0.05, 1, 1), bbox_transform=ax.transAxes)
axins.set_aspect('equal')

ax.plot(x_S,y_S,'y*',ms=15)
ax.plot(x_E,y_E,'b.',ms=10)
axins.plot(0,0,'b.',ms=10)
axins.plot(1,0,'rx',ms=5)
axins.plot(-1,0,'rx',ms=5)

xi = np.arange(x_E-1.5*R_H,x_E+1.5*R_H,0.01*R_H)
yi = np.arange(y_E-1.1*R_H,y_E+1.1*R_H,0.01*R_H)
xx,yy = np.meshgrid(xi,yi)
Z = Jacobi_const(xx,yy,x_S,y_S,x_E,y_E)

vmin, vmax = 3, 3.003
my_cmap = plt.colormaps['Set1']
norm = cm.Normalize(vmin=vmin, vmax=vmax)
#my_cmap.set_under('k')

ax.contour(xx,yy,Z, levels=Z_levels,zorder=5,cmap=my_cmap,norm=norm)
axins.contour((xx-x_E)/R_H,yy/R_H,Z, levels=Z_levels,zorder=5,cmap=my_cmap,norm=norm,linewidths=0.5)
axins.set_xlabel("X ($R_H$)")
axins.set_ylabel("Y ($R_H$)")

xi = np.arange(-1.1,1.1,0.001)
yi = np.arange(-1.1,1.1,0.001)
xx,yy = np.meshgrid (xi,yi)
Z = Jacobi_const(xx,yy,x_S,y_S,x_E,y_E)
ax.contour(xx,yy,Z, levels=Z_levels,zorder=2,cmap=my_cmap,norm=norm,linewidths=1)
ax.set_ylabel("Y (AU)")
ax.set_xlabel("X (AU)")
ax.set_yticks(np.arange(-1,1.5,0.5))
ax.set_xticks(np.arange(-1,1.5,0.5));
[3.00000347 3.0008897  3.00093672 3.00095947 3.00106035 3.00124328
 3.0012661  3.00147675 3.00175584 3.00240875 3.03227121 3.69332466
 6.75659148]
../_images/a1256c8d58cedf9a07a35b7543ef871279bc4d3b53c6af37af90590addeacde5.png

The squashing in the vertical (see panel inset) is even more obvious in this regime. Rosario-Franco et al. (2020) shows using numerical simulations that the orbital stability of satellites are limited to \({\sim}0.4\ R_H\) for a host planet on a circular orbit. The stability decreases as the host planet’s orbit is more eccentric, as we may naively expect from Eqn. (4.5). The orbital stability formula for prograde (Rosario-Franco et al. (2020)) and retrograde (Quarles et al. (2021)) are:

(4.6)#\[\begin{align} a_{\rm crit}(R_H) &= 0.4061(1 - 1.1257e_p), \qquad &{\rm (prograde)} \\ a_{\rm crit}(R_H) &= 0.668(1 - 1.236e_p). \qquad &{\rm (retrograde)} \end{align}\]

The above formula demonstrate that general expressions for orbital stability are possible that scale with the observationally measurable parameters. Typical observations can determine the semimajor axis through either photometric modeling or analysis using the measured radial velocity curve. If the system is favorable (i.e., edge on), then the masses can also be determined and with many more observations, the eccentricity can be estimated. The Hill radius directly depends on the semimajor axis and masses, \(R_H = a_{\rm bin}(\mu/3)^{1/3}\).

4.2.2. Mutual Hill radius and scaling#

The Hill radius is strictly defined for a three body system, but can the idea be expanded to include systems of multiple planets. One of the first papers to investigate this idea with a specific application comes from Gladman (1993). There were previous works in the 1980s that use algebraic topology to characterize orbits using their energy and angular momentum (see Marchal & Bozis (1982)). Gladman (1993) wanted to apply the earlier works towards the problem of two nearby planets orbiting a single star.

This first section is following the notation given in Gladman (1993), but we should be bound by notation. If you review the literature that stems from Gladman (1993), you’ll find that there isn’t a consistent notation that everyone agrees to use anyway.

The central body (i.e., host star) has a mass \(m_3\), which is orbited by two planets on concentric orbits with masses \(m_1\) (inner orbit) and \(m_2\) (outer orbit). The semimajor axis of the inner orbit \(a_1 = 1\). If you scale the problem by this inner orbit, it will also be equal to unity while the outer orbit \(a_2\) will be some (real) multiple of the inner orbit (e.g., \(a_2 = 1.5 a_1\)). The planets begin with a differential initial longitude of \(180^\circ\) (\(\delta f = f_2 - f_1 = 180^\circ\)).

Gladman setup

Fig. 4.1 Initial setup used to define the initial separation \(\Delta\) (Gladman (1993)).#

Figure 4.1 illustrates that we can then parametrize the initial separation of orbits using \(\Delta = a_2 - a_1\), or \(a_2 = 1 + \Delta\) in normalized units. Through another transformation, the separation \(\Delta\) can be written in terms of Hill’s coordinates (see Henon & Petit (1986)), which scale with the Hill radius \(R_H\). If we normalize with respect to the Hill radius, then we define

(4.7)#\[\begin{align} \Delta_H &= \frac{a_2 - a_1}{R_H}. \end{align}\]

The above expression appears straightforward if you assume that \(R_H\) describes the Hill radius of \(m_1\). Then \(\Delta_H\) is describing the separation between the two orbits in units of \(R_H\). However, it’s not as clear once you expand to more than two planets or if you want to vary the planetary masses.

Gladman (1993) does show a critical value of \(\mathbf{\Delta_H = 2\sqrt{3}}\) for two planets on initially circular orbits. This is not a bad assumption if you consider arguments from planet formation where it is expected that the planetary eccentricity could be significantly damped due to drag forces within the protoplanetary disk resulting in near circular orbits.

A few years later, Chambers et al. (1996) take a slightly different approach by defining the mutual Hill radius. The mutual Hill radius uses the total mass interior to the \(j+1^{\rm th}\) planet’s orbit \(M_j\) and the average semimajor axis of the \(j^{\rm th}\) and \(j+1^{\rm th}\) planet. Mathematically, this is expressed as

(4.8)#\[\begin{align} R_{H_{j,j+1}} &= \frac{a_j + a_{j+1}}{2} \left[ \frac{m_j+m_{j+1}}{3M_j} \right]^{1/3}, \\ M_j &= \sum_{j=1}^j m_j. \end{align}\]

When using this formalism to setup a planetary system, the value for \(a_{j+1}\) is not known a priori. It can be written explicitly in terms of \(a_{\rm j}\), the masses (\(m_j\), \(m_{j+1}\), \(M_j\)), and \(\Delta_H\).

Note

The symbol representing the Hill spacing \(\Delta_H\) varies among different authors, where Pu & Wu (2015) use \(K\) and Smith & Lissauer (2009) use \(\beta\). For consistency here, \(\Delta_H\) is used.

The explicit formula to define \(a_k\), the semimajor axis of the \(k^{\rm th}\) planet, is given as

(4.9)#\[\begin{align} a_k &= a_j \left[\frac{1+\Delta_H X}{1-\Delta_H X} \right]^{k-j}, \\ X &= \frac{1}{2}\left(\frac{m_j+m_{k}}{3M_j} \right)^{1/3}, \end{align}\]

where \(k>j\) and are both integers (Obertas et al. 2017). This general formula can be applied for \(k = j + 1\) to obtain a recursive formula as well.

The python code demonstrates the setup from Gladman (1993) using an inner Jupiter-mass planet and outer Earth-mass planet separated by \(5\ R_{H,m}\), mutual Hill radii, for a 2-planet system.

import numpy as np 
import matplotlib.pyplot as plt

M_star = 1 #central mass in M_sun
m_1 = 9.54e-4 #Jupiter mass in M_sun
m_2 = 3.0035e-6 #Earth mass in M_sun

a_1 = 1 #inner orbit in normalized units (1 AU)
X = 0.5*((m_1+m_2)/(3*(M_star+m_1)))**(1./3.)
delta_H = 5

def calc_orbit(a,e,omg):
    #calculate Cartesian orbit given a, e, omega
    f = np.arange(0,2*np.pi,0.01) 
    r = a*(1-e**2)/(1.+e*np.cos(f))
    return (r*np.cos(omg+f), r*np.sin(omg+f))

def get_semi(delta_H,a_j,X,k,j):
    #calculate a_k given a_j
    return a_j*((1+delta_H*X)/(1-delta_H*X))**(k-j)

a_2 = get_semi(delta_H,a_1,X,2,1)
x_1,y_1 = calc_orbit(a_1,0,0)
x_2,y_2 = calc_orbit(a_2,0,np.pi)

fig = plt.figure(figsize=(5,5),dpi=150)
ax = fig.add_subplot(111)

ax.plot(0,0,'.',color='y',ms=20)
ax.plot(x_1,y_1,'k-',lw=2)
ax.plot(x_1[0],y_1[0],'c.',ms=10)
ax.plot(x_2,y_2,'-',color='gray',lw=2)
ax.plot(x_2[0],y_2[0],'b.',ms=10)

ax.set_xlabel("X (AU)")
ax.set_ylabel("Y (AU)");
../_images/cd6fbf9c393de0ea78a3680845ffcd4c57400e58143ba7c6342deecc2ade05af.png

4.3. The Golden ratio to debias the initial orbital phase#

The setup from Gladman (1993) starts the two planets at opposition. This is not a completely arbitrary choice for the initial phase of the orbits. There are two processes at play over the first few orbits: a) interplanetary perturbation through gravity and b) differences in mean motion.

Suppose we started both planets from circular orbits with the same true anomaly \(f=0^\circ\) (i.e., opposition). The inner planet would begin its orbit not with the single dominant force from the host star, but with a significant perturbation from the outer planet (due to the close relative proximity). The force from the host star is still larger due to the stellar mass, but the outer planet is leveraging its closer proximity to make up for its much lower mass. We could start the planets a slightly different true anomalies to account for this initial bias in forcing, where the mitigation of this bias appears to be achieved at conjunction (i.e., differential true anomaly \(\Delta f = 180^\circ\)).

The above argument is only half of the story. From Kepler’s third law of planetary motion, the orbital period of a planet with a square proportionally with the planetary semimajor axis cubed (\(T_p^2 \propto a_p^3\)). The mean motion \(n\) is simply converting the orbital period into an orbital frequency (e.g., \(n = 2\pi/T_p\)). Each planet orbits with a different frequency, where more distant planets have a lower frequency.

Even though the planets start out with a large differential true anomaly, this will not be true for long. The faster inner planet will catch up to the outer planet and be introduced to the outer planet’s perturbation within a half-orbit of each planet. The inner planet will experience repeated perturbations each orbit, but the force from each perturbation (‘kick’) may not always be along the same direction of motion resulting in random gravitational kicks. If the perturbations occur at periodic intervals, then orbital resonance can destabilize the orbit by increasing the inner planet’s orbital speed (i.e., increasing its orbital eccentricity), which can lead to orbit crossing. See the video below for a refresher on orbital resonance.

Orbital resonances occur at any periodic (integer) ratio, where the setup by Gladman (1993) introduces a bias when the planets begin near a 2:1 resonance (i.e., two inner planet orbits for every orbit of the outer planet). To address this issue, we can take some advice from different patterns found in nature. One pattern emerges from plants, where they orient themselves to catch the maximum attention from pollinators (e.g., sunflowers) or the maximum area to receive sunlight. The video below demonstrates this pattern at about 14:00 with pine cones and pineapples.

The pattern can be seen again in the spiral arms of galaxies, which is explained in detail in the video below.

Since our goal is to avoid orbital resonances (i.e., rational numbers), we must seek out an irrational number. The most irrational number is the golden ratio \(\varphi\), which can ironically be represented by ratios:

(4.10)#\[\begin{align} \varphi = \frac{1 + \sqrt{5}}{2} = 1.618033988749\ldots, \\ \varphi^{-1} = \varphi - 1 = 0.618033988749\ldots, \end{align}\]

and can be related to Fibonacci numbers.

Smith & Lissauer (2009) and Quarles & Lissauer (2018) use the golden ratio to initialize the orbits of multiple planet systems. This is done simply by taking integer multiples of \(\phi\) to define the initial mean/true anomaly for the \(j^{\rm th}\) planet. Mathematically, we get

(4.11)#\[\begin{align} f_j = (j\varphi \cdot 360^\circ) \mod 360^\circ,\ \qquad (j\geq 1) \end{align}\]

where the \({\rm mod}\) operator refers to the modulo that returns the remainder to keep the result within the bounds of \(0-360^\circ\). To convert the above scheme into radians, simply replace \(360^\circ\) with \(2\pi\).

4.4. A sample simulation#

Let’s create a Solar System analog that contains a solar-mass star, Jupiter-mass planet, and 3 Earth-mass planets. The Jupiter-mass planet is an analog to Jupiter in our Solar System, where it has a semimajor axis \((a_J = 5.2\ AU)\) and eccentricity \((e_J \approx 0.05)\) similar to Jupiter’s current orbit. The three Earth-mass planets will be added using our rules described above for planet packing (i.e., initially spaced using their mutual Hill radius and arranged using the golden ratio) and beginning on circular orbits.

The first Earth-mass planet begins at \(1\ {\rm AU}\), and the other two planets have initial semimajor axes that have \(\Delta_H = 10\). This could be calculated by-hand, but we create a function (get_semi) below given the indices of the \(j^{\rm th}\) and \(k^{\rm th}\) planets, which makes it light work. As a result, we have \(a_j\) for each planet as:

(4.12)#\[\begin{align} a_1 &= 1\ {\rm AU}, \\ a_2 &= 1.1345\ {\rm AU}, \\ a_3 &= 1.2871\ {\rm AU}, \\ a_4 &= 5.2\ {\rm AU}. \end{align}\]

The true anomaly (i.e., initial orbital phase) of each planet is calculated using a function get_TrueAnomaly, which uses the index of the \(j^{\rm th}\) planet.

In the n-body simulation, we save the system states every 5 years over a 500-year timescale using a simulation archive. Through this timescale, we are looking to see how the inner system of Earth-mass planets is perturbed by the gravity from the Jupiter-analog. The perturbations could be strong enough such that two planets come quite near each other and a fixed timestep algorithm (i.e., whfast) may not be sufficient. In this example, we use the IAS15 integrator to account for these close approaches, although it may not be the most efficient.

Note

Some works (e.g., Smith & Lissauer (2009)) use a hybrid scheme (Chambers (1999)) for the numerical integration. In this scheme the Wisdom-Holman mapping method (Wisdom & Holman (1991)) is used when the separation between planets is greater than \({\sim}3\ R_{H,m}\) and Burlisch-Stoer method is used otherwise. This method is quite efficient and accurate when the close approaches are rare.

import rebound
import numpy as np 

def get_semi(delta_H,a_j,k,j):
    #calculate a_k given a_j (k>j)
    M_tot = np.sum(Mass[:k])
    X = 0.5*((Mass[j]+Mass[k])/(3*M_tot))**(1./3.)
    return a_j*((1+delta_H*X)/(1-delta_H*X))**(k-j)

def get_TrueAnomaly(j):
    return (j*phi*2*np.pi) % (2*np.pi)

def simulation(tscale,fname,out_step):
    sim = rebound.Simulation()
    sim.integrator = 'ias15'
    sim.units = ('yr', 'AU','Msun')
    sim.dt = 0.05*T_1 

    sim.add(m=M_star)
    for j in range(1,5):
        f_j = get_TrueAnomaly(j)
        sim.add(m=Mass[j],a=semi[j],e=ecc[j],f=f_j)
    sim.move_to_com()

    sim.automateSimulationArchive(fname,step=out_step,deletefile=True)
    sim.integrate(tscale)

M_sun = 1 #solar mass in M_sun
M_E = 3.0035e-6 #Earth mass in M_sun
M_J = 9.54e-4 #Jupiter mass in M_sun
phi = (1+np.sqrt(5))/2.

M_star = M_sun
m_1 = M_E
m_2 = M_E
m_3 = M_E
m_4 = M_J
Mass = [M_star,m_1,m_2,m_3,m_4]

a_1 = 1 #innermost orbit starting a 1 AU
mu_1 = (m_1/M_star)**(1./3.)
T_1 = np.sqrt(a_1**2/M_star) #innermost orbital period in years
delta_H = 10
semi = [0,a_1]
for k in range(2,4):
    semi.append(get_semi(delta_H,a_1,k,1))
semi.append(5.2)
print(semi[1:])
ecc = np.zeros(5)
ecc[-1] = 0.05

output_bin = "SSanalog_test.bin"
output_steps = 5
simulation(500,output_bin,output_steps)
[1, 1.1345183686262765, 1.2871316026156066, 5.2]

After running the code, it is usually easiest to interpret the results through a series of plots/figures. Such diagrams need to show a couple of things:

  • How are the orbits changing over time? Are they getting more or less elliptical? Do they cross one another?

  • How do the orbital elements change over time?

First we plot the positions of each planet using all the data, where we will need an inset panel to see the inner system better.

import rebound
import numpy as np
import matplotlib.pyplot as plt 
from mpl_toolkits.axes_grid1.inset_locator import inset_axes

#plot the simulation
sa = rebound.SimulationArchive(output_bin)

fig = plt.figure(figsize=(5,5),dpi=150)
ax = fig.add_subplot(111)
ax.set_aspect('equal')
axins = inset_axes(ax, width="150%", height="150%", bbox_to_anchor=(.9, .7, .3, .5), bbox_transform=ax.transAxes)
axins.set_aspect('equal')

ax.plot(0,0,'kx',ms=5)
xy_coords = np.zeros((len(sa),2*4))
for s in range(0,len(sa)):
    sim = sa[s] #iterate through each snapshot in sa
    ps = sim.particles #intermediate object to simplify the referencing
    sim.move_to_hel() #shift to center-of-mass coordinates
    for p in range(0,len(ps)-1):
        #Cart = np.array([ps[p].x, ps[p].y, ps[p].z])
        #rot_xy = Rot(-ps[1].Omega,-ps[1].inc,Cart)
        xy_coords[s,2*p] = ps[p+1].x
        xy_coords[s,2*p+1] = ps[p+1].y

color = ['r','c','b','orange']
for p in range(0,4):   
    ax.plot(xy_coords[:,2*p],xy_coords[:,2*p+1],'.',color=color[p],ms=2,alpha=0.3)
    ax.plot(xy_coords[0,2*p],xy_coords[0,2*p+1],'.',color=color[p],ms=10)
    if p < 3:
        axins.plot(xy_coords[:,2*p],xy_coords[:,2*p+1],'.',color=color[p],ms=2,alpha=0.3)
        axins.plot(xy_coords[0,2*p],xy_coords[0,2*p+1],'.',color=color[p],ms=10)
axins.plot(0,0,'kx',ms=10)
axins.set_xlim(-1.25,1.25)
axins.set_ylim(-1.25,1.25)

ax.set_xlabel("x (AU)",fontsize=16)
ax.set_ylabel("y (AU)",fontsize=16);
../_images/bc0b7de21ea671e9918187d88f818d32fd11bcfc02ddc9d1b2462215382328a9.png

The next view plots only the magnitude \(e_p\) and direction \(\omega_p\) for the eccentricity vector of each terrestrial planet. Along with the vector components of the respective eccentricity vectors.

The strength of gravitational perturbation for each terrestrial planet is modulated by its distance from the outer Jupiter-mass planet. Therefore, the eccentricity for each of the terrestrial planets should respond to the gravity of the Jupiter-mass planet and to the smaller perturbations from its neighbor planets.

import rebound
import numpy as np
import matplotlib.pyplot as plt
from matplotlib.gridspec import GridSpec

#plot the simulation
sa = rebound.SimulationArchive(output_bin)

fig = plt.figure(figsize=(10,5),dpi=150)
gs = GridSpec(2, 4, figure=fig,wspace=0.9,hspace=0.15)
ax1 = fig.add_subplot(gs[:1,:2])
ax2 = fig.add_subplot(gs[1:,:2])
ax3 = fig.add_subplot(gs[:,2:],aspect='equal')

time = np.zeros(len(sa))
pl_orb = np.zeros((len(sa),4,3))
for s in range(0,len(sa)):
    sim = sa[s] #iterate through each snapshot in sa
    ps = sim.particles #intermediate object to simplify the referencing
    sim.move_to_hel() #shift to center-of-mass coordinates
    time[s] = sim.t
    for p in range(0,len(ps)-1):
        pl_orb[s,p,:] = [ps[p+1].a,ps[p+1].e,ps[p+1].omega]

color = ['r','c','b','orange']
for p in range(0,3):
    e_p, omg_p = pl_orb[:,p,1], pl_orb[:,p,2]
    omg_p[omg_p>np.pi] -= 2*np.pi
    ax1.scatter(time,e_p,marker='o',color=color[p],edgecolor='None',s=4,alpha=0.5)
    ax2.scatter(time,np.degrees(omg_p),marker='o',c=color[p],edgecolor='None',s=4,alpha=0.5)
    ax3.scatter(e_p*np.cos(omg_p),e_p*np.sin(omg_p),marker='o',c=color[p],edgecolor='None',s=4,alpha=0.5)
ax1.set_ylabel("$e_p$")
ax2.set_ylabel("$\omega_p$ (deg.)")
ax3.set_ylabel("$e_p\ \sin{\omega_p}$")

ax1.set_xlabel("Time (yr)")
ax2.set_xlabel("Time (yr)")
ax3.set_xlabel("$e_p\ \cos{\omega_p}$")
ax2.set_yticks(np.arange(-180,180+60,60))
ax3.grid(True,alpha=0.3);
../_images/180e4d5a54673f5456ff75f4bbcd6011f0351ec464c0732ee94aa0b1a9d07c00.png

4.5. References#