5
$\begingroup$

Short version:

Before solving the problem (in particular, before identifying the cycloid as the only possible minimizing curve), is there any physical intuition why we should expect there to be a smooth path from $(0,1)$ to $(1,0)$ (or any other two particular points) which minimizes the time taken for an ideal marble to roll? In particular, is there a way to think of this in terms of some notion of "potential energy"?


Longer version:

Despite the lack of an extreme value theorem, some calculus of variations problems can be expected to be solvable based on the intuition that "ground states" exist for any meaningful notion of energy. E.g. in the hanging cable problem, we have a clear notion of gravitational potential energy, and the nonexistence of a minimizer for the corresponding functional would tell us that (in an idealized universe) a hung cable would keep wobbling into ever-lower energy states. This doesn't stand up to much scrutiny at a rigorous level, but at least for me (and my students) it gives some intuitive expectation that there should actually be a minimizer for the relevant functional.

The brachistochrone problem, however, doesn't seem to have an interpretation in the same way: when I taught it I wasn't able to come up with an idealized physical process which would approach a solution by way of decreasing some notion of potential energy. I'm curious whether such an explanation exists, or at least whether there is some physical intuition for why we should expect the brachistochrone functional to have a minimizer.

(This is really part of a larger confusion of mine, namely "Given the terrible failure of compactness in all reasonable function spaces why do so many natural/interesting calculus of variations problems turn out to be solvable?," but I think that's too broad to be appropriate for this site.)

New contributor
Noah Schweber is a new contributor to this site. Take care in asking for clarification, commenting, and answering. Check out our Code of Conduct.
$\endgroup$
4
  • 3
    $\begingroup$ "Given the terrible failure of compactness in all reasonable function spaces why do so many natural/interesting calculus of variations problems turn out to be solvable?" -- you might consider that we, as mere mortal humans, might happen to be interested in problems that are reasonably simple and thus are more likely to have solutions. $\endgroup$ Commented 12 hours ago
  • $\begingroup$ Compactness seems like something far stronger than necessary. A completeness property on the metric space of paths (say, continuous and of bounded variation) ought to be enough. $\endgroup$ Commented 8 hours ago
  • $\begingroup$ I copy a paragraph from my existing answer: The Brachistochrone problem straddles two categories of application of calculus of variations. It's a Statics problem in the sense that the solution is a static shape: which shape has the property that sliding along it happens in the fastest time? But to get at the solution Dynamics must be applied: given a curve, how fast will an object slide down the curve? It's not a Statics problem: something is moving. It's not a Dynamics problem: for any value of the gravitaitonal acceleration the solution has the same shape. $\endgroup$ Commented 7 hours ago
  • $\begingroup$ Re last paragraph: But we do use compactness several times when discussing minimizers of functionals (particularly in the Sobolev space context). Namely we use Banach-Alaoglu and also the Rellich compactness theorems. See Evans’ PDE book chapter 8 for explicit assumptions on the class of functionals considered. $\endgroup$ Commented 5 hours ago

2 Answers 2

5
$\begingroup$

Before solving the problem (in particular, before identifying the cycloid as the only possible minimizing curve), is there any physical intuition why we should expect there to be a smooth path from $(0,1)$ to $(1,0)$ (or any other two particular points) which minimizes the time taken for an ideal marble to roll?

Of course. Let's generalize and say $A=(0,1)$ and $B=(1,0)$; usually we will assume that the marble is operating under the sole influence of gravity, in which case there is a maximum possible acceleration toward $B$, and the time it would take to cross the $\sqrt2$ distance between $A$ and $B$ under constant acceleration $g$ is $\Delta t\ge \sqrt{\frac{2d}{a}}=\sqrt{\frac{2\sqrt 2}{g}}\approx \frac{1.6818}{\sqrt g}$. But when $B$ does not lie directly below $A$, the marble will not spend its entire journey accelerating at $g$ towards $B$ (even if the path were a straight line so that the acceleration were always directed towards $B$, there would be a $\cos \theta$ term that would reduce the acceleration parallel to the line connecting $A$ and $B$), so we have already established a lower bound.

There being a lower bound, there is obviously some path in the space of all possible paths which either takes less time than every other possible path or takes exactly $\approx\frac{1.6818}{\sqrt g}$ time, since it is not possible to accelerate the marble downwards any faster than $g$ to overcome that lower bound; as such, since the amount of time taken is bounded from below, there is some nonzero set of time-minimizing paths. In general, there may or may not be exactly one solution due to this boundedness; in the brachistochrone problem in particular, there is exactly one solution which does minimize the time taken, to approximately $\Delta t\approx \frac{1.826}{\sqrt g}$ in our case, which indeed does very closely approach our lower bound.

Note that gravitational potential didn't come into it at all. If the problem is minimizing a particular variable, all you have to do to prove that there exists some set of optimal minimizing solutions is to bound the variable from below; this makes it so that there is some set of solutions that necessarily cannot be improved upon, since at a certain point (the lower bound in particular), it stops being possible to make the variable any lower and improve a solution any further. Whether these solutions will ever be realized in a physical system (re your wobbling cable) is another matter.

$\endgroup$
2
  • 1
    $\begingroup$ " there is obviously some path in the space of all possible paths which either takes less time than every other possible path or takes exactly [optimal bound]" This is exactly the piece I don't understand in general, and the claim "all you have to do to prove that there exists some set of optimal minimizing solutions is to bound the variable from below" is false in general since the domain of the functional isn't compact. $\endgroup$ Commented 9 hours ago
  • $\begingroup$ Put another way, the role of (gravitational) potential for me is exactly to produce some intuition that boundedness from below should imply existence of a minimizer. $\endgroup$ Commented 8 hours ago
1
$\begingroup$

In my opinion a fundamental insight into the nature of the Brachistochrone problem was put forward by Jacob Bernoulli.

Jacob Bernoulli was the older brother of Johann Bernoulli. Johann Bernoulli was the mathematician who put out the Brachistochrone problem as a challenge to the mathematicians of his time.

Jacob Bernoulli opened his discussion of the Brachistochrone problem with a lemma illustrated with the following diagram:

Jacob Bernoulli Lemma

Diagram 1

Lemma. Let ACEDB be the desired curve along which a heavy point falls from A to B in the shortest time, and let C and D be two points on it as close together as we like. Then the segment of arc CED is among all segments of arc with C and D as end points the segment that a heavy point falling from A traverses in the shortest time. Indeed, if another segment of arc CFD were traversed in a shorter time, then the point would move along AGFDB in a shorter time than along ACEDB, which is contrary to our supposition.


(The diagram is copied from the file with the scan of the of the actual diagram, as published by Jacob Bernoulli in the Acta Eruditorum, May 1697, pp.211-217)

As of this writing available on Archive.org
Source of the diagram



So what Jacob Bernoulli was pointing out:
If you have the solution, a specific curve, and you take any subdivision of that overall curve, then that subsection is by itself an instance of the Brachistochrone problem, and that is valid down to infinitesimally small subsections.


The same from another angle:
The Brachistochrone curve has a property that I will refer to as being 'concatenable'.

The catenary problem is well suited to illustrate that property:
The curve that is the solution to the catenary problem has the property that every subsection of it is an instance of the catenary problem, down to infinitesimally small subsections.

The catenary problem should yield to an approach where you set up an equation that is generic for any of the infinitesimally small subsections in one go.

As we know: the type of equation that meets those demands is a differential equation.

A differential equation is a global equation in the following sense: the curve that is the solution to the differential equation satisfies the differential relation along the entire length of the curve all in one go.

More generally, the solution space of a differential equation is a space of functions, instead of a space of values, as is the case with, say, the type of equation that finds the roots of a polynomial.


In contrast with the above: there is the well known 'Traveling salesman problem'. Let's say you have a set of 10 cities. Now divide the total area into two adjacent areas A and B, such that each area encompasses 5 cities. Set up an optimized itinary for area A and an itinary for area B. What happens if you put those to itinaries back to back? It is unlikely in the extreme that that would result in an itinary that is optimized for the total set of 10 cities. The problem is non-concatenable.




Specific to the Brachistochrone problem:
The Brachistochrone problem straddles two categories of application of calculus of variations. It's a Statics problem in the sense that the solution is a static shape: which shape has the property that sliding along it happens in the fastest time? But to get at the solution Dynamics must be applied: given a curve, how fast will an object slide down the curve?

So yeah, the Brachistochrone problem is rather a falling-in-between-categories case.



About the dynamics of the Brachistochrone problem:

As an object slides down the incline it accumulates velocity, in accordance with $F=ma$.

The Brachistochrone problem would be significantly harder to solve if the amount of acceleration would in any way be a function of already existing velocity.

Fortunately that is not the case: the amount of velocity that the sliding object gains per unit of height is independent of whatever velocity the sliding object has previously gained.



Further reading:

Paul Rojas, 2014
The straight line, the catenary, the brachistochrone, the circle, and Fermat


Preetum Nakkiran:
Geometric derivation of the Euler-Lagrange equation

Preetum Nakkiran uses the catenary problem as motivating example.

We have that the Euler-Lagrange equation is a differential equation. The explicit expression states a local condition, but of course the demand is that it is satisfied for the entire curve all at once, constituting a global demand. Since the explicit statement states a local condition it should be possible to derive the Euler-Lagrange equation using differential reasoning only.

Preetum Nakkiran proceeds to do that: the Euler-Lagrange is derived using differential reasoning only; no involvement of integration by parts, or any integration for that matter.




What it takes to have solid expectation that a single extremum exists

Some time ago I implemented what in effect is numerical analysis of the catenary problem. (The implementation is an interactive diagram, part of an article about calculus of variation that is available on my website.)

I divided the catenary in straight subsections, the subsections connect nodes. In the interactive diagram the nodes can be moved vertically. Moving a node changes the length of the total catenary. Counterweights exert a constant force.

When all the nodes are moved down the potential energy of the counterweights is the dominant term; pulling the nodes further down increases the total potential energy.

When all the nodes are up high the weight of the catenary has large mechanical advantage, so moving the nodes down lowers the total potential energy.

To find the node arrangement with the lowest possible potential energy: iterate to converge onto the extremum.
If there would be 4 nodes:
To the left of node 1 is the suspension point, to the right of node 1 is node 2.

Move node 1, finding the extremum, relative to the current position of node 2.
Proceed to adjust node 2: find the extremum, relative to the current positions of node 1 and node 3.
Rinse and repeat.

Keep cycling the sequence of nodes.

This iterative process converges onto a global extremum. The fact that the solution space is subdividable ensures that there is a single, unique solution.

The larger the number of nodes, the better the approximation onto the analytical solution.


Porting the numerical analysis reasoning to the Brachistochrone problem

The crudest implementation would be a single node in between the two fixed end points. That would give a profile consisting of two straight sections, joined at the node. Obtain the gained velocity from the height difference. Changing the height of the node changes the length of each section. Under those circumstances the existence of an extremum is garanteed.

With a large number of nodes:
Then each triplet of nodes is converged to its extremum, iterating over all the noddes. In the converged end state: each triplet is at its extremum, therefore the convergeed state is a global extremum.

$\endgroup$
2
  • 1
    $\begingroup$ This is helpful for solving the brachistochrone problem, but not for giving pre-solution intuition that a minimizer exists in the first place - which is what I'm asking about here. $\endgroup$ Commented 9 hours ago
  • $\begingroup$ @NoahSchweber I added to the answer. Dscussion: "What it takes to have solid expectation that a single extremum exists" $\endgroup$ Commented 7 hours ago

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.