I am trying to understand how big O is used for multiple variables, especially within the context of mathematical optimization. For example, in this paper, they mention a previous iteration complexity result (on page 2, under "Our results and their relation to previous work:"), which studied a support vector machine optimization algorithm and found that it takes $O(m^4/\epsilon)$ iterations in order to be within $\epsilon$ of optimality, where $m$ is the number of variables. This usage of big O has multiple variables, which are $m$ and $\epsilon$. The exact line is quoted below:
Combining these results, we see that the decomposition algorithm of Hush and Scovel for problem SVO is within $\epsilon$ of optimality after $O(m^4/\epsilon)$ iterations.
I am currently confused by the exact meaning of this use of big O. So far, any references I have found that discuss big O with multiple variables solely discuss the case where the variables of interest approach infinity (see Wikipedia). However, for the example I have provided, I feel like they are interested in the case where $m\rightarrow \infty$, and $\epsilon \rightarrow 0$. I know that big O can be defined in the univariate case so that it covers both finite and infinite limits as follows: $$ f(x) \text{ is } O(g(x)) \text{ as } x \rightarrow a \text{ if } \limsup_{x \rightarrow a} \frac{|f(x)|}{g(x)} < \infty,$$ where $a \in \overline{\mathbb{R}}$ with $\overline{\mathbb{R}}:=\mathbb{R}\cup\{-\infty,+\infty\}$. Intuitively speaking, I think this definition can be extended to the case where $f,g$ are defined on $\mathbb{R}^d$ as follows: $$ f(x) \text{ is } O(g(x)) \text{ as } x \rightarrow a \text{ if } \limsup_{x \rightarrow a} \frac{|f(x)|}{g(x)} < \infty,$$ where $ a \in \overline{\mathbb{R}}^d$. Using this definition and letting $N(m,\epsilon)$ denote the iteration complexity, my current interpretation of the result in the example is that $N(m,\epsilon) \text{ is } O(m^4/\epsilon)$ as $(m,\epsilon)\rightarrow (+\infty,0)$.
This leads me to several related questions:
- Is the definition I have provided for big O in multiple dimensions something that is standard, or is it just jibberish?
- Is my interpretation according to the definition I provided correct? If not, what is the correct interpretation of the iteration complexity result?
- It feels like most papers are written assuming readers already know the meaning of big O for iteration complexity... Does anyone ever provide its precise meaning?