1 Introduction

For a historical understanding of the development of Weierstrass’s notion of uniform convergence, it is necessary to understand the connection of the concept as inserted into a text with its context. This paper will mainly focus on the connection between the (verbal and symbolic) form and the mathematical content. Especially in the nineteenth century, we see the growing role of symbols for explicating the meaning of mathematical concepts—until the definitive symbolic form of mathematical statements is achieved. It is on passing from earlier, more rhetorical forms of text, to more symbolic forms that, for instance, ambiguities in verbal formulations become clarified. An important example of this is the emergence of the absolute sign notation. In the course of his lectures, Weierstraß introduced his concepts of uniform convergence, using three different verbal formulations that can be assigned to different periods of the development of this concept in terms of mathematical rigor and applications (i.e., theorems). Behind some of his introductions of uniformity lay a mathematical condition slightly different from uniform convergence. His introductions of uniformity can be conceptually separated into uniform convergence and the Cauchyan uniformity condition (that is to say a sequence of functions is uniformly Cauchy). Under the given circumstances, this notion is equivalent to uniform convergence, which turned out to become the classical notion of the concept. Below I will generally use the wording “uniform convergence” as a generic term for these two notions.

2 Did Weierstraß introduce uniform convergence as early as in 1841?

2.1 The role of Christoph Gudermann and his formulations of convergence

After the young Karl Weierstraß abandoned his studies at the University of Bonn, he took on more serious studies of mathematics at the Akademie in Münster in 1839–40, where he became a student of Christoph Gudermann. Weierstraß attended his course on elliptic functions and started to work on his teacher’s Theorie der Modular-Functionen, which Gudermann began to publish in Crelle’s Journal in 1838. This work - dealing with the convergence of infinite series in the context of Jacobian elliptic functions - contains a variety of formulations for different rates of convergence, but these rates of convergence are not introduced or explained. He often used the term rasche Konvergenz (fast convergence, cf. Gudermann 1844, pp. 382, 558). In contrast to this, we find the formulations langsam (slow), gering or schwach (weak), indicating different rates of convergence (e.g., Gudermann 1844, pp. 196, 290). But we do not find any further definitions or specifications of these terms in his work. There is also no indication of a difference between the formulations rasch convergent (converging rapidly) and convergent im hohen Grade (convergent at a high rate). The latter term also occurs in his comparison of mathematical expressions with equally high rates of convergence (Gudermann 1844, p. 200). Both formulations, as well as their variantsFootnote 1, seem to describe rates of convergence in terms of pointwise convergence, however, in a quite intuitive way.

There is one passage that has been used to draw a connection to uniform convergence. Reinhard Bölling (1994) interpreted Gudermann’s following statement, which occurs only once and is entirely isolated, as a specific notion of uniform convergence: “It is a remarkable fact that both the infinite products in §58 and the series just found have an overall equal rate of convergence.”Footnote 2 Bölling claimed that the term “overall equal rate of convergence” refers to the notion of uniform convergence, attributing to the quoted statement that the same rate of convergence is meant to hold for all values of the variable. From his perspective, Gudermann seems to ignore the connection to uniform convergence: “As it seems, he did not draw any conclusions from it, let alone he made any sense of the significance of this type of convergence.”Footnote 3 Consequently, we do not find any definition whatsoever of the said term. Moreover, it is questionable to interpret the history of mathematics backward by stating that Gudermann took no notice of uniform convergence, when no concept of it was introduced at the time.

In his doctoral thesis, Kurt Richter (1976, pp. 135–6) assumes that Gudermann not only referred to the term “overall equal rate of convergence” in his publications, but “probably” also in his lecture courses on Modular-Functionen that Weierstraß attended. According to Richter, Weierstraß only adopted the term im gleichen Grad (at the same rate) from Gudermann, who used it in a very restricted manner.Footnote 4 This means that Weierstraß primarily adopted the wording, but not necessarily a concept of uniform convergence, since there is no known introduction of it in Gudermann’s work.

2.1.1 Reinhold Hoppe as a possible recipient of Christoph Gudermann’s formulation

A reception of Gudermann’s formulation Convergenz im gleichen Grade can be attributed to the textbook Lehrbuch der Differentialrechnung und Reihentheorie mit strenger Begründung der Infinitesimalrechnung (1865) by Reinhold Hoppe who studied mathematics at the Universities of Kiel, Greifswald, and Berlin and completed his Habilitation thesis at the University of Berlin in 1853. He could have known Gudermann’s Modular-Functionen, since he gave the following explanation for the rate of convergence for infinite series:

If the general terms \( u_{k} \) and \( v_{k} \) of two infinite series are positive for a sufficiently large number \( k \), the following four cases can be distinguished with respect to the quotient \( \frac{{v_{\omega } }}{{u_{\omega } }} \): it can be finite, infinitely small, infinitely large, or neither of these. In the first case, for sufficiently large \( k \), e.g., \( k \ge m \), \( \frac{{v_{k} }}{{u_{k} }} < c \) for some constant \( c \) […]. Therefore, both infinite series converge or diverge simultaneously, and one can say that they converge or diverge [175] with the same rate; or: they converge or diverge equally strong. […] They have the same rate of convergence, if the ratio of their general terms is finite for an infinitely large number \( k \).Footnote 5

In short, Hoppe’s “convergence with same rate” aims at the rate of convergence of an infinite series. Note that he did not explain, whether the condition \( \frac{{v_{k} }}{{u_{k} }} < c \) should hold for all or for just (in)finitely many numbers \( k \ge m \). The absence of a clarification of this matter is very common, and it can also be found in Weierstrass’s lectures, as we will see. If Hoppe knew Gudermann’s work, he adopted his term im gleichen Grade. However, the different rates of convergence of infinite series had already been considered as early as in the eighteenth century.Footnote 6

2.2 The revision of Weierstrass’s famous article from 1841

The first occurrence of uniform convergence in Weierstrass’s works is often dated to 1841. The corresponding paper Zur Theorie der Potenzreihen was published in Mathematische Werke vol. 1 (1894), but signed as “Münster, im Herbst 1841” (Weierstraß 1841, p. 74). In the introduction and in the notes of the Werke, it is pointed out that “most of the papers” has been revised and “special papers” have also been corrected by the editors and others (Weierstraß 1894, pp. v-vi, p. 356). Unfortunately, the scope of each of these revisions or corrections is not explained. In the case of the 1841 article, neither Alfred Pringsheim, nor Richter nor Bölling doubted that the text published in 1894 is identical with the original text.Footnote 7 A recent example is (Gray 2015), who just speaks of an “early” paper that Weierstraß “wrote in 1841” (pp. 199, 197). I will try to give some strong indications for a later revision regarding the introduction of uniform convergence.

From its first page, the article Zur Theorie der Potenzreihen consistently uses the sign for absolute values, namely a vertical line before and after an algebraic expression. Yet, when one looks into mathematical publications of that time, one will not find a use of this notation. Since the eighteenth century, the practice had been to continue in a rhetorical style, writing something like “the value of … taken absolutely.” Cauchy, in his Cours d’analyse algébrique, practiced a variation of such a rhetorical, sign-less denomination: “valeur numérique de … “. Enno H. Dirksen, although strongly following Cauchy, progressed to the abbreviated mode, the syncopated form: “v.n.”, followed by the algebraic expression in normal brackets. In the lecture notes of Weierstrass’s students, the modern notion of absolute value is absent until late in the 1860s.Footnote 8 There, Weierstraß himself never used the sign, while attributing it in 1894 to himself as of 1841. Before he began to use the modern absolute value sign, his way of expressing the absolute value was either literally “absoluter Betrag von … ” or “Absolutbeträge von … ” or round brackets, which can be found in Pasch’s script from 1865/66 (Fig. 1).Footnote 9 To my knowledge, this notation only appears again in a lecture course on elliptic functions from 1868/69Footnote 10 before it was replaced by the modern notation in a script from 1870/71Footnote 11. In the 1880s, we also find the abbreviation “absol. Betr. von …”.

Fig. 1
figure 1

Weierstrass’s notation for the remainder in connection with a definition of uniform convergence

I am indebted to a private communication from Gert Schubring that the apparently first introduction of that modern sign was made by August Leopold Crelle, in a book of 1823, but this remained an isolated practice, not adopted at the time by other mathematicians.

Furthermore, the 1841 article contains two major theorems, one of which is the double series theorem for power series with several variables. Here, Weierstraß assumed infinitely many power series to be uniformly convergent by using the term gleichmässig convergent (uniform convergent):

There are infinitely many ordinary power series given in a certain sequence:

$$ F_{0} \left( {x_{1} ,x_{2} , \ldots x_{\varrho } } \right), F_{1} \left( {x_{1} ,x_{2} , \ldots x_{\varrho } } \right), F_{2} \left( {x_{1} ,x_{2} , \ldots x_{\varrho } } \right), \ldots ., $$

and we suppose that in a given neighbourhood \( G \) of \( \left( {0,0, \ldots 0} \right) \) not only every single series in this sequence converges unconditionally and uniformly, but also the sum. If we denote the coefficients of \( x_{1}^{{v_{1} }} x_{2}^{{v_{2} }} \ldots x_{\varrho }^{{v_{\varrho } }} \) in \( F_{\mu } \left( {x_{1} ,x_{2} , \ldots x_{\varrho } } \right) \) with \( A_{{v_{1} ,v_{2} , \ldots v_{\varrho } }}^{\left( \mu \right)} \) and let

$$\sum_{\left( \mu \right)} A_{{v_{1} ,v_{2} , \ldots v_{\varrho } }}^{\left( \mu \right)} = A_{{v_{1} ,v_{2} , \ldots v_{\varrho } }} , $$

one can proof that \( A_{{v_{1} ,v_{2} , \ldots v_{\varrho } }} \) is finite and the equation

$$ \sum_{\mu = 0}^{\mu = \infty } F_{\mu } \left( {x_{1} ,x_{2} , \ldots x_{\varrho } } \right) = \sum_{\left( v \right)} A_{{v_{1} ,v_{2} , \ldots v_{\varrho } }} x_{1}^{{v_{1} }} x_{2}^{{v_{2} }} \ldots x_{\varrho }^{{v_{\varrho } }} $$

holds for every \( \left( {x_{1} ,x_{2} , \ldots x_{\varrho } } \right) \) from the said neighbourhood.Footnote 12

And, for a very similar theorem (Weierstraß 1841, p. 73), he used the term gleichförmig convergent (uniform convergent). But, based on the lecture notes discussed here, the first trace of the term gleichmäßig was found in a lecture course on elliptic functions from 1870/71, and the term gleichförmig first appeared in a course on analytic functions from 1880/81 (see Sect. 5.6).

These indications strongly suggest that the 1841 article was significantly revised in terms of its mathematical terminology and notation, which makes the dating of the first occurrence of uniform convergence in 1841 highly questionable.Footnote 13 This does not imply that Weierstraß did not use a concept of uniform convergence by this time; however, the terminology and notation related to uniform convergence in (Weierstraß 1841) was introduced years later in his lecture courses (see Sect. 5). As mentioned above, the scope of the corrections and revisions done for the publication in the Werke is unknown. Therefore, the first verifiable trace dates back to 1861, where Weierstraß introduced his concept of convergence at the same rate in a lecture course at the Gewerbeinstitut (Trade Institute) Berlin.

3 The role of Weierstrass’s lectures for the development of uniform convergence

Before Weierstrass’s appointment as a full professor, he published only a few research papers that potentially could deal with the notion of uniform convergence.Footnote 14

The 1843 article, which was originally published in the Jahresbericht of the Progymnasium of Deutsch Crone, was taken over unchanged into the Werke.Footnote 15 The same applies to the 1840 article, as Weierstraß himself stated (cf. Weierstraß 1894, p. 50). Neither work covers research relevant for our focus and the other two papers, especially the 1841 paper, have probably been revised many years later, as discussed in Sect. 2.2.

However, some lecture notes from the period of his teaching activities at the Gewerbeinstitut Berlin have been preserved. These lectures do not cover the topic of analysis in the same depth as the later Berlin University lectures do, since they primarily served a mathematical education of a more practical type. During his teaching activities at the University of Berlin, he did not publish the basic concepts of his approach of analysis, but gave continuously revised lectures reflecting the state of his research. Therefore, the most important resource for investigating the Weierstrassian history of uniform convergence is given by the lecture notes from between 1864 and the early 1880s. They allow a profound insight into Weierstrass’s teaching, which mark his most productive period of work. The level of authenticity is an important aspect and is not only determined by the author of the notes in question; from 1862, Weierstraß lectured seated due to health problems, and a selected student wrote his dictation on the blackboard. Therefore, alterations and errors could have crept in not only during writing down but also in the dictation process. With this in mind, the study of multiple notes of the same lecture course, but by different students is very enlightening.

Do we have any sources apart from the lecture courses from his period at the University of Berlin? The so-called mathematische Seminar was established in 1864, and thereafter Weierstraß gave advanced seminars for degree candidates. One can assume that his research topics were covered in these seminars, but we do not have any significant information on the subjects.Footnote 16

4 Weierstrass’s lectures at the Gewerbeinstitut and their reception

From 1856, Weierstraß taught at the Gewerbeinstitut and, due to his membership in the Royal Prussian Academy of Sciences (Berliner Akademie), he was also able to give some lectures at the University. In 1864, he finally became a full professor at the University of Berlin, and from 1864 his research results became increasingly (and mainly) known through his lectures. His students’ lecture notes, which soon reached out mathematicians outside Berlin and indeed Germany, were the determining factors of this development.

Our knowledge of Weierstrass’s work on uniform convergence is very incomplete for the time until he became a full professor at the University of Berlin. Before turning into a leading figure for the contemporary mathematicians in Germany and abroad, lecture notes of his students were not widespread. In addition, his lecture courses at the Gewerbeinstitut did not represent the state of his research, since that was not pertinent for a technical education. To shed some light into this period, we take a look at two lecture notes which come from courses held by Weierstraß at the Gewerbeinstitut.Footnote 17 These are G. Schmidt’s script on Differential- und Integralrechnung (1859)Footnote 18 and H. A. Schwarz’s lecture notes of Differentialrechnung (1861)Footnote 19. Schmidt’s notes basically represent an introduction to differential and integral calculus, without a consideration of the concept of convergence. The 1861 lecture notes come from one of Weierstrass’s most distinguished disciples and his eventual successor in Berlin, H. A. Schwarz, who described his notes as a “compact summary with weaknesses and imperfections.”Footnote 20 What follows is the first occurrence of convergence at the same rate in Weierstrass’s lectures. It is introduced in the context of the continuity theorem - i.e., Cauchy’s sum theorem; yet, Cauchy’s name is not evoked - and motivated by the (term by term) differentiation of infinite series:

A function is often composed of infinitely many other functions. E.g., a function can be represented by a sum or a product of infinitely many terms. We will now investigate how such a function represented by an infinite series can be differentiated; because the conclusion that the earlier developed rules readily apply to a sum of infinitely many terms is not rigorous and often even incorrect.

Let there be a series whose individual terms are continuous functions of a single variable, and it is assumed that

$$ \varphi_{0} \left( x \right) + \varphi_{1} \left( x \right) + \varphi_{2} \left( x \right) + \varphi_{3} \left( x \right) + \cdots in\;{infinitum} $$

is convergent for all \( x \) between two given limits; then, we must first examine under which circumstances the series represents a continuous function of \( x \). Only then, a series \( s = t_{0} + t_{1} + t_{2} + t_{3} + \cdots in\;inf. \) is called convergent, if the difference between the whole series and the sum of the \( n \) first terms \( s - s_{n} \) can be made as small as one would like by increasing \( n \), or if the sum \( t_{n} + t_{n + 1} + t_{n + 2} + t_{n + 3} + \ldots + t_{n + r} \), where\( r \) is an arbitrary positive number, can be made smaller than any given quantity by increasing \( n \), in such a way that the same \( n \) is to be taken for all values \( x \), and the property is retained, if \( n \) is replaced by \( n + m \), where \( m \). is an arbitrary [?]Footnote 21 number.

This shows that the kind of convergence is meant, which has since been called “convergence at the same rate” (author’s emphasis).Footnote 22

In the above quotation, the first definition describes the convergence of a series without a specific rate of convergence (“[…] if […] \( s - s_{n} \) can be made as small as one would like by increasing \( n \)”), whereas the second introduces convergence at the same rate (“…, or if the sum \( t_{n} + t_{n + 1} + \ldots \) […]”). In short, it is stated that \( s_{n + r} - s_{n - 1} \) should become small for arbitrary \( n \ge N \) and arbitrary \( r > 0 \). I interpret this as the Cauchyan uniformity condition. Here is a brief and modern definition that should be sufficient for the purpose of this article: Let \( s \) be a convergent series of continuous functions from a set \( X \) to a metric set \( Y \), and let \( s_{k} \) be the sum of the first \( k \) terms. Then \( s \) is called uniformly Cauchy, if for all \( \varepsilon > 0 \), there exists a number \( N > 0 \) such that for all \( x \in X \) \( \left| {s_{m} - s_{n} } \right| < \varepsilon \) holds whenever \( m,n > N \) (see discussion in Sect. 6). Assuming that \( Y \) is complete, the series \( s \) is also uniformly convergent. Since we have the completeness of the sets of real and complex numbers, which Weierstraß was aware of, the two concepts are equivalent.

The final sentence of the quote from p. 64 was probably added by Schwarz himself. On the one hand, this remark could have been added later by Schwarz to emphasize that the term “at the same rate” was used since 1861. On the other hand, it could indicate that Weierstraß introduced this new concept of convergence prior to his 1861 course (the German word “seither” can also mean “sth. has always been”). Schwarz possibly knew the new concept from another lecture course of Weierstraß at the Gewerbeinstitut. Still, it needs to be clarified how far this notion can be traced back in Weierstrass’s works.

Following the continuity theorem, he gave a theorem about the term by term differentiation of infinite series:Footnote 23

Let \( f\left( x \right) = \varphi_{0} \left( x \right) + \varphi_{1} \left( x \right) + \varphi_{2} \left( x \right) + \ldots in\;inf. \) and \( \psi \left( x \right) = \varphi_{0}{'} \left( x \right) + \varphi_{1}{'} \left( x \right) + \varphi_{2}{'} \left( x \right) + \ldots in\;inf. \) be two series convergent at the same rate, whose addends are continuous functions of \( x \). So, let \( f\left( x \right) \) and \( \psi \left( x \right) \) be continuous functions of \( x \) and the terms of the series \( \psi \left( x \right) \) be such that each of them is the derivative of the corresponding addend of the series \( f\left( x \right) \). Then, it should be proved that \( \psi \left( x \right) \) is the derivative of \( f\left( x \right) \).Footnote 24

The historian of mathematics Pierre Dugac considered Weierstrass’s proof to be extraordinary for the time (Dugac 2003, p. 126).Footnote 25 He also claimed that Weierstraß was partly inspired by a letter from Niels Henrik Abel to the Norwegian mathematician Bernt Holmboe dated January 16, 1826 (Dugac 2003, p. 127). In this letter, Abel complained how careless mathematicians use to deal with infinite series and explicitly showed that one cannot always differentiate a series term by term (Holmboe 1839, p. 268).

From 1861 the new concepts of convergence at the same rate were adopted by others, e.g., by Ludwig Wilhelm Thomé who studied in Berlin from the summer of 1863 and who was held in high esteem by his teacher Weierstraß (cf. Biermann 1988, p. 129, and Viertel 2014, p. 113). Thomé possibly knew Weierstrass’s lecture on the differential calculus (1861) from copies that were already circulating. In his work Über die Kettenbruchentwicklung der Gausschen Function \( F\left( {\alpha ,1,y,x} \right) \) (1866), Thomé discussed the differentiability of the series \( 1 + \sum_{r = 1}^{\infty } \frac{{a_{1} a_{2} \ldots a_{r} x^{r} }}{{N_{r} N_{r + 1} }} \), where \( N_{r} \) represent specific values of \( F\left( {\alpha ,1,y,x} \right) \) (p. 333 f.). After substituting with \( x = \frac{4z}{{\left( {1 + z} \right)^{2} }} \), he obtained the unconditional and uniform convergence of the corresponding series \( \left( {1 + z} \right)\sum_{r = \varrho }^{\infty } \frac{{a_{1} a_{2} \ldots a_{r} \left( {4z} \right)^{r} }}{{{{\varPhi }}_{r} \left( z \right){{\varPhi }}_{r + 1} \left( z \right)}} \) by a majorant seriesFootnote 26, which also holds in a very similar way for those series obtained by term by term differentiation (Thomé 1866, p. 334).

Given Thomé’s significance for the reception, one can infer that Weierstraß probably had used such a preliminary stage of the Weierstraß M-test before 1866. Nonetheless, this comparison test for uniform convergence of series was not a part of the applications that Weierstraß was going to introduce in his further lecture courses in Berlin. Finally, Thomé gave a short explanation of the term im gleichen Grade:

Every single ones of these series [converges] at the same rate in the assumed region, which means that the sum of the later terms simultaneously becomes infinitely small everywhere.Footnote 27

The formulation “simultaneously … everywhere” indicates the difference to pointwise convergence, but overall Thomé’s description remains unclear, e.g., the sum of the “later terms” could refer to the sum of the remainder as well as to a finite number of terms in the form of \( s_{m} - s_{n} \). He also referred to the permutability of summation and differentiation (and integration respectively) of uniform convergent series—theorems he certainly adopted from Weierstraß:

But if one differentiates a series converging at the same rate by its terms, then, if one obtains again a series converging at the same rate, the latter is known to be the derivative of the continuous function represented by the original series, according to the theorem on the definite and term by term integration of a series converging at the same rate.Footnote 28

In conclusion, his original series in \( 1 + \sum_{r = 1}^{\infty } \frac{{a_{1} a_{2} \ldots a_{r} x^{r} }}{{N_{r} N_{r + 1} }} \) can be derived term by term. Thomé’s article shows that the new meaning of convergence at the same rate was already a part of mathematical research outside of the “Berlin lectures” in 1866. Despite the introduction of this new notion in mathematical research, Hoppe (1865) referred to the term im gleichen Grade for comparing the speed at which two infinite series converge. Because of his claim to provide a “rigorous introduction” to infinitesimal calculus, one can assume that he was not aware of Weierstrass’s research. Therefore, Hoppe’s Lehrbuch indicates that the concept of uniform convergence was not yet commonly known to authors of textbooks.

5 The Berlin Lectures on analytic and elliptic functions

Weierstrass’s mathematical work was almost only accessible through his lecture courses at the University of Berlin.Footnote 29 His lectures were delivered as a cycle of four lectures courses that contain most of his mathematical work. The currently accessible lecture notes are preserved at different locations such as libraries, mathematical institutes or other archives. Some of these notes are privately owned. With regard to the notion of uniform convergence, the lectures on analytic and elliptic functions are the most relevant. According to the university calendar for 1865–1883, these lectures were held cyclically as follows. Individual lecture courses are highlighted by an asterisk in case that students’ notes are available:

  • Lecture courses on analytic functions: WS 1865/66*, SS 1868*, SS 1870, SS 1872, SS 1874*, SS 1876, SS 1878*, SS 1880, WS 1880/81*, WS 1882/83*, 1886*.

  • Lecture courses on elliptic functions: WS 1866/67, WS 1868/69*, WS 1870/71*, WS 1872/73, WS 1874/75*, WS 1876/77, WS 1878/79, SS 1881, SS 1883.

Kurt Richter (1976, p. 143 ff.) was the first to give a mathematical-historical analysis of some lecture notes in regard to the history of uniform convergence, but he focused solely on the courses on analytic functions from SS 1874, SS 1878, WS 1882/83 and 1886.

In the following sections, relevant lecture notes of both elliptic and analytic functions as well as other selected sources will be discussed to achieve a deeper analysis and understanding of the development of uniform convergence in Weierstrass’s work.Footnote 30

Among the lecture notes used for this article, a rather interesting phenomenon occurs: We have two lecture notes from 1868 and a single script of elliptic functions (WS 1868/69)Footnote 31 from an unknown note-taker, in which uniform convergence is completely absent. The scripts from the 1868 course on analytic functions (summer 1868) come from two different students, Wilhelm Killing and Ludwig Kiepert, who both graduated under Weierstraß in 1870.Footnote 32 The lack of the notion of uniform convergence could be related to Kiepert und Killing not seeing the enormous relevance of this new concept and, therefore, missing it in their copies. In fact, however, I did not find topics in the two scripts that would have needed uniform convergence.

Note that Kiepert’s and Killing’s lecture notes are very different in length and structure, which may be due to the fact that they reworked their original notes to different degrees.

The phenomenon of different scripts from the same course occurs again in 1878: There are scripts by Hurwitz and by Rudio. In the editor’s preface, Adolf Hurwitz’s script of a course on analytic functions is described as an “authentic reproduction”Footnote 33 of the lecture course. And we know that Salvatore Pincherle used a copy of Hurwitz’s notes for his work Saggio di una introduzione alla theoria delle funzioni analytiche secondo i principii di Prof. C. Weierstrass (1880). Surprisingly, Ferdinand Rudio’s script differs significantly from Hurwitz’s version and one might doubt that they come from the same lecture course. But, according to the university calendar, a lecture named Theorie der analytischen Functionen was only held during the summer semester. My research on the Abgangszeugnisse (leaving certificates) of Rudio and Hurwitz in the Berlin University Archive verifies both were admitted to this particular course.

5.1 Moritz Pasch’s script 1865

The earliest original lecture notes discussed here come from Weierstrass’s student Moritz Pasch, and they are dated to 1865. Pasch first studied in his hometown at the University of Breslau, where he graduated in 1865. He then moved to the University of Berlin to study under Leopold Kronecker and Weierstraß; in 1875, he became a full professor in Breslau. In his script on analytic functions, we find both a continuity theorem and a theorem about the integration of infinite series of continuous, real functions, which make use of uniform convergence (see Fig. 2).

Fig. 2
figure 2

Nachlass Moritz Pasch, Signatur I, 19, p. 28v–29r

Note that the parentheses around the term \( \sum \varphi \left( x \right) - \varphi_{1} \left( x \right) - \cdots - \varphi_{n} \left( x \right) \) represent the absolute value. An English translation of Fig. 2 is as follows:

If an infinite series of continuous functions \( \varphi \left( x \right) \) converges for all \( x \) of a continuous region and \( \sum \varphi \left( x \right) \) converges at the same rate for all these \( x \) (that is, if for a given \( \delta \), one finds \( \varphi_{1} \left( x \right),\varphi_{2} \left( x \right), \cdots ,\varphi_{n} \left( x \right) \)such that \( \left( {\sum \varphi \left( x \right) - \varphi_{1} \left( x \right) - \cdots - \varphi_{n} \left( x \right)} \right) < \delta \) holds for every such \( x \)), then \( \sum \varphi \left( x \right) \) is a continuous function \( f\left( x \right) \) and \( \int_{{x_{1} }}^{{x_{2} }} f\left( x \right){\rm d}x = \sum \int_{{x_{1} }}^{{x_{2} }} \varphi \left( x \right){\rm d}x \).

We find analogous statements and a theorem on the term by term differentiation for analytic functions, but without proofs.Footnote 34 This first appearance of a definition of uniform convergence (not uniformly Cauchy) basically says that the absolute value of the sum of the remainder \( s - s_{n} \) should become \( < \delta \) for a\( n = n\left( \delta \right) \), but not explicitly for every number or infinitely many numbers \( \ge n \). Indeed, the existence of only one \( n = n\left( \delta \right) \) is sufficient for a proof of the continuity of \( \sum \varphi \left( x \right) \). However, it remains questionable if Weierstraß meant \( s - s_{n} < \delta \) to hold just for one \( n = n\left( \delta \right) \). We find further examples of this variation in later lecture notes and also with regard to the Cauchyan uniformity condition. Hence, I will not point out and discuss every occurrence.

Furthermore, Pasch’s definition is introduced in parentheses, which could mean that this concept was introduced earlier than in 1865-otherwise one would have to expect a more detailed introduction by Pasch.

Note that the common variants of index notations in today’s mathematics (e.g., setting the index of summation under the summation symbol) were not established yet and, therefore, either completely absent or implemented differently (see Fig. 3).

Fig. 3
figure 3

Weierstrass’s notation for a Laurent series, in Pasch’s script, p. 28v

5.2 Uniform convergence in a script of 1870/71

According to the university calendar, this script of a lecture course on elliptic functions from Schwarz’s Nachlass comes from the WS 1870/71. The author remains unknown. In contrast to the 1865 script by Pasch, Weierstraß explicitly emphasized the relevance of uniform convergence here:

If, however, we have to construct the integral of an infinite sum denoted by \( \sum \varphi \left( t \right) \), we can prove a very general theorem, which holds for most cases of applications [meaning the interchangeability of summation and integration; K.V.]. The conditions under which we can obtain a convergent series by integration are not yet well known.

We first understand by \( \varphi \left( t \right) \) only real functions of a real variable and we have to assume in any case that \( \sum \varphi \left( t \right) \) converges for all \( t \) considered. But this is not sufficient, but the series must converge at the same rate, i.e., it must be possible to separate a certain number of terms of the series such that the sum of the remainder becomes smaller than \( \delta \) for all values considered or, in other words, the sum of the remainder must have the same rate of exactness for all values of \( t \) considered.Footnote 35

This introduction seems to describe convergence at the same rate as in Pasch’s script, though it is mostly verbal and lacks the absolute values: The sum of the remainder (die Summe der übrigen) should become small after separating a “certain number of terms,” meaning \( s - s_{n} < \delta \) for a number \( n = n\left( \delta \right) \). Additionally, this type of convergence is uniquely named convergence with the same rate of exactness,” which is not used again in this manner in later lectures. Weierstraß gave several applications of the concept: The uniform convergence of a series of functions \( \varphi \left( t \right) \) yields the uniform convergence of the sum of integrals \( \int_{\tau }^{{\tau^{\prime}}} \varphi \left( t \right){\rm d}t \), if they exist:

[…] if \( \sum \varphi \left( t \right) \) converges at the same rate for all possible values of \( t \), then the sum of the terms \( \psi = \int_{\tau }^{{\tau^{\prime}}} \varphi \left( t \right){\rm d}t \), whose existence we assume, also converges, since we can prove that a finite number of the terms \( \psi_{1} \ldots \psi_{n} \) can be separated in such a way that the sum of arbitrarily many remainder terms satisfies \( \left| {\psi_{n + 1} + \cdots + \psi_{n + m} } \right| < \delta \), if \( \delta \) denotes a number assumed arbitrarily small. […] Thus, it is shown that if \( \sum \varphi \left( t \right) \) converges at the same rate, the same property holds for \( \sum \psi \left( t \right) \).Footnote 36

Here we have a variation of the Cauchyan uniformity condition, since the inequality \( \left| {s_{n + m} - s_{n} } \right| < \delta \) should hold for a number \( n = n\left( \delta \right) \) and arbitrary values of \( m \). But it seems as if \( \left| {s_{n + m} - s_{n} } \right| < \delta \) is only required to hold for one \( n \), which is sufficient for this theorem.

It was unusual for Weierstraß to specify the exact domains of convergence in a rigorous way, and here he just referred to all values considered. The following theorem gives another example: “If the infinite sum \( \sum f\left( x \right) \) converges at the same rate for all points in the neighbourhood of a certain point, then any definite integration on each term can be performed.”Footnote 37 Functions with complex variables are also considered similarly: “If the term by term differentiation of a series produces an apparently incorrect result, we must conclude that the series does not converge at the same rate for an area in the complex plane.”Footnote 38 In the next application, it is stated that a uniformly convergent series of rational functions is again rational, if only finitely many of these functions have poles in the domain. The uniform convergence should hold in the neighborhood of every point:

If there is only a finite number of functions \( \varphi \left( x \right) \) within this area bounded by the line just mentioned, which become infinitely large for points of this area, then \( \sum \varphi \left( x \right) \) is a function of rational character, provided that the condition of uniform convergence holds in the neighbourhood of every point \( x_{0} \).Footnote 39

He goes on to the following theorem by setting a disk centered at zero as a potential domain of convergence:

If we describe a circle with a finite radius around the zero point, this \( \sum \varphi \left( x \right) \) converges at the same rate in the neighbourhood of all points \( x_{0} \), except those for which \( \varphi \left( x \right) \) becomes infinite, and only a finite number of these terms \( \varphi \) become infinitely large for points within this circle.Footnote 40

It is worth noting that the first of these quotes is the first occurrence of the term gleichmäßig (uniform) in Weierstrass’s lectures, which is also the most common term in German textbooks today. Weierstraß used both formulations im gleichen Grade and gleichmäßig to denote uniform convergence, but there seem to be no differences in their meaning. The first term, originally used by Gudermann, disappears in all subsequent lecture notes on elliptic and analytic functions from 1870 onward and the formulation gleichmäßig became the most used in his lecture courses.

5.3 The development of uniform convergence between 1870 and 1874

The date when Weierstraß changed his terminology from im gleichen Grade to gleichmäßig is still unknown, but I assume it happened before 1870/71. At least, the applications of uniform convergence in the script of the WS 1870/71 course indicate a previous introduction in the course on analytic functions of 1870. Carl Johannes Thomae was the first to refer to gleichmäßige Convergenz in his textbook (1870), says Richter (1976, p. 187 f.). According to the university calendar, Weierstraß also held courses on both analytic and elliptic functions in the summer of 1872 and in the WS 1872/73, but for the terms from 1870/71 to 1873, there are no lectures notes known.

In this case, publications by Weierstrass’s colleagues and students can give further insights. H. A. Schwarz was a student of Weierstraß until 1864, and then he stayed in Berlin until 1867. In 1873, he published an article introducing the series \( \sum_{n} \frac{{[2^{n} x] + \sqrt {2^{n} x - [2^{n} x]} }}{{4^{n} }}, \) which represents a unique and everywhere continuous function, whose differential quotient is unbounded in every interval. In his lecture courses from 1861, Weierstraß had already rejected the general conviction that continuous functions are differentiable. Schwarz pointed out:

In 1861, Mr. Weierstrass gave the correct exposition of the true facts in the lectures on differential and integral calculus, which he held at the Trade Institute, according to which all attempts to generally prove the existence of a derivative for continuous functions of a real variable must be considered unsuccessful without exception.Footnote 41

Furthermore, Weierstraß had read a paper to the Berlin Academy in March 1872, where he gave his famous example of a continuous but nowhere differentiable function.Footnote 42 In order to prove continuity, Schwarz considered the convergence of the mentioned series:

The series converges at the same rate for all possible values of \( x \). Therefore, the property of uniqueness and continuity, which the individual terms of the series possess as functions of \( x \), is also transferred to their sum.Footnote 43

As we see, he referred to uniform convergence, and, as a close colleague of Weierstraß, one would expect him to use the same term for uniform convergence as his former teacher did in his research.Footnote 44Under this assumption, Weierstraß could have referred to convergence at the same rate at least until 1873.

5.4 Georg Hettner’s lecture notes from 1874

The German mathematician Georg Hermann Hettner was born in Jena, studied first in Leipzig and then moved to the University of Berlin. He earned himself a good reputation with Weierstraß and was occasionally granted the honor of writing Weierstrass’s lectures on the blackboard.Footnote 45 With that in mind, his script on analytic functions (1874) can be considered a fairly authentic reproduction of the original course. A novelty of Hettner’s script is the new way of introducing uniform convergence by addressing (what is called today) the double series theorem: “We can now find another criterion for converting a sum of power series into a power series. For this, we must introduce the new concept of uniformly convergent series.”Footnote 46 We take a look at how he defined this notion:

We had assumed that \( \sum f\left( x \right) \) [representing a power series; KV] converges in the neighbourhood of \( a \). Therefore, if we give to \( x \) a particular value of this neighbourhood, it is always possible to separate a finite number of terms in such a way that the sum of the remainder is smaller than an arbitrarily small quantity \( \delta \). If, after assuming a certain, arbitrarily small quantity \( \delta \) for every value of \( x \) in the neighborhood of \( a \) and after separating the same finite number of terms from \( \sum f\left( x \right) \), the sum of the remaining terms is less than δ, then we say \( \sum f\left( x \right) \) is not only convergent in the neighborhood of a, but uniformly convergent.Footnote 47

Weierstraß evidenced here different practices in mathematical rigor: On the one hand, we have a formalization for the upper bound for the sum of the remainder; on the other hand, there is no mathematical formalization used for the said sum such as he had used in previous lectures (see Pasch’s script and the 1870/71 script on elliptic functions). In a modern, condensed form, he said that sum \( s - s_{n} \) should be smaller than \( \delta \) for a \( n = n\left( \delta \right) \), which is similar to the definition found in Pasch’s script, apart from the missing absolute values.

Furthermore, this introduction shares some similarities to a passage in Eduard Heine’s article Ueber trigonometrische Reihen (1870): Heine also explicitly distinguished between (pointwise) convergence and uniform convergence by introducing the latter as an expansion of the concept of convergence:

A series of terms \( g_{1} ,g_{2} ,g_{3} , \cdots g_{m} \) is called convergent, if, for an assignable value \( n \) and for always increasing values of it, the sum \( g_{n} + g_{n + 1} + g_{n + 2} + \cdots + g_{n + m} \) sinks below any arbitrarily given value \( {{\varepsilon }} \) different from zero for every positive number given to \( m \).

If the terms \( g \) depend on a single variable \( x \), which is allowed to pass through all values from \( \alpha \) to \( \beta \) (including \( \alpha ,\beta \)), then, within these limits, the series is called convergent at the same rate, if the criterion of convergence holds for each given \( \varepsilon \), while \( x \) passes through all values, for the same \( n \); one has only to take another value for \( n \) if another \( \varepsilon \) is given.Footnote 48

Heine’s presentation of convergence at the same rate (in a closed interval) describes the Cauchyan uniformity condition in almost the same way as Schwarz referred to it in 1861: In modern terms, the inequality \( s_{n + m} - s_{n - 1} < \varepsilon \) should hold for every \( n \ge N = N\left( \varepsilon \right) \) and every \( m \).

Did he attend to Weierstrass’s lecture courses? Probably not, since he graduated in 1842 and was appointed as a professor in Halle on the September 6, 1856. Nonetheless, as a friend of Weierstraß, he likely knew some of the existing lecture notes. Heine’s publication includes both terms gleichmäßig and in gleichen Grade and Weierstraß most likely used both terms before 1870.Footnote 49

5.5 Two lecture notes on analytic functions from 1878

5.5.1 Rudio’s script

Until 1874, Weierstraß used uniform convergence as a tool to solve problems in the context of infinite series of continuous functions, originating from the continuity theorems. The lecture courses on analytic functions (1878)—and the notes taken by Ferdinand Rudio and Adolf Hurwitz—show an increasing focus on uniform convergence as a more original concept.

Born in Wiesbaden, Germany, Rudio first studied mathematics and physics at the Federal Polytechnic College of Zürich (Polytechnikum Zürich) before he moved to Berlin in 1877. Three years later, he finished his doctorate. In this period, he attended the seminars of Weierstraß. He then returned to the Polytechnikum, and in 1889, he became a full professor of mathematics. Due to his contributions, he was awarded an honorary doctorate by the University of Zürich in 1919. The notion of uniform convergence takes up a more important part in his script than in previous lecture notes. It has become a subject of its own as it is introduced in a section titled “Erweiterung des Functionenbegriffs. Gleichmäßige Konvergenz.”Footnote 50 The following definition, specified for functions with several variables, shows a new level of contextualization and, as well, a possible misunderstanding by Rudio:

We define this important notion as follows: the sum of infinitely many rational functions of the variables \( x_{1} ,x_{2} , \ldots ,x_{n} \) is uniformly convergent for a specific area of these variables, if, after assuming an arbitrarily small quantity \( \delta \), one can find the same finite number \( n \) for all systems of values \( x_{1} ,x_{2} , \ldots ,x_{n} \) from the given area, such that \( \left| {s - s_{n} } \right| < \delta \). At the same time, this definition gives a justification of the word uniformly, [since] it should be possible to separate the same number of terms for all systems of values \( x_{1} ,x_{2} , \ldots ,x_{n} \) from the given area, such that the error [\( \left| {s - s_{n} } \right| \), KV] does not exceed a given limit \( \delta \). First, one realizes that uniform convergence yields unconditional convergence. In fact, the concept of uniform convergence is more general. Therefore, in order to justify the assertion just made, we have yet to prove that uniform convergence also implies continuity (author’s emphasis).Footnote 51

He claimed that uniform convergence implies unconditional (unbedingte) convergence (which is generally wrong) and added that the former is the more general concept of the two. I did not find any indications in other lecture notes that this could represent Weierstrass’s conception of both notions of convergence. We find a second definition of uniform convergence in the section titled “Potenzreihen und die Criterien ihrer Convergenz”:

If we assume the summation of infinitely many rational functions with several variables, we call the resulting series uniformly convergent, if, after assuming an arbitrarily small quantity \( \delta \), one can find a number \( n \) for all combinations of values \( x_{1} ,x_{2} , \ldots ,x_{n} \) from a given domain of \( 2n \)th dimension, such that \( \left| {s - s_{n} } \right| < \delta \) (author’s emphasis).Footnote 52

In both introductions, Weierstraß emphasized the independence of \( n \) over all values of \( x \)—without an indication on the variable concerned, i.e., \( s_{n} \left( x \right) \)).Footnote 53 This could not be found in earlier lecture notes. And we also see an increase in mathematical rigor due to a consistent use of symbolizations like \( \delta \) as well as the sign of absolute value, but we also find Weierstraß picked up the Cauchy-influenced denotations \( s,s_{n} \) for (partial) sums. These more general notations helped making the concept of uniform convergence applicable to products and, at some point, to sequences of functions. Its increasing integration into the subject matter can be seen in the theorems that are based on uniform convergence: We find the double series theorem (Nachlass Schwarz, No. 464, p. 284), a theorem about the term by term differentiation of sums of infinitely many power series (p. 291), and a false statement about the uniform convergence of power series (p. 205):

So, \( \sum_{n = 0}^{n = \infty } a_{n} x^{n} \) is unconditionally convergent for all \( x \), whose absolute value is smaller than the absolute value of \( x_{0} \). But then the power series is also uniformly convergent in this range […].Footnote 54

The last sentence is formulated imprecisely, since unconditional convergence does not imply uniform convergence for the same set of values. This statement could also be the result of Rudio misunderstanding Weierstraß.

Furthermore, we find an introduction of uniform convergence for infinite products, specifically for products of power series that was probably motivated by the double series theorem:

Let us now examine under which condition an infinite product of power series represents an analytic function. Let \( \varphi \left( x \right) = \prod_{\lambda } \varphi_{\lambda } \left( x \right) \). If we set \( \varphi_{\lambda } = 1 + \psi_{\lambda } \), then \( \prod_{\lambda } \left( {1 + \psi_{\lambda } } \right) \) can be replaced by the series

$$ \left| {\begin{array}{*{20}l} {1 } \\ {\psi_{0} } \\ {\begin{array}{*{20}l} {\psi_{2} + \psi_{1} \psi_{2} } \\ {\begin{array}{*{20}l} {\psi_{3} + \psi_{1} \psi_{3} + \psi_{2} \psi_{3} + \psi_{1} \psi_{2} \psi_{3} } \\ { \ldots \ldots \ldots \ldots \ldots \ldots \ldots \ldots \ldots \ldots \ldots .. \ldots \ldots .} \\ \end{array} } \\ \end{array} } \\ \end{array} } \right.. $$

The uniform convergence of the product will then be defined by the uniform convergence of this series. The uniform convergence of the product is expressed by \( p - p_{n} < \delta \) and this is also the condition of uniform convergence for the series that defines the infinite product.Footnote 55

One sees the similarities to the definition of uniform convergence for infinite series on p. 201 f.: He chose the notations \( p \), \( p_{n} \) in dependence of Cauchy’s original notations \( s \), \( s_{n} \) for infinite series and sums.

5.5.2 Hurwitz’s script

Adolf Hurwitz, born in Hildesheim, Germany, shortly attended the University of Munich in 1877 before he went to Berlin for 1877–1878 to study under Kummer, Kronecker and Weierstraß. He returned to Munich in 1879 and followed his teacher Felix Klein to the University of Leipzig one year later. After his Habilitation in 1882 and further studies in Berlin in 1881–1882, he accepted an appointment as an extraordinary Professor at Königsberg in 1884. During the following years, David Hilbert and Hermann Minkowski became his students, and they both became good friends of Hurwitz. In 1892, Hurwitz was appointed to the Eidgenössisches Polytechnikum Zürich where he stayed until 1919.

During his first Berlin visit in 1877–1878, he attended Weierstrass’s lecture on Theorie der analytischen Functionen. The following quotation from Hurwitz’s script shows the introduction of uniform convergence, which can easily be converted to functions with several variables, as Weierstraß mentionedFootnote 56:

[we assume] that the series \( f_{1} \left( x \right), f_{2} \left( x \right), \cdots \) converges uniformly for all values of \( x \), for which it has a sum at all. We know that if the series converges for \( x = x_{0} \), a limit \( \nu \) for \( n \) can be determined such that for every \( n > v \) \( \left| {s_{0} - \left( {f_{1} \left( {x_{0} } \right) + f_{2} \left( {x_{0} } \right) + \cdots f_{n} \left( {x_{0} } \right)} \right)} \right| < \delta \) holds, where \( s_{0} \) is the sum of the series for \( x = x_{0} \) and \( {{\delta }} \) denotes an arbitrarily small quantity. Now, if a limit \( N \) can be set for \( n \) such that for every \( n > N \) \( \left| {s - s_{n} } \right| < \delta \) holds for all values of the variable included in the domain of validity of the series \( f_{1} \left( x \right) + f_{2} \left( x \right) + \cdots \), then this latter series is called uniformly convergent.Footnote 57

As we already saw in Rudio’s script, Weierstraß uses a Cauchy-influenced notation for series and sums here. The term \( s_{0} \), which denotes the value of \( s \) for \( x = x_{0} \) in the context of uniform convergence, only appears in the context of uniform convergence in this script. Obviously, this resembles Cauchy’s notation \( s \), but modified in order to indicate the variable, which may not be confused with the general meaning of \( s_{n} \). It seems as if Weierstraß did not adopt a symbolic notation for a function represented by an infinite sum and with an indication on the variable. However, we find such a notation for a function represented by an infinite product, i.e., \( \varphi \left( x \right) = \prod_{\lambda } \varphi_{\lambda } \left( x \right) \) (Nachlass Schwarz, No. 464, p. 291 f.). But most importantly, the above definition of uniform convergence represents one of the most rigorous, since, for the first time, it is required that the sum of the remainder should become \( < \delta \) for all \( n \) greater than \( N = N\left( \delta \right) \). In both of Rudio’s introductions of uniform convergence, it is only said that “one can find a number \( n \).”

As it was already pointed out, multiple scripts of one lecture course increase the reliability of what Weierstraß actually said when he lectured. In terms of Cauchy-influenced notations, it is most likely that Weierstraß really referred to them in his 1878 course, since the inequality \( \left| {s - s_{n} } \right| < \delta \) can be found in both Rudio’s and Hurwitz’s scripts. However, he simultaneously used the alternative notation \( \left| {s_{0} - \left( {f_{1} \left( {x_{0} } \right) + f_{2} \left( {x_{0} } \right) + \cdots f_{n} \left( {x_{0} } \right)} \right)} \right| < \delta \) in Hurwitz’s notes (Weierstraß 1878, pp. 59–60). Weierstraß gave another short explanation: “The condition can also be expressed as: \( \left| {f_{n + 1} \left( x \right) + f_{n + 2} \left( x \right) + \cdots in\;inf.} \right| < \delta \) holds for a sufficiently large number \( n \) and for each value of \( x \) within the area of validity and convergence of the function.”Footnote 58 Again, from today’s point of view, it is not sufficient for uniform convergence to require \( \left| {f_{n + 1} \left( x \right) + f_{n + 2} \left( x \right) + \cdots in\;inf.} \right| < \delta \) for a sufficiently large \( n \). Weierstraß, or the note-taker, did not seem to prioritize between the different definitions on pp. 59–60, although they differ in terms of mathematical rigor and content. One might conclude that the note-taker considered both definitions as equivalent or he did not pay much attention to the exactness of the verbal form. But then, the continuity theorem still holds when the sum of the remainder becomes arbitrarily small for a sufficiently large number \( n = n\left( \delta \right) \). Nonetheless, these definitions are novelties in the cycle of lectures on analytic and elliptic functions and they show a further development of the concept of uniform convergence in terms of rigor.

In the section titled “Unendliche Summen und Produkte analytischer Funktionen“, Weierstraß explained uniform convergence both for power series and analytic functions:

Here is something to be said about uniform convergence: A power series of \( x \) or a series of infinitely many arithmetic expressions of \( x \) is called uniformly convergent, if, for an arbitrarily small quantity \( \delta \), a number N can be determined such that for \( n \ge N \) \( \left| {s - s_{n} } \right| < \delta \) holds for every value of \( x \), for which the series has a finite value.Footnote 59

It is stated that if a power series is uniformly convergent, then this should hold for every value in the area of convergence. This seems to be a misunderstanding of the note-taker Hurwitz, since we only get uniform convergence in every closed and bounded subset of the disc of convergence. Weierstraß had already mentioned power series that converge only in a certain subset of their domain, e.g., here:

A series \( P\left( x \right) \) defining an analytic function should be called uniformly convergent in the neighbourhood \( \delta \) of \( x_{0} \), if \( P\left( {x{\mid }x_{0} } \right) \) is uniformly convergent for \( \left| {x - x_{0} } \right| < \delta \).Footnote 60

For other theorems, we find the continuity theorem (“we now want to show that every function \( \varphi \left( x \right) = f_{0} \left( x \right) + f_{1} \left( x \right) + \ldots \), which converges uniformly, changes continuously with its variable”Footnote 61) and a theorem stating that a series of analytic functions is again analytic (Weierstraß 1878, p. 113 f.).

Similar to Rudio’s script, Weierstraß also dealt with infinite products of power series denoted as \( \prod_{\lambda = 0}^{\infty } \varphi_{\lambda } \left( x \right) \) and the requirements under which they represent analytic functions. These remarks on power series and their convergence are very similar to the respective passages in Rudio’s script, as one would expect of two scripts of the same lecture course. Despite the similarities, uniform convergence of products is introduced slightly more rigorously here due to the use of symbolic notations and absolute value signs. Let \( \sum \psi_{\lambda } \) be a uniform convergent sum of power series denoted as \( \psi_{\lambda } \), and let \( \left( {1 + \psi_{0} } \right)\left( {1 + \psi_{1} } \right)\left( {1 + \psi_{2} } \right) \ldots \) be an infinite product that can also be written as \( 1 + \psi_{0} + \psi_{1} + \psi_{0} \psi_{1} + \ldots \). Then, Weierstraß added:

We give the definition of uniform convergence of a product \( p \) in such a way that \( \left| {p - p_{n} } \right| < \delta \) should hold, if \( \delta \) is an arbitrarily small quantity and \( p_{n} \) is the product of the first \( n \) terms of the infinite product. In fact, the number \( n \) must be determined by \( \delta \) such that \( \left| {p - p_{n} } \right| < \delta \) holds for every value of the variable, for which \( p \) has a value.Footnote 62

In this explanation, uniform convergence for infinite products is stretched to the whole domain of convergence, which points to a mistake made by Hurwitz. The denotations \( p \), \( p_{n} \) have the same meaning as in Rudio’s script, and it is claimed that \( \left| {p - p_{n} } \right| < \delta \) holds for every value of \( x \), while the number \( n \) depends on the quantity \( \delta \). As in Rudio’s script, it remains unclear for how many numbers \( n \) \( \left| {p - p_{n} } \right| < \delta \) should apply.

Hurwitz’s and Rudio’s introductions of uniform convergence of infinite products are clearly motivated by the double series theorem, and we will see a more original approach to convergent products in Kneser’s script.

5.6 Weierstrass’s lectures and research between 1880 and 1886

Weierstrass’s state of health deteriorated over the years, so that he was forced to cancel all his lecture courses between 1883/1884–1885/1886 and in 1887/1888. Except for a few dates in January 1889 this was the end of his lectures at the University of Berlin (Biermann 1988, p. 140). This means that his lecture courses on analytic and elliptic functions from the early 1880s are particularly interesting in revealing developments of the notion of uniform convergence.

5.6.1 Adolf Kneser’s script of a lecture on analytic functions

Adolf Kneser studied mathematics in Heidelberg, Rostock and Berlin, where he also obtained his doctorate under Leopold Kronecker and Ernst E. Kummer in 1884. The certificate of matriculation from the archives of the University of Berlin reveals that he also attended Weierstrass’s lectures and, particularly, the course on analytic functions (WS 1880/81).Footnote 63 Weierstrass’s introduction of uniform convergence is given as follows in Kneser‘s lecture notes:

A series converges uniformly within a certain domain \( B \) of \( x \), if it is always possible, after assuming an arbitrarily small quantity \( \varepsilon \), to separate a number of \( n \) first terms such that the absolute value of the sum of the remainder always becomes \( < \varepsilon \) for all \( x \) within \( B \).Footnote 64

On the one hand, this is the first occurrence of the term gleichförmig (uniform) in the cycle of lectures. On the other hand, it is the first time that the domain where convergence applies is designated by a letter. Furthermore, this definition again shows that rigor and formalization did not continuously increase in his lecture courses: Weierstraß used a purely verbal formulation of the sum of the remainder instead of a symbolic notation (e.g., \( \left| {s - s_{n} } \right| \)) as introduced in Hurwitz’s (SS 1878), Hettner’s (SS 1874) and Pasch’s notes (WS 1865/66). Even more, rigor and formalization were handled differently even in the same course, which is shown by the following alternative and more rigorous notation for the sum of the remainder: “From the assumption of the uniform convergence of \( F\left( x \right) \) [which equals \( \sum_{v = 0}^{\infty } f_{v} \left( x \right) \); K.V.], it follows that \( \left| {\sum_{v = m}^{\infty } f_{v} \left( x \right)} \right| < \delta \) holds for a certain number \( n \), as soon as \( m \ge n \).”Footnote 65 This definition is basically saying that \( \left| {s - s_{m - 1} } \right| < \delta \) should hold for (maybe) any number \( m \ge n \), instead of just for \( n \) as it was said in former lecture notes. Compared to previous notations such as \( s - s_{n} \), \( \sum \varphi \left( x \right) - \varphi_{1} \left( x \right) - \ldots - \varphi_{n} \left( x \right) \) (Pasch’s script, WS 1865/1866), \( \psi_{n + 1} + \ldots + \psi_{n + m} \) (elliptic functions, WS 1870/1871), and \( s_{0} - \left( {f_{1} \left( {x_{0} } \right) + f_{2} \left( {x_{0} } \right) + \cdots f_{n} \left( {x_{0} } \right)} \right) \) (Hurwitz’s script, SS 1878), this is another example of further refinements of the notation for uniform convergent series. Concerning the index-based sum notation used here, this was found earlier in the WS 1868/1869 script on elliptic functions (see Fig. 4).Footnote 66

Fig. 4
figure 4

Weierstrass’s notation for an infinite series. Nachlass Schwarz, No. 470, p. 71

We find the following theorems using uniform convergence in Kneser’s script:

  • A continuity theorem, in which the continuity of the original functions in the series is not explicitly claimed: “If a series converges uniformly within a certain domain, then its value changes continuously in this domain-the series is \( \sum_{v = 0}^{\infty } f_{v} \left( x \right) = F\left( x \right) \)Footnote 67,

  • a theorem stating that power series converge on the boundary of a specific disk: “Now let \( F = \sum_{\lambda = 0}^{\infty } A_{\lambda } x^{\lambda } \) be a power series convergent for all \( \left| x \right| < r \); if \( \left| \rho \right| < r \), the series converges uniformly for all \( \left| x \right| = \rho \)“,Footnote 68

  • a theorem on determining the product of two uniformly convergent power series,Footnote 69 and

  • a statement on the convergence of power series with several variables: “If the absolute values of all terms of a power series with several variables [designated as \( \sum a_{\lambda \mu \nu \ldots } x^{\lambda } y^{\mu } z^{\nu } \ldots \); K.V.] are below a certain positive quantity \( g \) for \( x = x_{0} \), \( y = y_{0} \), \( z = z_{0} \) … which are all not equal to zero, then the series converges for every system of values that satisfies \( \left| x \right| < \left| {x_{0} } \right| \), \( \left| y \right| < \left| {y_{0} } \right| \ldots \) . If we have any quantities \( h < \left| {x_{0} } \right| \), \( k < \left| {y_{0} } \right| \), \( l < \left| {z_{0} } \right| \ldots \) and if we require \( \left| x \right| < h \), \( \left| y \right| < k \), \( \left| z \right| < l \ldots \), then the series converges uniformly in the domain defined thereby.”Footnote 70

Kneser’s script also shows a more formalized introduction of uniform convergence for infinite products compared to both Hurwitz’s and Rudio’s introductions. That is, infinite products are not solely introduced as representations of their respective series:

If the infinite product \( F\left( x \right) = \prod_{v = 0}^{\infty } f_{v} \left( x \right) \) has a finite value, then we have

$$ \prod_{v = n}^{\infty } f_{v} \left( x \right) = 1 + q_{n} , $$

in which \( q_{n} \) becomes infinitely small for \( n = \infty \). If we have \( q_{n} < \varepsilon \) for all \( x \) satisfying the condition \( x - x_{0} < \delta \) as soon as \( n \) has reached a certain value, given an arbitrarily small quantity \( \varepsilon \), then one says that the product converges uniformly in the neighbourhood \( \delta \) of \( x_{0} \).Footnote 71

Compared to 1878, this introduction of a uniformly convergent product does not require its corresponding series. It also seems to be clearer to the reader that \( q_{n} \) is required to become small for any number greater than or equal to a certain value.

5.6.2 Georg Thieme’s script of a lecture on analytic functions from WS 1882/1883

The last lecture notes from the regular lecture courses of analytic and elliptic functions available are preserved in Schwarz’s Nachlass and are titled “Einleitung in die Theorie der analytischen Functionen. Vorlesungen des Professor Dr. Weierstrass Berlin W. S. 1882–83.” The name of the author on the front page reads “G. Thieme,” but there was no student of Weierstraß named Thieme at this time. Hence he had bought these notes. Siegmund-Schultze (1988) points out that G. Thieme was Georg Thieme a professor at the college for mining engineers in St. Petersburg.Footnote 72 Thieme also owned a transcription of a lecture course on Ausgewählte Kapitel aus der Funktionenlehre (1886). In Thieme’s manuscript, uniform convergence is introduced as a sufficient condition for the continuity theorem. It is asked, “which conditions must be satisfied for \( \sum f\left( x \right) \) to be a continuous function. So far, the proof could only be provided in the case that the series is uniformly convergentFootnote 73. For this purpose, he first recapped common convergence before explaining the property of uniform convergence for a sum of rational functions:

One has a series with a single variable. Let \( x \) be a certain value of the same variable in the range of convergence […]. One then knows that the absolute value of \( S - S_{n} \) becomes smaller than an arbitrarily small quantity \( \delta \) as soon as \( n \) reaches or exceeds a certain limit \( m \). This applies to every single value within the range of convergence. We now say that a sum of rational functions of \( x \) converges uniformly within a certain domain, if, after assuming an arbitrarily small quantity \( \delta \), it is possible to determine a number \( n \) such that \( \left| {S - S_{n} } \right| < \delta \) holds for all \( x \) within the domain considered.Footnote 74

The difference between uniform and pointwise convergence is clearly stressed here with the use of the formulation “\( \left| {S - S_{n} } \right| < \delta \) holds for all \( x \).” In another introduction, the range of convergence is specified by the neighborhood of a fixed value:

Let \( a \) be a certain value for which the series converges. We then say that the series converges uniformly in the neighbourhood of \( a \), if a positive quantity \( \delta \) can be determined such that for all \( x \) satisfying the condition \( \left| {x - a} \right| \le \delta \), the series converges uniformly.Footnote 75

And for infinite series, Weierstraß stated that uniform convergence in a neighborhood of a given value \( a \) holds, if there exists a \( \delta > 0 \) such that the series converges uniformly for all \( \left| {x - a} \right| \le \delta \).Footnote 76

Leaving a Cauchy-like notation for series behind (perhaps to avoid redundancy), Weierstraß introduced uniform convergence in the neighborhood of a fixed value also for functions of multiple variables with a particularly high level of mathematical rigor as compared to some earlier scripts:

We consider the [following] infinite series:

$$ F\left( {x x_{1} x_{2} .. x_{\tau } } \right) = \sum_{\gamma = 0}^{\infty } f_{\gamma } \left( {x x_{1} , .. x_{\tau } } \right) $$

Let \( f \) be rational functions of the given variables. If one arbitrarily assumes a point \( a a_{1} .. \) from the domain of the given variables, then it is said that the series converges uniformly in the neighbourhood of \( a a_{1} .. \), if a quantity \( \delta \) can be determined such that for every system \( x x_{1} .. \) satisfying the condition

$$ \left| {x - a} \right| < \delta , \left| {x_{1} - a_{1} } \right| < \delta , \ldots $$

the absolute value of the difference between the sum of the series and the sum of the \( n \) first terms is smaller than the arbitrarily small quantity \( \varepsilon \), as soon as \( n \) reaches or exceeds a certain limit \( m \).Footnote 77

So, the concept is introduced here with a level of rigor comparable to Kneser’s script, since we have \( \left| {s - s_{n} } \right| < \varepsilon \) should hold for numbers \( n \) which are \( \ge m = m\left( \varepsilon \right) \).

5.6.3 Weierstraß’s supplementary lecture from 1886

Because of Weierstraß’s steadily declining state of health from the second half of the 1880s, a regular continuation of his lecture courses was no longer possible and his retirement became imminent. Therefore, we close this overview with a discussion of a script from his final lecture course on the subject of analytic functions, entitled Ausgewählte Kapitel aus der Functionenlehre (1886).

Weierstraß planned this course as a singular follow-up to his lecture course on analytic functions (WS 1884/85), and it was held between the 25th of May and 3rd of August 1886. Unfortunately, the author of these notes remains unknown.Footnote 78 Compared to other lecture notes discussed here, the 1886 script provides a first step toward a more general concept of uniform convergence, not only for infinite series (and products), but for sequences. It is introduced (quite uniquely) as “uniform approximation” and applied to a certain sequence \( F\left( {x,k} \right) \) that converges to \( f\left( x \right) \) for \( k \to 0 \).Footnote 79 He said

if we set two finite limits \( a \) and \( b \) for \( x \), our theorem says that after assuming an arbitrarily small quantity \( \varepsilon \), we can set an upper limit \( \overline{k} \) for \( k \) in such a way that for any given value of \( x \) within the mentioned interval, we obtain \( \left| {F\left( {x,k} \right) - f\left( x \right)} \right| < \varepsilon \). In this case, one says that the approximation is uniform for every value of \( x \) within the finite interval \( \left( {a \ldots b} \right) \).Footnote 80

It is not specified whether the limits of the intervals noted as \( \left( {a\ldots b} \right) \) are included or not, but the editor Siegmund-Schultze suggests interpreting them here (and in Weierstrass 1885) as closed intervals.

After Thomé’s introduction of a preliminary stage of Weierstrass’s M-test in 1866, we see Weierstraß himself referring to this important comparison test in this 1886 lecture course, again for functions with several variables: One says that \( \sum_{\lambda = 1}^{\infty } f_{\lambda } \) is uniformly convergent if there is a convergent series of positive values \( \sum_{m = 1}^{\infty } k_{m} \) such that \( \left| {f_{m} } \right| < k_{m} \) holds for all \( m \) larger than an assignable number (Weierstraß 1886a, p. 107).

6 Conclusions

6.1 Different formulations of uniform convergence

From the beginning of Weierstrass’s lectures at the University of Berlin, we find the formulation im gleichen Grad (at the same rate) for his concept of uniform convergence. This term was first used by his teacher Christoph Gudermann (1844) (and again in Reinhold Hoppe’s 1865 textbook) in order to refer to the rate of convergence. Neither Gudermann nor Hoppe introduced a new concept with this term or, at least, they did not unfold it in their works. This indicates that both authors referred to the common concept of convergence.

It is widely believed today that Weierstraß adopted the notion of uniform convergence from Gudermann. But he just adopted the wording im gleichen Grad for a new concept of convergence he was going to develop throughout his lecture courses. It is also believed that Weierstraß had fully developed the concept of uniform convergence by 1841, but the main point overlooked here is the later revision of the corresponding article. The introduction of uniform convergence and maybe the theorems stated by Weierstraß in the original paper from 1841 were clearly revised (see Sect. 2.2). However, he had referred to convergence at the same rate at least since his course on Differentialrechnung (1861; see Sect. 4) and he might have introduced the concept some years before.

In the lecture notes on analytic and elliptic functions, three different phrasings for uniform convergence were used, which are, in the order of their emergence, im gleichen Grade, gleichmäßig and gleichförmig (uniform). The term im gleichen Grade was the only formulation he used for uniform convergence until his 1870/71 course on elliptic functions. From a modern, more rigorous perspective, most of these definitions of im gleichen Grade describe the Cauchyan uniformity condition, which is, for the functions Weierstraß considered, equivalent to uniform convergence. The corresponding definitions found here can be summarized as follows: The sum \( s_{n + m} - s_{n} \) (A) or \( \left| {s_{n + m} - s_{n} } \right| \) (B) should become \( < \delta \) for a certain number \( n \) (1) or for every number \( n \ge N = N\left( \delta \right) \) (2) and, in every definition found, for every number \( m \) (\( m > 0) \). We find different combinations of (A), (B) and (1), (2) as definitions of what is the Cauchyan uniformity condition. None of the note-takers and other authors mentioned a possible distinction between convergence in the sense of the uniform Cauchy condition and uniform convergence.

In the 1870/1871 course, Weierstrass first used the term gleichmässig for uniform convergence, which then appeared permanently until 1886. This formulation emerged simultaneously with new theorems about uniformly convergent series of functions with complex variables in the 1870/1871 script on elliptic functions. Its emergence also seems to correspond with the introduction of the absolute sign to the definition of uniform convergence. After then, the original term, im gleichen Grade, completely disappeared from both lecture cycles and so did the uniform Cauchy condition. The third formulation, gleichförmige Konvergenz, was used in 1880/1881–1886. As Richter emphasizes (1976, p. 159), these terms do not refer to different meanings of uniform convergence, although Siegmund-Schultze claimed that there are differences in terms of real and complex functions in Weierstrass’s 1886-lecture (Siegmund-Schultze 1988, p. 18) by shortly referring to a paper by Medvedev (1974).Footnote 81 From my understanding, Weierstraß did not distinguish conceptually between the two terms gleichförmig and gleichmäßig.

The definitions of uniform convergence (excluding the Cauchyan uniformity condition) are basically of the following types: The remainder \( s - s_{n} \) (A) or \( \left| {s - s_{n} } \right| \) (B) should become \( < \delta \) for a number \( n = n\left( \delta \right) \) (1) or for every number \( n \ge N = N\left( \delta \right) \) (2). Throughout the lecture notes, we find all combinations between (A), (B) and (1), (2). This shows that the development of the concept in terms of rigor did not steadily increase and, generally, introductions or short explanations of uniform convergence often varied between verbal forms and symbolic notations.

6.2 Theorems on uniform convergence

The first applications found in the lecture courses at the Gewerbeinstitut include the original questions that initially raised the awareness for new notions of convergence: We find the continuity theorem (the sum of infinitely many continuous functions is again continuous) here and in many lecture notes until 1882/1883. Also, with few exceptions, we find theorems about the permutability of summation and differentiation (or integration), which first occurred in the 1861 course at the Gewerbeinstitut and which were repeatedly introduced in other lecture notes until 1886. In the 1874 course, another recurrent theorem was added, which we now call the double series theorem of Weierstraß. It can also be found in both scripts of the 1878 course and, prominently, in the article Zur Theorie der Potenzreihen. Note that the continuity theorem as well as the theorem on permutability of summation and integration can still hold if \( \left| {s - s_{n} } \right| < \delta \) is only satisfied for a number \( n = n\left( \delta \right) \). This suggests that the definitions of (uniform) convergence were primarily customized for their application in certain theorems.

In terms of criteria for uniform convergence, we find the so-called Weierstraß M-test, which is first mentioned in his article Definition analytischer Functionen einer Veränderlichen vermittelst algebraischer Differentialgleichungen (Weierstraß 1842, p. 81). Since this paper was revised for a publication in 1894, we cannot conclude whether he already referred to the M-test in the 1840s. A second reference can be found in Über die analytische Darstellbarkeit sogenannter willkürlicher Functionen einer Veränderlichen (1885)Footnote 82, which shows that the M-test was not new in 1885 since Weierstraß did not explicitly introduce it to the reader. According to Reinhold Remmert (1984, 74), Weierstraß basically introduced the idea of this criterion in his article Zur Functionenlehre (1880):

This condition [of uniform convergence; K.V.] is certainly met if there is a series of certain positive quantities \( g_{1} ,g_{2} ,g_{3} , \ldots \) for which we can state that for each point in the domain \( B \) we have

$$ \left| {f_{v} } \right| \le g_{v} , \left( {v = 0, \ldots \infty } \right) $$

and the sum

$$ \sum_{v = 0}^{\infty } g_{v} $$

has a finite value.Footnote 83

This introduction comes as a footnote, which suggests that Weierstraß had already referred to the M-test in earlier works. But, apart from the supplementary lecture held in 1886, it was not used in the other lecture notes discussed here. Thomé used a preliminary version of this criterion in 1866 and he probably learned about it in a script from Weierstraß.

How did Weierstraß himself reflect on the development and reception of uniform convergence? In his role as a co-founder of the concept of uniform convergence, he commented on the reception in France in a letter to H. A. Schwarz from 1881 as follows:

During the Easter holidays, I will publish a new revised and enlarged edition of my memoir on the unique functions, which I have been asked to do by many colleagues, including from France. In the case of the French, my last memoir has received more attention than it actually deserves [cf. Weierstraß 1876; K.V.]. It seems that the importance of the concept of uniform convergence has finally been understood.Footnote 84

The level of reception in textbooks was reflected by Camille Jordan and the substantially reworked edition of his Cours d’analyse de l’École Polytechnique. In his Calcul différentiel, he utilized the concept of uniform convergence in order to develop the theorems on differentiation and integration of uniform convergent series that Weierstraß introduced in his lecture courses. He also referred to Weierstrass’s example of a continuous but nowhere differentiable function (Jordan 1893, pp. 310 ff.). For a detailed overview of contemporary developments and reception of Weierstrass’s concept of uniform convergence, see Viertel (2014, chapter 7).

7 Sources

Universitätsbibliothek Bielefeld, Sonderaufstellung QC

  • QC081+QC069 W418, Mitschrift der Vorlesung von K. Weierstraß: Ausgewählte Kapitel aus der Funktionenlehre. Unbekannter Autor. Sommer-Semester 1886, 1886.

Archiv der Berlin-Brandenburgischen Akademie der Wissenschaften, Berlin [BBAW]

  • Nachlass H. A. Schwarz

    • 453, G. Schmidt, Mitschrift der Vorlesung von K. Weierstraß: Differential- und Integralrechnung. 1859. 1 H., 35 B.

      454, Theiler, Mitschrift der Vorlesung von K. Weierstraß (?): Differentialrechnung. 1868, 1869. 3 H. (Nr. II, III, IV).Footnote 85

      463 L. Kiepert, Mitschrift der Vorlesung von K. Weierstraß: Analytische Funktionen. 1868

      464, F. Rudio, Mitschrift der Vorlesung von K. Weierstraß: Analytische Funktionen. 1878.

      465, G. Thieme, Mitschrift der Vorlesung von K. Weierstraß: Analytische Funktionen. 1882/83.

      466, Mitschrift der Vorlesung von K. Weierstraß: Analytische Funktionen. Ohne Datum, Schreiber unbekannt.

      470, Mitschrift der Vorlesung von K. Weierstraß: Elliptische Funktionen. 1868/69. Schreiber unbekannt.

      471, Mitschrift der Vorlesung von K. Weierstraß: Elliptische Funktionen. 1870. Schreiber unbekannt.

      1175, Brief von Weierstraß an H. A. Schwarz vom 6. März 1881.

Göttingen State and University Library, HANS (Handschriften, Autographen, Nachlässe, Sonderbestände)

  • Nachlass Adolf Kneser, Signatur Cod. Ms. A. Kneser B 3, K. Weierstrass, Adolf Kneser [Schreiber]: Vorlesungsnachschrift: Einleitung in die Theorie der analytischen Functionen. Berlin, WS 1880/81.—Ms.; 3, 97 Bl. (geh.)

University Library Gießen, Sondersammlungen

  • Nachlass Moritz Pasch, Signatur I, 19: Nachschrift einer Weierstraß-Vorlesung v. Moritz Pasch. Berlin, 1865/66.

Humboldt-Universität zu Berlin, Universitätsarchiv, Rektor & Senat

  • Gedruckte Vorlesungsverzeichnisse der philosophischen Fakultät von 1865 bis 1883.

Institut Mittag-Leffler, Section Lecture Notes [IML]

  • H. A. Schwarz, Vorlesungsmitschrift von K. Weierstraß: Differentialrechnung. SS 1861.

University Library Bielefeld [UB]

  • Signatur QA069 W418: Einleitung in die Theorieen der analytischen Functionen / von Prof. Dr. Weierstrass; nach den Vorlesungen im S.S. 1874 ausgearbeitet von G. Hettner. Handschriflich vervielfältigt.