Boute

SANDBOX -- IN PREPARATION -- WILL BE ADDED TO THE MAIN WIKI PAGES WHEN READY

Towards a coherent article about functions (mathematics)

Rationale The following outline is a suggestion for prospective editors. For various reasons (stability of edits and links to related articles) I prefer not doing any editing myself, but present these notes as a resource for others. The basic rationale is that the article should center on helping the readers rather than on the personal preferences of the writers/editors. Therefore I will not start with my own preferred definition but with a reader-oriented one with the following characteristics: (a) utmost simplicity; (b) enhancing clarity by adhering to the principle of separation of concerns, in this case separating the concept of function pure and simple from characterizing a function as being from $X$ to $Y$ ; (c) the most general one in view of its algebraic properties, especially around composition; (d) prevalent in basic university/college textbooks in mathematics; (e) a convenient logical basis for explaining/understanding/comparing other variants. It is fortunate that all these properties happen to coincide. Also fortunate is that in the current literature there are essentially only two variants, simply distinguished by whether or not the notion of a codomain plays any role, so covering both remains very manageable. Also clarifying for the readers are brief justifications of the design decisions behind the definitions, without turning the article into a fully-fledged tutorial that is too long for Wikipedia. In view of the many misconceptions observed in the printed literature and on the web (including Wikipedia), a substantial package of references is indispensable. The text follows next.

Outline for the article (text starts here)

The concept of a function or a mapping has been described (Herstein^[1], page 9) as "probably the single most important and universal notion that runs through all of mathematics". Evidently this also pertains to all other branches of science (physics, engineering etc.) where mathematics is used.

In present-day mathematics, there are essentially two major variants of the function concept, and in a balanced account both must be addressed. For this purpose, we designate them as (A) the plain variant and (B) the labeled variant, which has a codomain. The subject matter also requires ample references, also because different formulations often define the same variant, thereby clarifying each other. About a dozen paragraph suffice for giving the reader a structured guide through the rather varied literature.

A. Functions: the plain variant

This variant is the simplest and also the most widespread throughout the sciences, including (but not limited to) calculus/analysis^[2]^[3]^[4]^[5]^[6]^[7]^[8]^[9]^[10],set theory^[11]^[12]^[13]^[14]^[15]^[16]^[17],logic^[18]^[19],algebra^[1],discrete mathematics^[20]^[21]^[22],computer science^[23]^[24],and mathematical physics^[25]. Authors and specific page numbers will be mentioned later.

A.1 Basic definition One of the simplest formulations is provided by Apostol^[2] (p.53):

"A function $f$ is a set of ordered pairs $(x,y)$ no two of which have the same first member."

In general, a collection of ordered pairs is called a graph or a relation and is called functional^[12]^[23] or determinate^[21] if no two pairs have the same first member (or component). Thus the preceding definition can be rephrased by saying that a function is a functional graph ((Bourbaki^[12] p. 77). Formulations that are equivalent in content and style appear in calculus/analysis (Apostol^[2] p. 53, Flett^[6], p. 4), set theory (Bourbaki^[12] p. 77, Dasgupta^[13] p. 8, Quine^[15] p. 21, Suppes^[16] p. 57, Tarski & Givant^[17] p. 3), logic (Mendelson^[18] p. 6, Tarski^[19] p. 98), discrete mathematics (Scheinerman^[22] p. 73), computer science ((Meyer^[23] p. 25, Reynolds^[24] p. 452). The wordings differ but all define the same concept, apart from the fact that some authors^[11]^[15]^[17] apply them to classes instead of mere sets.

A.2 Conventions The set of all first members of the ordered pairs in a graph (or relation) $G$ is called the domain of $G$ and is written $\mathrm {dom} \,G$ or ${\mathcal {D}}\,G$ . The set of all second members is called the range of $G$ and is written $\mathrm {ran} \,G$ or ${\mathcal {R}}\,G$ . Let $f$ be a function (functional graph). For each $x$ in the domain of $f$ there is exactly one $y$ such that $(x,y)\in f$ . Hence $y$ is uniquely determined by $f$ and $x$ . It is therefore properly called the value of $f$ at $x$ and can be unambiguously denoted by some suitable combination of $f$ and $x$ , the common "default" form being $f\,x$ or $f(x)$ . Other forms may be chosen as convenient by prior agreement, such as $f_{x}$ or $x^{f}$ . A common example of the latter is writing $M^{\mathsf {T}}$ for matrix transposition.

A.3 The function equality theorem (Apostol^[2] p. 54) Functions $f$ and $g$ are equal ( $f=g$ ) if and only if (a) $f$ and $g$ have the same domain and (b) $f(x)=g(x)$ for every $x$ in this domain. This theorem follows directly from set equality and holds for all formulations (preceding and following) of the definition of plain functions. It implies that a (plain) function $f$ is fully specified by its domain and the value $f(x)$ for each $x$ in that domain. An illustration follows next.

A.4 Function composition This is the most important operation on functions. For any (plain) functions $f$ and $g$ , the composition $g\circ f$ (also written $f\,;g$ ) is also a function, specified as follows: (a) the domain of $g\circ f$ is the set of all values $x$ in the domain of $f$ such that $f(x)$ is in the domain of $g$ and (b) for any such $x$ , the value of $(g\circ f)(x)$ is given by $(g\circ f)(x)=g(f(x))$ or, written with less clutter, $(g\circ f)x=g(f\,x)$ (see Apostol^[2] p. 140, Flett^[6] p. 11, Suppes^[16] p. 87, Tarski & Givant^[17] p. 3, Mendelson^[18] p. 7, Meyer^[23] p. 32, Reynolds^[24] p. 450,452). Composition has the interesting property that, for all functions $f$ , $g$ and $h$ , we have $h\circ (g\circ f)=(h\circ g)\circ f$ . This associativity allows making the parentheses optional and writing, for instance, $h\circ g\circ f=f\,;g\,;h$ .

A.5 Conveying domain and range information The literature presents numerous conventions for relating the domain and/or range of a (plain) function $f$ to sets $X$ and $Y$ . A helpful preamble is the following legend.

statement	meaning
$f$ is a partial function on $X$	$\mathrm {dom} \,f\subseteq X$
$f$ is a (total) function on $X$	$\mathrm {dom} \,f=X$

statement	meaning
$f$ is (in)to $Y$	$\mathrm {ran} \,f\subseteq Y$
$f$ is onto $Y$	$\mathrm {ran} \,f=Y$

For instance (Apostol^[2] p. 578, Flett^[6] p. 5, Dasgupta ^[13] p. 10, Scheinerman^[22] p. 169, Meyer^[23] p. 26, Reynolds^[24] p. 458):

A (total) function from $X$ (in)to $Y$ is a function with domain $X$ and range included in $Y$ .

Flett^[6] (p. 5) warns that such phrases only conveys information about the domain and the range but does not define a new kind of function. A function from $X$ to $Y$ is commonly introduced by writing $f:X\rightarrow Y$ , where $X\rightarrow Y$ can be interpreted as the set of all (total) functions from $X$ (in)to $Y$ (Meyer^[23] p. 26, Reynolds^[24] p. 458), in other contexts also written $Y^{X}$ . As a logical consequence, $f:X\rightarrow Y$ stipulates that (a) the domain of $f$ is $X$ and (b) the values $f(x)$ are in $Y$ and can be further specified, for instance, by a formula. This style is very convenient, as illustrated by the following function specifications

${\mathsf {sqra}}:\mathbb {R} \rightarrow \mathbb {R}$ with ${\mathsf {sqra}}\,x=x^{2}$ and ${\mathsf {sqrb}}:\mathbb {R} \rightarrow \mathbb {R} _{\geq 0}$ with ${\mathsf {sqrb}}\,x=x^{2}$ .

By definition, both specify the same function ( ${\mathsf {sqra}}={\mathsf {sqrb}}$ ) which is onto $\mathbb {R} _{\geq 0}$ but not onto $\mathbb {R}$ .Consider also

${\mathsf {posrt}}:\mathbb {R} _{\geq 0}\rightarrow \mathbb {R} _{\geq 0}$ with $({\mathsf {posrt}}\,x)^{2}=x$ and ${\mathsf {negrt}}:\mathbb {R} _{\geq 0}\rightarrow \mathbb {R} _{\leq 0}$ with $({\mathsf {negrt}}\,x)^{2}=x$ .

Here ${\mathsf {posrt}}$ and ${\mathsf {negrt}}$ are respectively the positive and negative square root function. Both are functions from $\mathbb {R} _{\geq 0}$ to $\mathbb {R}$ but ${\mathsf {posrt}}$ is onto $\mathbb {R} _{\geq 0}$ whereas ${\mathsf {negrt}}$ is onto $\mathbb {R} _{\leq 0}$ .

Similarly, a partial function from $X$ to $Y$ is a function with domain included in $X$ and range included in $Y$ . For instance, in calculus/analysis most functions are defined on some subset (interval, region, ...) of $\mathbb {R}$ , $\mathbb {R} ^{n}$ , $\mathbb {C}$ , $\mathbb {C} ^{n}$ and so on hence are partial on these sets. For the set of partial functions from $X$ (in)to $Y$ one finds various notations, such as $X\not \rightarrow Y$ (Meyer^[23] p. 26) and $X\;{\stackrel {\longrightarrow }{\scriptscriptstyle \mathrm {PFUN} }}\;Y$ (Reynolds^[24] p. 458).

As a very interesting illustration, the reader can verify that, given $f:X\rightarrow Y$ and $g:U\rightarrow V$ , the composition $g\circ f$ is a partial function from $X$ to $V$ and that $g\circ f$ is a total function from $X$ to $V$ iff $\mathrm {ran} \,f\subseteq \mathrm {dom} \,g$ , which trivially holds in case $Y=U$ .

Important remark: as in natural language, onto is used as a preposition, mentioning $Y$ explicitly (Flett^[6] p. 5, Scheinerman^[22] p. 172; more references follow in the next paragraph). A function that is onto $Y$ is sometimes called surjective on $Y$ or a surjection on $Y$ . Scheinerman^[22] (p, 172) designates omitting $Y$ as "mathspeak", but it is not harmless and may cause misunderstandings.

A.6 A shortcut formulation for a function from $X$ to $Y$ Quite a few authors (Bartle & Sherbert^[4] p. 5, Royden^[8] p. 8, Halmos^[14] p. 30,Herstein^[1] p. 10, Gerstein^[20] p. 110, ^[25] Gries & Schneider^[21] p. 280, Szekeres^[25] p. 10) do not start from the basic definition given earlier but directly define a function $f$ from $X$ (in)to $Y$ as a subset of $X\times Y$ such that for every $x$ in $X$ there is exactly one $y$ in $Y$ such that $(x,y)\in f$ . Less often, some authors (Bartle^[3] p. 13, Gries and Schneider^[21] p. 280) use a formulation that amounts to replacing "exactly one" by "at most one", which effectively defines a partial function from $X$ to $Y$ .

Important remark: appearances notwithstanding, this shortcut formulation logically defines exactly the same kind of function as the basic definition with exactly the same properties and conventions. In particular:,

The function equality theorem holds as stated (only mentioned explicitly by Gerstein^[20] p. 113).
Composition $g\circ f$ is defined for any functions $f$ , $g$ , although some authors overlook this and define $g\circ f$ only for $f:X\rightarrow Y$ and $g:Y\rightarrow Z$ , which reduces generality by assuming $\mathrm {ran} \,f\subseteq \mathrm {dom} \,g$ .
Any function $f$ is a function from its domain to any superset of its range. Hence the versatile specification style illustrated by the examples ${\mathsf {sqra}}$ , ${\mathsf {sqrb}}$ , ${\mathsf {posrt}}$ , ${\mathsf {negrt}}$ remains applicable.
As before, onto-ness is specified with respect to a set, using "onto" as a preposition (Bartle^[3] p. 13, Bartle & Sherbert^[4] p.7, Royden^[8] p. 8, Halmos^[14] p. 31, Herstein^[1] p. 12, Gerstein^[20] p. 118, Szekeres^[25] p. 11).

A.7 Separating the plain function concept from its graph Whereas defining a function as a graph is very precise and rigorous, it creates some ambiguities for certain common conventions. Just two examples: (i) writing $f^{n}$ for $n$ -fold function composition and $S^{n}$ for the $n$ -fold Cartesian product, and (ii) defining sequences (in particular pairs) as functions on some subset of the natural numbers. Some definitions (Carlson^[5] p. 182, Kolmogorov & Fomin^[7] p. 5, Rudin^[9] p. 21) avoid this by defining a function $f$ from $X$ to $Y$ less formally as associating "in some manner" a unique value $f(x)$ in $Y$ with every value $x$ in $X$ , called the domain of $f$ . This can be captured as follows:

A (plain) function is an entity that is fully specified by a domain, which is a collection (set or a class) of values, and by a unique value assigned to each element in this domain.

As noted by Royden^[8] (footnote p. 8) this formulation can be made precise by taking the statement of the function equality theorem (A.3) as an axiom. The range of $f$ is then the set of all values $f(x)$ for $x$ in the domain of $f$ .All earlier auxiliary formulations carry through literally as stated, namely, fully general composition (A.4) and conveying domain/range information (A5). The graph of $f$ is then the set of all pairs $(x,fx)$ for $x$ in the domain of $f$ and is denoted by ${\mathcal {G}}\,f$ . Evidently $f=g$ if and only if ${\mathcal {G}}\,f={\mathcal {G}}\,g$ . This may be useful in simplifying certain proofs and definitions (e.g., for inverses).

B. Functions: the labeled variant and the notion of codomain

Recall that, for plain functions, the appearance of $Y$ in $f:X\rightarrow Y$ specifies that $f(x)\in Y$ , without making $Y$ an attribute of $f$ (in contrast $X$ , which is specified to be the domain). How to exploit this flexibility in function specifications was demonstrated by the examples for illustrated by the examples ${\mathsf {sqra}}$ , ${\mathsf {sqrb}}$ , ${\mathsf {posrt}}$ , ${\mathsf {negrt}}$ .

Dasgupta^[13] (p. 10) points out that making $Y$ an attribute of $f$ in a proper fashion requires explicitly attaching $Y$ to $f$ to form a triplet $\langle f,X,Y\rangle$ . Mac Lane^[26] (p. 27) calls this modification labelling. In general,

A (labeled) function is a triplet $F:=\langle f,X,Y\rangle$ where $f$ is a (plain) partial function from $X$ to $Y$ .

The set $X$ is called the source of $F$ and $Y$ is called the target of $F$ or the codomain of $F$ . The domain and the range of $F$ are those of $f$ . Similar formulations, sometimes identifying domain and source, are given by Bourbaki^[12] p. 76, Adámek & al.^[27] footnote p. 14, Bird & De Moor^[28] p. 26, Pierce^[29] p. 2. Some of the major differences with the plain varianr are:

Equality of labeled functions requires equality for source, domain, codomain and images.
For a labeled function $F:=\langle f,X,Y\rangle$ , the following terminology holds: (i) $F$ is partial means that $\mathrm {dom} \,f\subseteq X$ , (ii) $F$ is total means that $\mathrm {dom} \,f=X$ , and (iii) or $F$ is surjective means that $\mathrm {ran} \,f=Y$ .
Composition $G\circ F$ is defined only in case $\mathrm {cod} \,F=\mathrm {src} \,G$ . In case $G$ is total this means that $\mathrm {cod} \,F=\mathrm {dom} \,G$ , which is quite restrictive when compared to the plain variant.

References

Last updated: Boute (talk) 13:11, 15 February 2022 (UTC)

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

Search