/ December 2011; 6 years ago ( 2011-12),.c,.h Major K&R, Influenced by (, ), Influenced:, (HDL),. at Wikibooks C (, as in the ) is a, computer, supporting, and, while a prevents many unintended operations. By design, C provides constructs that map efficiently to typical, and therefore it has found lasting use in applications that had formerly been coded in, including, as well as various for computers ranging from to. C was originally developed by between 1969 and 1973 at, and used to re-implement the operating system. It has since become one of the of all time, with C from various vendors available for the majority of existing and operating systems.
C has been standardized by the (ANSI) since 1989 (see ) and subsequently by the (ISO). C is an language. It was designed to be compiled using a relatively straightforward, to provide low-level access to memory, to provide language constructs that map efficiently to machine instructions, and to require minimal.
Despite its low-level capabilities, the language was designed to encourage programming. A standards-compliant and written C program can be compiled for a very wide variety of computer platforms and operating systems with few changes to its source code. The language has become available on a very wide range of platforms, from embedded to. (right), the inventor of the C programming language, with Like most imperative languages in the tradition, C has facilities for and allows and recursion, while a static prevents many unintended operations. In C, all is contained within, which are called 'functions' (although not in the strict sense of ). Are always passed by value.
Pass-by-reference is simulated in C by explicitly passing values. C program source text is, using the as a terminator and for grouping. The C language also exhibits the following characteristics:. There is a small, fixed number of keywords, including a full set of primitives:, and. User-defined names are not distinguished from keywords by any kind of. There are a large number of arithmetical and logical operators, such as +, +=, , &, , etc. More than one may be performed in a single statement.
Function return values can be ignored when not needed. Typing is, but: all data has a type, but implicit conversions may be performed. mimics usage context. C has no 'define' keyword; instead, a statement beginning with the name of a type is taken as a declaration. There is no 'function' keyword; instead, a function is indicated by the parentheses of an argument list. User-defined ( typedef) and compound types are possible. Heterogeneous aggregate data types ( ) allow related data elements to be accessed and assigned as a unit.
Union is a structure with overlapping members; only the last member stored is valid. indexing is a secondary notation, defined in terms of pointer arithmetic. Unlike structs, arrays are not first-class objects; they cannot be assigned or compared using single built-in operators. There is no 'array' keyword, in use or definition; instead, square brackets indicate arrays syntactically, for example month11. are possible with the enum keyword. They are freely interconvertible with integers.
are not a separate data type, but are conventionally as arrays of characters. Low-level access to is possible by converting machine addresses to typed. (subroutines not returning values) are a special case of function, with an untyped return type void. Functions may not be defined within the lexical scope of other functions. Function and data pointers permit ad hoc.
A performs definition, file inclusion, and. There is a basic form of: files can be compiled separately and together, with control over which functions and data objects are visible to other files via and extern attributes. Complex functionality such as, manipulation, and mathematical functions are consistently delegated to.
While C does not include some features found in some other languages, such as or, such features can be implemented or emulated in C, often by way of external libraries (e.g., the or the ). Relations to other languages Many later languages have borrowed directly or indirectly from C, including, Unix's, and (hardware description language).
These languages have drawn many of their and other basic features from C. Most of them (with Python being the most dramatic exception) are also very similar to C in general, and they tend to combine the recognizable expression and statement with underlying type systems, data models, and semantics that can be radically different. History Early developments Year C Standard 1972 Birth 1978 K&R C 19(89/90) ANSI C and ISO C 1999 C99 2011 C11 The origin of C is closely tied to the development of the operating system, originally implemented in on a by Dennis Ritchie and Ken Thompson, incorporating several ideas from colleagues. Eventually, they decided to port the operating system to a.
The original PDP-11 version of Unix was developed in assembly language. The developers were considering rewriting the system using the, Thompson's simplified version of.
However B's inability to take advantage of some of the PDP-11's features, notably addressability, led to C. The name of C was chosen simply as the next after B. The development of C started in 1972 on the PDP-11 Unix system and first appeared in. The language was not initially designed with portability in mind, but soon ran on different platforms as well: a compiler for the was written within the first year of C's history, while an port followed soon. Also in 1972, a large part of Unix was rewritten in C. By 1973, with the addition of struct types, the C language had become powerful enough that most of the was now in C.
Unix was one of the first operating system kernels implemented in a language other than. Earlier instances include the system (which was written in ) and (MCP) for the (which was written in ) in 1961. In around 1977, Ritchie and made further changes to the language to facilitate portability of the Unix operating system. Johnson's served as the basis for several implementations of C on new platforms. The cover of the book, The C Programming Language, first edition by and In 1978, and published the first edition of.
This book, known to C programmers as 'K&R', served for many years as an informal of the language. The version of C that it describes is commonly referred to as K&R C. The second edition of the book covers the later standard, described below.
K&R introduced several language features:. Standard I/O library. long int data type. unsigned int data type. Compound assignment operators of the form = op (such as =-) were changed to the form op= (that is, -=) to remove the semantic ambiguity created by constructs such as i=-10, which had been interpreted as i =- 10 (decrement i by 10) instead of the possibly intended i = -10 (let i be -10).
Even after the publication of the 1989 ANSI standard, for many years K&R C was still considered the ' to which C programmers restricted themselves when maximum portability was desired, since many older compilers were still in use, and because carefully written K&R C code can be legal Standard C as well. In early versions of C, only functions that return types other than int must be declared if used before the function definition; functions used without prior declaration were presumed to return type int. Main article: During the late 1970s and 1980s, versions of C were implemented for a wide variety of, and, including the, as its popularity began to increase significantly. In 1983, the (ANSI) formed a committee, X3J11, to establish a standard specification of C.
X3J11 based the C standard on the Unix implementation; however, the non-portable portion of the Unix C library was handed off to the 1003 to become the basis for the 1988 standard. In 1989, the C standard was ratified as ANSI X3.159-1989 'Programming Language C'. This version of the language is often referred to as, Standard C, or sometimes C89. In 1990, the ANSI C standard (with formatting changes) was adopted by the (ISO) as ISO/IEC 9899:1990, which is sometimes called C90.
Therefore, the terms 'C89' and 'C90' refer to the same programming language. ANSI, like other national standards bodies, no longer develops the C standard independently, but defers to the international C standard, maintained by the working group /WG14. National adoption of an update to the international standard typically occurs within a year of ISO publication.
One of the aims of the C standardization process was to produce a of K&R C, incorporating many of the subsequently introduced unofficial features. The standards committee also included several additional features such as (borrowed from C), void pointers, support for international and, and preprocessor enhancements. Although the for parameter declarations was augmented to include the style used in C, the K&R interface continued to be permitted, for compatibility with existing source code. C89 is supported by current C compilers, and most C code being written today is based on it.
Any program written only in Standard C and without any hardware-dependent assumptions will run correctly on any with a conforming C implementation, within its resource limits. Without such precautions, programs may compile only on a certain platform or with a particular compiler, due, for example, to the use of non-standard libraries, such as libraries, or to a reliance on compiler- or platform-specific attributes such as the exact size of data types and byte. In cases where code must be compilable by either standard-conforming or K&R C-based compilers, the STDC macro can be used to split the code into Standard and K&R sections to prevent the use on a K&R C-based compiler of features available only in Standard C. After the ANSI/ISO standardization process, the C language specification remained relatively static for several years. In 1995, Normative Amendment 1 to the 1990 C standard (ISO/IEC 9899/AMD1:1995, known informally as C95) was published, to correct some details and to add more extensive support for international character sets. Main article: The C standard was further revised in the late 1990s, leading to the publication of ISO/IEC 9899:1999 in 1999, which is commonly referred to as '. It has since been amended three times by Technical Corrigenda.
C99 introduced several new features, including, several new (including long long int and a complex type to represent ), and, improved support for floating point, support for (macros of variable ), and support for one-line comments beginning with //, as in or C. Many of these had already been implemented as extensions in several C compilers. C99 is for the most part backward compatible with C90, but is stricter in some ways; in particular, a declaration that lacks a type specifier no longer has int implicitly assumed. A standard macro STDCVERSION is defined with value 199901L to indicate that C99 support is available., and other C compilers now support many or all of the new features of C99. The C compiler in, however, implements the C89 standard and those parts of C99 that are required for compatibility with. Main article: In 2007, work began on another revision of the C standard, informally called 'C1X' until its official publication on 2011-12-08.
The C standards committee adopted guidelines to limit the adoption of new features that had not been tested by existing implementations. The C11 standard adds numerous new features to C and the library, including type generic macros, anonymous structures, improved Unicode support, atomic operations, multi-threading, and bounds-checked functions. It also makes some portions of the existing C99 library optional, and improves compatibility with C. The standard macro STDCVERSION is defined as 201112L to indicate that C11 support is available. Main article: Historically, embedded C programming requires nonstandard extensions to the C language in order to support exotic features such as fixed-point arithmetic, multiple distinct memory banks, and basic I/O operations. In 2008, the C Standards Committee published a technical report extending the C language to address these issues by providing a common standard for all implementations to adhere to.
It includes a number of features not available in normal C, such as, named address spaces, and basic I/O hardware addressing. Alignas Alignof Atomic Generic Noreturn Staticassert Threadlocal Most of the recently reserved words begin with an underscore followed by a capital letter, because identifiers of that form were previously reserved by the C standard for use only by implementations. Since existing program source code should not have been using these identifiers, it would not be affected when C implementations started supporting these extensions to the programming language.
Some standard headers do define more convenient synonyms for underscored identifiers. The language previously included a reserved word called entry, but this was seldom implemented, and has now been removed as a reserved word. Main article: C supports a rich set of, which are symbols used within an to specify the manipulations to be performed while evaluating that expression. C has operators for:.:,.: =.: +=, -=,.=, /=,%=, &=, =, ^=, =.: , &, , ^.:.:!, &&.:.
equality testing:,.: ( ).: , -.:., -. object size:.:, =.: &,. sequencing:.: ( ).: ( typename) C uses the operator = (used in mathematics to express equality) to indicate assignment, following the precedent of and, but unlike and its derivatives. C uses the operator to test for equality. The similarity between these two operators (assignment and equality) may result in the accidental use of one in place of the other, and in many cases, the mistake does not produce an error message (although some compilers produce warnings). For example, the conditional expression if(ab+1) might mistakenly be written as if(a=b+1), which will be evaluated as true if a is not zero after the assignment. The C is not always intuitive.
For example, the operator binds more tightly than (is executed prior to) the operators & (bitwise AND) and (bitwise OR) in expressions such as x & 1 0, which must be written as (x & 1) 0 if that is the coder's intent. 'Hello, world' example The ' example, which appeared in the first edition of, has become the model for an introductory program in most programming textbooks, regardless of programming language. The program prints 'hello, world' to the, which is usually a terminal or screen display. The original version was.
This section needs additional citations for. Unsourced material may be challenged and removed. (October 2012) The in C is and, which makes it similar to the type system of descendants such as. There are built-in types for integers of various sizes, both signed and unsigned, and enumerated types ( enum). Integer type char is often used for single-byte characters. There are also derived types including, ( ), and ( union).
C is often used in low-level systems programming where escapes from the type system may be necessary. The compiler attempts to ensure type correctness of most expressions, but the programmer can override the checks in various ways, either by using a to explicitly convert a value from one type to another, or by using pointers or unions to reinterpret the underlying bits of a data object in some other way. Some find C's declaration syntax unintuitive, particularly for. (Ritchie's idea was to declare identifiers in contexts resembling their use: '.) C's usual arithmetic conversions allow for efficient code to be generated, but can sometimes produce unexpected results. For example, a comparison of signed and unsigned integers of equal width requires a conversion of the signed value to unsigned. This can generate unexpected results if the signed value is negative.
Pointers C supports the use of, a type of that records the address or location of an object or function in memory. Pointers can be dereferenced to access data stored at the address pointed to, or to invoke a pointed-to function. Pointers can be manipulated using assignment. The run-time representation of a pointer value is typically a raw memory address (perhaps augmented by an offset-within-word field), but since a pointer's type includes the type of the thing pointed to, expressions including pointers can be type-checked at compile time. Pointer arithmetic is automatically scaled by the size of the pointed-to data type. Pointers are used for many purposes in C.
Are commonly manipulated using pointers into arrays of characters. Is performed using pointers. Many data types, such as, are commonly implemented as dynamically allocated struct objects linked together using pointers.
Pointers to functions are useful for passing functions as arguments to (such as or ) or as to be invoked by event handlers. A value explicitly points to no valid location. Counter-strike source digital zone.
Dereferencing a null pointer value is undefined, often resulting in a. Null pointer values are useful for indicating special cases such as no 'next' pointer in the final node of a, or as an error indication from functions returning pointers. In appropriate contexts in source code, such as for assigning to a pointer variable, a null pointer constant can be written as 0, with or without explicit casting to a pointer type, or as the NULL macro defined by several standard headers. In conditional contexts, null pointer values evaluate to false, while all other pointer values evaluate to true. Void pointers ( void.) point to objects of unspecified type, and can therefore be used as 'generic' data pointers.
Since the size and type of the pointed-to object is not known, void pointers cannot be dereferenced, nor is pointer arithmetic on them allowed, although they can easily be (and in many contexts implicitly are) converted to and from any other object pointer type. Careless use of pointers is potentially dangerous. Because they are typically unchecked, a pointer variable can be made to point to any arbitrary location, which can cause undesirable effects.
Although properly used pointers point to safe places, they can be made to point to unsafe places by using invalid; the objects they point to may continue to be used after deallocation ; they may be used without having been initialized ; or they may be directly assigned an unsafe value using a cast, union, or through another corrupt pointer. In general, C is permissive in allowing manipulation of and conversion between pointer types, although compilers typically provide options for various levels of checking. Some other programming languages address these problems by using more restrictive types.
See also: types in C are traditionally of a fixed, static size specified at compile time. (The more recent C99 standard also allows a form of variable-length arrays.) However, it is also possible to allocate a block of memory (of arbitrary size) at run-time, using the standard library's malloc function, and treat it as an array. C's unification of arrays and pointers means that declared arrays and these dynamically allocated simulated arrays are virtually interchangeable. Since arrays are always accessed (in effect) via pointers, array accesses are typically not checked against the underlying array size, although some compilers may provide as an option. Array bounds violations are therefore possible and rather common in carelessly written code, and can lead to various repercussions, including illegal memory accesses, corruption of data, and run-time exceptions. If bounds checking is desired, it must be done manually.
C does not have a special provision for declaring, but rather relies on within the type system to declare arrays of arrays, which effectively accomplishes the same thing. The index values of the resulting 'multi-dimensional array' can be thought of as increasing in. Multi-dimensional arrays are commonly used in numerical algorithms (mainly from applied ) to store matrices. The structure of the C array is well suited to this particular task. However, since arrays are passed merely as pointers, the bounds of the array must be known fixed values or else explicitly passed to any subroutine that requires them, and dynamically sized arrays of arrays cannot be accessed using double indexing. (A workaround for this is to allocate the array with an additional 'row vector' of pointers to the columns.) C99 introduced 'variable-length arrays' which address some, but not all, of the issues with ordinary C arrays. Array–pointer interchangeability The subscript notation xi (where x designates a pointer) is for.(x+i).
Taking advantage of the compiler's knowledge of the pointer type, the address that x + i points to is not the base address (pointed to by x) incremented by i bytes, but rather is defined to be the base address incremented by i multiplied by the size of an element that x points to. Thus, xi designates the i+1th element of the array. Furthermore, in most expression contexts (a notable exception is as operand of ), the name of an array is automatically converted to a pointer to the array's first element. This implies that an array is never copied as a whole when named as an argument to a function, but rather only the address of its first element is passed. Therefore, although function calls in C use semantics, arrays are in effect passed.
The size of an element can be determined by applying the operator sizeof to any dereferenced element of x, as in n = sizeof.x or n = sizeof x0, and the number of elements in a declared array A can be determined as sizeof A / sizeof A0. The latter only applies to array names: variables declared with subscripts ( int A20). Due to the semantics of C, it is not possible to determine the entire size of arrays through pointers to arrays or those created by dynamic allocation ( ); code such as sizeof arr / sizeof arr0 (where arr designates a pointer) will not work since the compiler assumes the size of the pointer itself is being requested. Since array name arguments to sizeof are not converted to pointers, they do not exhibit such ambiguity. However, arrays created by dynamic allocation are accessed by pointers rather than true array variables, so they suffer from the same sizeof issues as array pointers.
Thus, despite this apparent equivalence between array and pointer variables, there is still a distinction to be made between them. Even though the name of an array is, in most expression contexts, converted into a pointer (to its first element), this pointer does not itself occupy any storage; the array name is not an, and its address is a constant, unlike a pointer variable.
Consequently, what an array 'points to' cannot be changed, and it is impossible to assign a new address to an array name. Array contents may be copied, however, by using the memcpy function, or by accessing the individual elements.
Memory management One of the most important functions of a programming language is to provide facilities for managing and the objects that are stored in memory.