?
This document uses PHP Chinese website manual Release
在頭文件<stdio.h>中定義 | ||
---|---|---|
(1) | ||
int scanf(const char * format,...); | (直到C99) | |
int scanf(const char * restrict format,...); | (自C99以來) | |
(2) | ||
int fscanf(FILE * stream,const char * format,...); | (直到C99) | |
int fscanf(FILE * restrict stream,const char * restrict format,...); | (自C99以來) | |
(3) | ||
int sscanf(const char * buffer,const char * format,...); | (直到C99) | |
int sscanf(const char *限制緩沖區(qū),const char *限制格式,...); | (自C99以來) | |
int scanf_s(const char * restrict format,...); | (4) | (自C11以來) |
int fscanf_s(FILE * restrict stream,const char * restrict format,...); | (5) | (自C11以來) |
int sscanf_s(const char *限制緩沖區(qū),const char *限制格式,...); | (6) | (自C11以來) |
從各種來源讀取數(shù)據(jù),根據(jù)其解釋并將format
結(jié)果存儲到給定位置。
1)從中讀取數(shù)據(jù) stdin
2)從文件流中讀取數(shù)據(jù) stream
3)從空終止的字符串中讀取數(shù)據(jù)buffer
。到達字符串的末尾等同于達到文件結(jié)束條件fscanf
4-6)與(1-3)相同,不同之處在于%c
,%s
和%[
轉(zhuǎn)換說明符每個都需要兩個參數(shù)(通常的指針和rsize_t
表示接收數(shù)組大小的類型值,當使用%c讀取時可能為1成一個字符),除了在運行時檢測到以下錯誤并調(diào)用當前安裝的約束處理函數(shù):
指針類型的任何參數(shù)都是空指針
format
,stream
或者buffer
是空指針
%c,%s或%[,加上終止空字符,將會超過為每個轉(zhuǎn)換說明符提供的第二個(rsize_t)參數(shù)所寫的字符數(shù)
可選地,還有任何其他可檢測到的錯誤,例如未知的轉(zhuǎn)換說明符
由于所有的邊界檢查功能,scanf_s
,fscanf_s
,和sscanf_s
僅保證可供如果__STDC_LIB_EXT1__
由實現(xiàn)所定義,并且如果用戶定義__STDC_WANT_LIB_EXT1__
的整數(shù)常數(shù)1
,包括之前<stdio.h>
。
流 | - | 輸入文件流從中讀取 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
緩沖 | - | 指向以null結(jié)尾的字符串讀取的指針 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
格式 | - | pointer to a null-terminated character string specifying how to read the input. The format string consists of. non-whitespace multibyte characters except %: each such character in the format string consumes exactly one identical character from the input stream, or causes the function to fail if the next character on the stream does not compare equal. whitespace characters: any single whitespace character in the format string consumes all available consecutive whitespace characters from the input (determined as if by calling isspace in a loop). Note that there is no difference between "\n", " ", "\t\t", or other whitespace in the format string. conversion specifications. Each conversion specification has the following format: introductory % character (optional) assignment-suppressing character *. If this option is present, the function does not assign the result of the conversion to any receiving argument. (optional) integer number (greater than zero) that specifies maximum field width, that is, the maximum number of characters that the function is allowed to consume when doing the conversion specified by the current conversion specification. Note that %s and %[ may lead to buffer overflow if the width is not provided. (optional) length modifier that specifies the size of the receiving argument, that is, the actual destination type. This affects the conversion accuracy and overflow rules. The default destination type is different for each conversion type (see table below). conversion format specifier The following format specifiers are available: Conversion specifier Explanation Argument type length modifier hh (C99). h (none) l ll (C99). j (C99). z (C99). t (C99). L % matches literal % N/A N/A N/A N/A N/A N/A N/A N/A N/A c matches a character or a sequence of characters If a width specifier is used, matches exactly width characters (the argument must be a pointer to an array with sufficient room). Unlike %s and %[, does not append the null character to the array. N/A N/A char* wchar_t* N/A N/A N/A N/A N/A s matches a sequence of non-whitespace characters (a string) If width specifier is used, matches up to width or until the first whitespace character, whichever appears first. Always stores a null character in addition to the characters matched (so the argument array must have room for at least width+1 characters). set matches a non-empty sequence of character from set of characters. If the first character of the set is ^, then all characters not in the set are matched. If the set begins with ] or ^] then the ] character is also included into the set. It is implementation-defined whether the character - in the non-initial position in the scanset may be indicating a range, as in 0-9. If width specifier is used, matches only up to width. Always stores a null character in addition to the characters matched (so the argument array must have room for at least width+1 characters). d matches a decimal integer. The format of the number is the same as expected by strtol() with the value 10 for the base argument. signed char* or unsigned char* signed short* or unsigned short* signed int* or unsigned int* signed long* or unsigned long* signed long long* or unsigned long long* intmax_t* or uintmax_t* size_t* ptrdiff_t* N/A i matches an integer. The format of the number is the same as expected by strtol() with the value 0 for the base argument (base is determined by the first characters parsed). u matches an unsigned decimal integer. The format of the number is the same as expected by strtoul() with the value 10 for the base argument. o matches an unsigned octal integer. The format of the number is the same as expected by strtoul() with the value 8 for the base argument. x, X matches an unsigned hexadecimal integer. The format of the number is the same as expected by strtoul() with the value 16 for the base argument. n returns the number of characters read so far. No input is consumed. Does not increment the assignment count. If the specifier has assignment-suppressing operator defined, the behavior is undefined. a, A(C99) e, E f, F g, G matches a floating-point number. The format of the number is the same as expected by strtof(). N/A N/A float* double* N/A N/A N/A N/A long double* p matches implementation defined character sequence defining a pointer. printf family of functions should produce the same sequence using %p format specifier. N/A N/A void** N/A N/A N/A N/A N/A N/A For every conversion specifier other than n, the longest sequence of input characters which does not exceed any speci?ed ?eld width and which either is exactly what the conversion specifier expects or is a prefix of a sequence it would expect, is what's consumed from the stream. The ?rst character, if any, after this consumed sequence remains unread. If the consumed sequence has length zero or if the consumed sequence cannot be converted as specified above, the matching failure occurs unless end-of-?le, an encoding error, or a read error prevented input from the stream, in which case it is an input failure. All conversion specifiers other than [, c, and n consume and discard all leading whitespace characters (determined as if by calling isspace) before attempting to parse the input. These consumed characters do not count towards the specified maximum field width. The conversion specifiers lc, ls, and l[ perform multibyte-to-wide character conversion as if by calling mbrtowc() with an mbstate_t object initialized to zero before the first character is converted. The conversion specifiers s and [ always store the null terminator in addition to the matched characters. The size of the destination array must be at least one greater than the specified field width. The use of %s or %[, without specifying the destination array size, is as unsafe as gets. The correct conversion specifications for the fixed-width integer types (int8_t, etc) are defined in the header <inttypes.h> (although SCNdMAX, SCNuMAX, etc is synonymous with %jd, %ju, etc). There is a sequence point after the action of each conversion specifier; this permits storing multiple fields in the same "sink" variable. When parsing an incomplete floating-point value that ends in the exponent with no digits, such as parsing "100er" with the conversion specifier %f, the sequence "100e" (the longest prefix of a possibly valid floating-point number) is consumed, resulting in a matching error (the consumed sequence cannot be converted to a floating-point number), with "r" remaining. Existing implementations do not follow this rule and roll back to consume only "100", leaving "er", e.g. glibc bug 1765. | Conversion specifier | Explanation | Argument type | length modifier | hh (C99). | h | (none) | l | ll (C99). | j (C99). | z (C99). | t (C99). | L | % | matches literal % | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | c | matches a character or a sequence of characters If a width specifier is used, matches exactly width characters (the argument must be a pointer to an array with sufficient room). Unlike %s and %[, does not append the null character to the array. | N/A | N/A | char* | wchar_t* | N/A | N/A | N/A | N/A | N/A | s | matches a sequence of non-whitespace characters (a string) If width specifier is used, matches up to width or until the first whitespace character, whichever appears first. Always stores a null character in addition to the characters matched (so the argument array must have room for at least width+1 characters). | set | matches a non-empty sequence of character from set of characters. If the first character of the set is ^, then all characters not in the set are matched. If the set begins with ] or ^] then the ] character is also included into the set. It is implementation-defined whether the character - in the non-initial position in the scanset may be indicating a range, as in 0-9. If width specifier is used, matches only up to width. Always stores a null character in addition to the characters matched (so the argument array must have room for at least width+1 characters). | d | matches a decimal integer. The format of the number is the same as expected by strtol() with the value 10 for the base argument. | signed char* or unsigned char* | signed short* or unsigned short* | signed int* or unsigned int* | signed long* or unsigned long* | signed long long* or unsigned long long* | intmax_t* or uintmax_t* | size_t* | ptrdiff_t* | N/A | i | matches an integer. The format of the number is the same as expected by strtol() with the value 0 for the base argument (base is determined by the first characters parsed). | u | matches an unsigned decimal integer. The format of the number is the same as expected by strtoul() with the value 10 for the base argument. | o | matches an unsigned octal integer. The format of the number is the same as expected by strtoul() with the value 8 for the base argument. | x, X | matches an unsigned hexadecimal integer. The format of the number is the same as expected by strtoul() with the value 16 for the base argument. | n | returns the number of characters read so far. No input is consumed. Does not increment the assignment count. If the specifier has assignment-suppressing operator defined, the behavior is undefined. | a, A(C99) e, E f, F g, G | matches a floating-point number. The format of the number is the same as expected by strtof(). | N/A | N/A | float* | double* | N/A | N/A | N/A | N/A | long double* | p | matches implementation defined character sequence defining a pointer. printf family of functions should produce the same sequence using %p format specifier. | N/A | N/A | void** | N/A | N/A | N/A | N/A | N/A | N/A |
Conversion specifier | Explanation | Argument type | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
length modifier | hh (C99). | h | (none) | l | ll (C99). | j (C99). | z (C99). | t (C99). | L | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
% | matches literal % | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
c | matches a character or a sequence of characters If a width specifier is used, matches exactly width characters (the argument must be a pointer to an array with sufficient room). Unlike %s and %[, does not append the null character to the array. | N/A | N/A | char* | wchar_t* | N/A | N/A | N/A | N/A | N/A | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
s | matches a sequence of non-whitespace characters (a string) If width specifier is used, matches up to width or until the first whitespace character, whichever appears first. Always stores a null character in addition to the characters matched (so the argument array must have room for at least width+1 characters). | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
set | matches a non-empty sequence of character from set of characters. If the first character of the set is ^, then all characters not in the set are matched. If the set begins with ] or ^] then the ] character is also included into the set. It is implementation-defined whether the character - in the non-initial position in the scanset may be indicating a range, as in 0-9. If width specifier is used, matches only up to width. Always stores a null character in addition to the characters matched (so the argument array must have room for at least width+1 characters). | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
d | matches a decimal integer. The format of the number is the same as expected by strtol() with the value 10 for the base argument. | signed char* or unsigned char* | signed short* or unsigned short* | signed int* or unsigned int* | signed long* or unsigned long* | signed long long* or unsigned long long* | intmax_t *或uintmax_t * | 為size_t * | ptrdiff_t的* | N / A | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
一世 | 匹配一個整數(shù)。數(shù)字的格式與strtol()所期望的相同,基本參數(shù)的值為0(基數(shù)由解析的第一個字符決定)。 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
在 | 匹配一個無符號的十進制整數(shù)。數(shù)字的格式與strtoul()的預期相同,基本參數(shù)的值為10。 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
該 | 匹配一個無符號的八進制整數(shù)。數(shù)字的格式與strtoul()的預期相同,基本參數(shù)的值為8。 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
x,X | 匹配一個無符號的十六進制整數(shù)。數(shù)字的格式與strtoul()預期的基本參數(shù)的值為16相同。 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
? | 返回迄今閱讀的字符數(shù)。沒有輸入被消耗。不增加分配計數(shù)。如果說明符具有定義的分配抑制運算符,則行為是未定義的。 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a,A(C99)e,E f,F(xiàn) g,G | 匹配一個浮點數(shù)。數(shù)字的格式與strtof()的預期相同。 | N / A | N / A | 浮動* | 雙* | N / A | N / A | N / A | N / A | 長雙* | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
p | 匹配定義指針的實現(xiàn)定義的字符序列。printf系列函數(shù)應該使用%p格式說明符產(chǎn)生相同的序列。 | N / A | N / A | 無效** | N / A | N / A | N / A | N / A | N / A | N / A | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
... | - | 接收參數(shù) |
對于每個轉(zhuǎn)換說明符以外的n
,最長的輸入字符序列不超過任何指定的字段寬度,或者正是轉(zhuǎn)換說明符所期望的或者是期望序列的前綴,是流中消耗的內(nèi)容。在消耗序列之后的第一個字符(如果有的話)仍然未讀。如果消耗的序列長度為零或消費的序列無法如上所述進行轉(zhuǎn)換,則會發(fā)生匹配失敗,除非文件結(jié)束,編碼錯誤或讀取錯誤阻止來自流的輸入,在這種情況下,它是輸入失敗。
在嘗試解析輸入之前,除了[
,,之外的所有轉(zhuǎn)換說明符c
并n
消耗并放棄所有前導空白字符(如同調(diào)用isspace一樣確定)。這些消耗的字符不會計入指定的最大字段寬度。
轉(zhuǎn)換說明符lc
,ls
并l[
執(zhí)行多字節(jié)到寬字符的轉(zhuǎn)換,就好像通過調(diào)用mbrtowc()并在轉(zhuǎn)換第一個字符之前將mbstate_t對象初始化為零一樣。
除了匹配的字符之外,轉(zhuǎn)換說明符s
并[
始終存儲空終止符。目標數(shù)組的大小必須至少比指定的字段寬度大1。使用%小號或%[,而不指定目的地的陣列大小,是為不安全作為得到
為正確的轉(zhuǎn)換規(guī)格固定寬度整數(shù)類型(int8_t
等)都在頭中定義<inttypes.h>還(雖然SCNdMAX,SCNuMAX等是同義詞%jd
,%ju
等)。
每個轉(zhuǎn)換說明符的操作之后都有一個序列點 ; 這允許將多個字段存儲在相同的“匯”變量中。
當解析一個不帶數(shù)字的指數(shù)結(jié)束的不完整浮點值時,例如用轉(zhuǎn)換說明符%f解析“100er”時,會消耗序列“100e”(可能有效的浮點數(shù)的最長前綴) ,導致匹配錯誤(消耗的序列不能轉(zhuǎn)換為浮點數(shù)),并保留“r”?,F(xiàn)有的實現(xiàn)不遵循這個規(guī)則并回滾到只消耗“100”,留下“er”,例如glibc bug 1765
如果轉(zhuǎn)換規(guī)范無效,則行為未定義。
... | - | 接收參數(shù) |
---|
1-3)成功分配的接收參數(shù)數(shù)量(EOF
如果在分配第一個接收參數(shù)之前發(fā)生匹配故障,則可能為零),或者在分配第一個接收參數(shù)前發(fā)生輸入故障。
4-6)與(1-3)相同,但是EOF
如果存在運行時約束沖突,也會返回。
由于大多數(shù)轉(zhuǎn)換說明符首先消耗所有連續(xù)的空白,所以代碼如。
scanf("%d", &a);scanf("%d", &b);
將讀取在不同行上輸入的兩個整數(shù)(第二個%d將使用第一個剩下的換行符)或在同一行上,由空格或制表符分隔(第二個%d將使用空格或制表符)。不使用前導空格的轉(zhuǎn)換說明符(如%c)可以通過在格式字符串中使用空格字符來完成:
scanf("%d", &a);scanf(" %c", &c); // ignore the endline after %d, then read a char
#define __STDC_WANT_LIB_EXT1__ 1#include <stdio.h>#include <stddef.h>#include <locale.h> int main(void){ int i, j; float x, y; char str1[10], str2[4]; wchar_t warr[2]; setlocale(LC_ALL, "en_US.utf8"); char input[] = "25 54.32E-1 Thompson 56789 0123 56?水"; /* parse as follows: %d: an integer %f: a floating-point value %9s: a string of at most 9 non-whitespace characters %2d: two-digit integer (digits 5 and 6) %f: a floating-point value (digits 7, 8, 9) %*d: an integer which isn't stored anywhere ' ': all consecutive whitespace %3[0-9]: a string of at most 3 decimal digits (digits 5 and 6) %2lc: two wide characters, using multibyte to wide conversion */ int ret = sscanf(input, "%d%f%9s%2d%f%*d %3[0-9]%2lc", &i, &x, str1, &j, &y, str2, warr); printf("Converted %d fields:\ni = %d\nx = %f\nstr1 = %s\n" "j = %d\ny = %f\nstr2 = %s\n" "warr[0] = U+%x warr[1] = U+%x\n", ret, i, x, str1, j, y, str2, warr[0], warr[1]); #ifdef __STDC_LIB_EXT1__ int n = sscanf_s(input, "%d%f%s", &i, &x, str1, (rsize_t)sizeof str1); // writes 25 to i, 5.432 to x, the 9 bytes "thompson\0" to str1, and 3 to n.#endif}
輸出:
Converted 7 fields:i = 25x = 5.432000str1 = Thompson j = 56y = 789.000000str2 = 56warr[0] = U+df warr[1] = U+6c34
C11標準(ISO / IEC 9899:2011):
7.21.6.2 fscanf函數(shù)(p:317-324)
7.21.6.4 scanf函數(shù)(p:325)
7.21.6.7 sscanf函數(shù)(p:326)
K.3.5.3.2 fscanf_s函數(shù)(p:592-593)
K.3.5.3.4 scanf_s函數(shù)(p:594)
K.3.5.3.7 sscanf_s函數(shù)(p:596)
C99標準(ISO / IEC 9899:1999):
7.19.6.2 fscanf函數(shù)(p:282-289)
7.19.6.4 scanf函數(shù)(p:290)
7.19.6.7 sscanf函數(shù)(p:291)
C89 / C90標準(ISO / IEC 9899:1990):
4.9.6.2 fscanf函數(shù)
4.9.6.4 scanf函數(shù)
4.9.6.6 sscanf函數(shù)