Top |
GScanner * | g_scanner_new () |
void | g_scanner_destroy () |
void | g_scanner_input_file () |
void | g_scanner_sync_file_offset () |
void | g_scanner_input_text () |
GTokenType | g_scanner_peek_next_token () |
GTokenType | g_scanner_get_next_token () |
gboolean | g_scanner_eof () |
guint | g_scanner_cur_line () |
guint | g_scanner_cur_position () |
GTokenType | g_scanner_cur_token () |
GTokenValue | g_scanner_cur_value () |
guint | g_scanner_set_scope () |
void | g_scanner_scope_add_symbol () |
void | g_scanner_scope_foreach_symbol () |
gpointer | g_scanner_scope_lookup_symbol () |
void | g_scanner_scope_remove_symbol () |
#define | g_scanner_add_symbol() |
#define | g_scanner_remove_symbol() |
#define | g_scanner_foreach_symbol() |
#define | g_scanner_freeze_symbol_table() |
#define | g_scanner_thaw_symbol_table() |
gpointer | g_scanner_lookup_symbol () |
void | g_scanner_warn () |
void | g_scanner_error () |
void | g_scanner_unexp_token () |
void | (*GScannerMsgFunc) () |
struct | GScanner |
struct | GScannerConfig |
#define | G_CSET_a_2_z |
#define | G_CSET_A_2_Z |
#define | G_CSET_DIGITS |
#define | G_CSET_LATINC |
#define | G_CSET_LATINS |
enum | GTokenType |
union | GTokenValue |
enum | GErrorType |
GScanner *
g_scanner_new (const GScannerConfig *config_templ
);
Creates a new GScanner.
The config_templ
structure specifies the initial settings
of the scanner, which are copied into the GScanner
config
field. If you pass NULL
then the default settings
are used.
void
g_scanner_destroy (GScanner *scanner
);
Frees all memory used by the GScanner.
void g_scanner_input_file (GScanner *scanner
,gint input_fd
);
Prepares to scan a file.
void
g_scanner_sync_file_offset (GScanner *scanner
);
Rewinds the filedescriptor to the current buffer position and blows the file read ahead buffer. This is useful for third party uses of the scanners filedescriptor, which hooks onto the current scanning position.
void g_scanner_input_text (GScanner *scanner
,const gchar *text
,guint text_len
);
Prepares to scan a text buffer.
GTokenType
g_scanner_peek_next_token (GScanner *scanner
);
Parses the next token, without removing it from the input stream.
The token data is placed in the next_token
, next_value
, next_line
,
and next_position
fields of the GScanner structure.
Note that, while the token is not removed from the input stream
(i.e. the next call to g_scanner_get_next_token()
will return the
same token), it will not be reevaluated. This can lead to surprising
results when changing scope or the scanner configuration after peeking
the next token. Getting the next token after switching the scope or
configuration will return whatever was peeked before, regardless of
any symbols that may have been added or removed in the new scope.
GTokenType
g_scanner_get_next_token (GScanner *scanner
);
Parses the next token just like g_scanner_peek_next_token()
and also removes it from the input stream. The token data is
placed in the token
, value
, line
, and position
fields of
the GScanner structure.
gboolean
g_scanner_eof (GScanner *scanner
);
Returns TRUE
if the scanner has reached the end of
the file or text buffer.
guint
g_scanner_cur_line (GScanner *scanner
);
Returns the current line in the input stream (counting
from 1). This is the line of the last token parsed via
g_scanner_get_next_token()
.
guint
g_scanner_cur_position (GScanner *scanner
);
Returns the current position in the current line (counting
from 0). This is the position of the last token parsed via
g_scanner_get_next_token()
.
GTokenType
g_scanner_cur_token (GScanner *scanner
);
Gets the current token type. This is simply the token
field in the GScanner structure.
GTokenValue
g_scanner_cur_value (GScanner *scanner
);
Gets the current token value. This is simply the value
field in the GScanner structure.
guint g_scanner_set_scope (GScanner *scanner
,guint scope_id
);
Sets the current scope.
void g_scanner_scope_add_symbol (GScanner *scanner
,guint scope_id
,const gchar *symbol
,gpointer value
);
Adds a symbol to the given scope.
scanner |
a GScanner |
|
scope_id |
the scope id |
|
symbol |
the symbol to add |
|
value |
the value of the symbol |
void g_scanner_scope_foreach_symbol (GScanner *scanner
,guint scope_id
,GHFunc func
,gpointer user_data
);
Calls the given function for each of the symbol/value pairs
in the given scope of the GScanner. The function is passed
the symbol and value of each pair, and the given user_data
parameter.
scanner |
a GScanner |
|
scope_id |
the scope id |
|
func |
the function to call for each symbol/value pair |
|
user_data |
user data to pass to the function |
gpointer g_scanner_scope_lookup_symbol (GScanner *scanner
,guint scope_id
,const gchar *symbol
);
Looks up a symbol in a scope and return its value. If the
symbol is not bound in the scope, NULL
is returned.
void g_scanner_scope_remove_symbol (GScanner *scanner
,guint scope_id
,const gchar *symbol
);
Removes a symbol from a scope.
#define g_scanner_add_symbol( scanner, symbol, value )
g_scanner_add_symbol
has been deprecated since version 2.2 and should not be used in newly-written code.
Use g_scanner_scope_add_symbol()
instead.
Adds a symbol to the default scope.
#define g_scanner_remove_symbol( scanner, symbol )
g_scanner_remove_symbol
has been deprecated since version 2.2 and should not be used in newly-written code.
Use g_scanner_scope_remove_symbol()
instead.
Removes a symbol from the default scope.
#define g_scanner_foreach_symbol( scanner, func, data )
g_scanner_foreach_symbol
has been deprecated since version 2.2 and should not be used in newly-written code.
Use g_scanner_scope_foreach_symbol()
instead.
Calls a function for each symbol in the default scope.
scanner |
a GScanner |
|
func |
the function to call with each symbol |
|
data |
data to pass to the function |
#define g_scanner_freeze_symbol_table(scanner)
g_scanner_freeze_symbol_table
has been deprecated since version 2.2 and should not be used in newly-written code.
This macro does nothing.
There is no reason to use this macro, since it does nothing.
#define g_scanner_thaw_symbol_table(scanner)
g_scanner_thaw_symbol_table
has been deprecated since version 2.2 and should not be used in newly-written code.
This macro does nothing.
There is no reason to use this macro, since it does nothing.
gpointer g_scanner_lookup_symbol (GScanner *scanner
,const gchar *symbol
);
Looks up a symbol in the current scope and return its value.
If the symbol is not bound in the current scope, NULL
is
returned.
the value of symbol
in the current scope, or NULL
if symbol
is not bound in the current scope
void g_scanner_warn (GScanner *scanner
,const gchar *format
,...
);
Outputs a warning message, via the GScanner message handler.
scanner |
a GScanner |
|
format |
the message format. See the |
|
... |
the parameters to insert into the format string |
void g_scanner_error (GScanner *scanner
,const gchar *format
,...
);
Outputs an error message, via the GScanner message handler.
scanner |
a GScanner |
|
format |
the message format. See the |
|
... |
the parameters to insert into the format string |
void g_scanner_unexp_token (GScanner *scanner
,GTokenType expected_token
,const gchar *identifier_spec
,const gchar *symbol_spec
,const gchar *symbol_name
,const gchar *message
,gint is_error
);
Outputs a message through the scanner's msg_handler,
resulting from an unexpected token in the input stream.
Note that you should not call g_scanner_peek_next_token()
followed by g_scanner_unexp_token()
without an intermediate
call to g_scanner_get_next_token()
, as g_scanner_unexp_token()
evaluates the scanner's current token (not the peeked token)
to construct part of the message.
scanner |
a GScanner |
|
expected_token |
the expected token |
|
identifier_spec |
a string describing how the scanner's user
refers to identifiers ( |
|
symbol_spec |
a string describing how the scanner's user refers
to symbols ( |
|
symbol_name |
the name of the symbol, if the scanner's current token is a symbol. |
|
message |
a message string to output at the end of the
warning/error, or |
|
is_error |
if |
struct GScanner { /* unused fields */ gpointer user_data; guint max_parse_errors; /* g_scanner_error() increments this field */ guint parse_errors; /* name of input stream, featured by the default message handler */ const gchar *input_name; /* quarked data */ GData *qdata; /* link into the scanner configuration */ GScannerConfig *config; /* fields filled in after g_scanner_get_next_token() */ GTokenType token; GTokenValue value; guint line; guint position; /* fields filled in after g_scanner_peek_next_token() */ GTokenType next_token; GTokenValue next_value; guint next_line; guint next_position; /* handler function for _warn and _error */ GScannerMsgFunc msg_handler; };
The data structure representing a lexical scanner.
You should set input_name
after creating the scanner, since
it is used by the default message handler when displaying
warnings and errors. If you are scanning a file, the filename
would be a good choice.
The user_data
and max_parse_errors
fields are not used.
If you need to associate extra data with the scanner you
can place them here.
If you want to use your own message handler you can set the
msg_handler
field. The type of the message handler function
is declared by GScannerMsgFunc.
gpointer |
unused |
|
guint |
unused |
|
guint |
g_scanner_error() increments this field |
|
const gchar * |
name of input stream, featured by the default message handler |
|
GData * |
quarked data |
|
GScannerConfig * |
link into the scanner configuration |
|
GTokenType |
token parsed by the last |
|
GTokenValue |
value of the last token from |
|
guint |
line number of the last token from |
|
guint |
char number of the last token from |
|
GTokenType |
token parsed by the last |
|
GTokenValue |
value of the last token from |
|
guint |
line number of the last token from |
|
guint |
char number of the last token from |
|
GScannerMsgFunc |
handler function for _warn and _error |
struct GScannerConfig { /* Character sets */ gchar *cset_skip_characters; /* default: " \t\n" */ gchar *cset_identifier_first; gchar *cset_identifier_nth; gchar *cpair_comment_single; /* default: "#\n" */ /* Should symbol lookup work case sensitive? */ guint case_sensitive : 1; /* Boolean values to be adjusted "on the fly" * to configure scanning behaviour. */ guint skip_comment_multi : 1; /* C like comment */ guint skip_comment_single : 1; /* single line comment */ guint scan_comment_multi : 1; /* scan multi line comments? */ guint scan_identifier : 1; guint scan_identifier_1char : 1; guint scan_identifier_NULL : 1; guint scan_symbols : 1; guint scan_binary : 1; guint scan_octal : 1; guint scan_float : 1; guint scan_hex : 1; /* '0x0ff0' */ guint scan_hex_dollar : 1; /* '$0ff0' */ guint scan_string_sq : 1; /* string: 'anything' */ guint scan_string_dq : 1; /* string: "\\-escapes!\n" */ guint numbers_2_int : 1; /* bin, octal, hex => int */ guint int_2_float : 1; /* int => G_TOKEN_FLOAT? */ guint identifier_2_string : 1; guint char_2_token : 1; /* return G_TOKEN_CHAR? */ guint symbol_2_token : 1; guint scope_0_fallback : 1; /* try scope 0 on lookups? */ guint store_int64 : 1; /* use value.v_int64 rather than v_int */ };
Specifies the GScanner parser configuration. Most settings can be changed during the parsing phase and will affect the lexical parsing of the next unpeeked token.
gchar * |
specifies which characters should be skipped by the scanner (the default is the whitespace characters: space, tab, carriage-return and line-feed). |
|
gchar * |
specifies the characters which can start identifiers (the default is G_CSET_a_2_z, "_", and G_CSET_A_2_Z). |
|
gchar * |
specifies the characters which can be used in identifiers, after the first character (the default is G_CSET_a_2_z, "_0123456789", G_CSET_A_2_Z, G_CSET_LATINS, G_CSET_LATINC). |
|
gchar * |
specifies the characters at the start and end of single-line comments. The default is "#\n" which means that single-line comments start with a '#' and continue until a '\n' (end of line). |
|
guint |
specifies if symbols are case sensitive (the
default is |
|
guint |
specifies if multi-line comments are skipped
and not returned as tokens (the default is |
|
guint |
specifies if single-line comments are skipped
and not returned as tokens (the default is |
|
guint |
specifies if multi-line comments are recognized
(the default is |
|
guint |
specifies if identifiers are recognized (the
default is |
|
guint |
specifies if single-character
identifiers are recognized (the default is |
|
guint |
specifies if |
|
guint |
specifies if symbols are recognized (the default
is |
|
guint |
specifies if binary numbers are recognized (the
default is |
|
guint |
specifies if octal numbers are recognized (the
default is |
|
guint |
specifies if floating point numbers are recognized
(the default is |
|
guint |
specifies if hexadecimal numbers are recognized (the
default is |
|
guint |
specifies if '$' is recognized as a prefix for
hexadecimal numbers (the default is |
|
guint |
specifies if strings can be enclosed in single
quotes (the default is |
|
guint |
specifies if strings can be enclosed in double
quotes (the default is |
|
guint |
specifies if binary, octal and hexadecimal numbers
are reported as G_TOKEN_INT (the default is |
|
guint |
specifies if all numbers are reported as |
|
guint |
specifies if identifiers are reported as strings
(the default is |
|
guint |
specifies if characters are reported by setting
|
|
guint |
specifies if symbols are reported by setting
|
|
guint |
specifies if a symbol is searched for in the
default scope in addition to the current scope (the default is |
|
guint |
use value.v_int64 rather than v_int |
#define G_CSET_a_2_z "abcdefghijklmnopqrstuvwxyz"
The set of lowercase ASCII alphabet characters. Used for specifying valid identifier characters in GScannerConfig.
#define G_CSET_A_2_Z "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
The set of uppercase ASCII alphabet characters. Used for specifying valid identifier characters in GScannerConfig.
#define G_CSET_DIGITS "0123456789"
The set of ASCII digits. Used for specifying valid identifier characters in GScannerConfig.
#define G_CSET_LATINC
The set of uppercase ISO 8859-1 alphabet characters which are not ASCII characters. Used for specifying valid identifier characters in GScannerConfig.
#define G_CSET_LATINS
The set of lowercase ISO 8859-1 alphabet characters which are not ASCII characters. Used for specifying valid identifier characters in GScannerConfig.
The possible types of token returned from each
g_scanner_get_next_token()
call.
the end of the file |
||
a '(' character |
||
a ')' character |
||
a '{' character |
||
a '}' character |
||
a '[' character |
||
a ']' character |
||
a '=' character |
||
a ',' character |
||
not a token |
||
an error occurred |
||
a character |
||
a binary integer |
||
an octal integer |
||
an integer |
||
a hex integer |
||
a floating point number |
||
a string |
||
a symbol |
||
an identifier |
||
a null identifier |
||
one line comment |
||
multi line comment |
A union holding the value of the token.
gpointer |
token symbol value |
|
gchar * |
token identifier value |
|
gulong |
token binary integer value |
|
gulong |
octal integer value |
|
gulong |
integer value |
|
guint64 |
64-bit integer value |
|
gdouble |
floating point value |
|
gulong |
hex integer value |
|
gchar * |
string value |
|
gchar * |
comment value |
|
guchar |
character value |
|
guint |
error value |
The possible errors, used in the v_error
field
of GTokenValue, when the token is a G_TOKEN_ERROR
.