


Does Undefined Behavior Lurk in std::string Chaining in Bjarne Stroustrup\'s Code?
Oct 23, 2024 pm 06:14 PMDoes std::string's Chaining Expression in Bjarne Stroustrup's Code Exhibit Undefined Behavior?
In Bjarne Stroustrup's "The C Programming Language" 4th edition, a code snippet exemplifies chaining using std::string's replace method:
<code class="cpp">void f2() { std::string s = "but I have heard it works even if you don't believe in it"; s.replace(0, 4, "").replace(s.find("even"), 4, "only").replace(s.find(" don't"), 6, ""); assert(s == "I have heard it works only if you believe in it"); }</code>
This code, however, exhibits unspecified behavior instead of invoking undefined behavior.
The reason for this unspecified behavior lies in the order of evaluation, which is unspecified for sub-expressions of chained function calls. In this case, the s.find function calls are evaluated either before or after the first s.replace call, altering the resulting string's length and affecting the subsequent find call's result.
The example in the question demonstrates this: when evaluated by different compilers (clang, gcc), differing results are obtained due to varying evaluation orders.
Details
Function arguments have an unspecified evaluation order, and while chaining function calls introduces a left-to-right evaluation order for each function invocation, the arguments of each call are sequenced before only with respect to that particular function call.
In the example, this indeterminacy arises in the evaluation of s.find("even") and s.find(" don't") with respect to s.replace(0, 4, "").
Ignoring further sub-expression breakdowns, the sequence of evaluation steps and their interdependence can be depicted as follows:
Step 1: s.replace(0, 4, "") // A Step 2: s.find("even") // B Step 3: s.replace(B, 4, "only") // C Step 4: s.find("don't") // D Step 5: s.replace(D, 6, "") // E
While A is sequenced before B, which in turn is sequenced before C, there is no sequencing relationship between B and D with respect to A. As a result, D can be evaluated either before or after A, leading to different outcomes based on the sequence chosen.
C 17 Changes
The C 17 standard strengthens the order of evaluation rules for postfix-expressions and their expression-list, giving the code in question well-specified behavior. The sequencing is as follows:
- The postfix-expression is sequenced before each expression in the expression-list.
- The initialization of each parameter is indeterminately sequenced with respect to that of any other parameter.
Hence, in C 17 and later, this code will always evaluate correctly.
The above is the detailed content of Does Undefined Behavior Lurk in std::string Chaining in Bjarne Stroustrup\'s Code?. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undress AI Tool
Undress images for free

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

std::chrono is used in C to process time, including obtaining the current time, measuring execution time, operation time point and duration, and formatting analysis time. 1. Use std::chrono::system_clock::now() to obtain the current time, which can be converted into a readable string, but the system clock may not be monotonous; 2. Use std::chrono::steady_clock to measure the execution time to ensure monotony, and convert it into milliseconds, seconds and other units through duration_cast; 3. Time point (time_point) and duration (duration) can be interoperable, but attention should be paid to unit compatibility and clock epoch (epoch)

There are mainly the following methods to obtain stack traces in C: 1. Use backtrace and backtrace_symbols functions on Linux platform. By including obtaining the call stack and printing symbol information, the -rdynamic parameter needs to be added when compiling; 2. Use CaptureStackBackTrace function on Windows platform, and you need to link DbgHelp.lib and rely on PDB file to parse the function name; 3. Use third-party libraries such as GoogleBreakpad or Boost.Stacktrace to cross-platform and simplify stack capture operations; 4. In exception handling, combine the above methods to automatically output stack information in catch blocks

In C, the POD (PlainOldData) type refers to a type with a simple structure and compatible with C language data processing. It needs to meet two conditions: it has ordinary copy semantics, which can be copied by memcpy; it has a standard layout and the memory structure is predictable. Specific requirements include: all non-static members are public, no user-defined constructors or destructors, no virtual functions or base classes, and all non-static members themselves are PODs. For example structPoint{intx;inty;} is POD. Its uses include binary I/O, C interoperability, performance optimization, etc. You can check whether the type is POD through std::is_pod, but it is recommended to use std::is_trivia after C 11.

To call Python code in C, you must first initialize the interpreter, and then you can achieve interaction by executing strings, files, or calling specific functions. 1. Initialize the interpreter with Py_Initialize() and close it with Py_Finalize(); 2. Execute string code or PyRun_SimpleFile with PyRun_SimpleFile; 3. Import modules through PyImport_ImportModule, get the function through PyObject_GetAttrString, construct parameters of Py_BuildValue, call the function and process return

FunctionhidinginC occurswhenaderivedclassdefinesafunctionwiththesamenameasabaseclassfunction,makingthebaseversioninaccessiblethroughthederivedclass.Thishappenswhenthebasefunctionisn’tvirtualorsignaturesdon’tmatchforoverriding,andnousingdeclarationis

In C, there are three main ways to pass functions as parameters: using function pointers, std::function and Lambda expressions, and template generics. 1. Function pointers are the most basic method, suitable for simple scenarios or C interface compatible, but poor readability; 2. Std::function combined with Lambda expressions is a recommended method in modern C, supporting a variety of callable objects and being type-safe; 3. Template generic methods are the most flexible, suitable for library code or general logic, but may increase the compilation time and code volume. Lambdas that capture the context must be passed through std::function or template and cannot be converted directly into function pointers.

AnullpointerinC isaspecialvalueindicatingthatapointerdoesnotpointtoanyvalidmemorylocation,anditisusedtosafelymanageandcheckpointersbeforedereferencing.1.BeforeC 11,0orNULLwasused,butnownullptrispreferredforclarityandtypesafety.2.Usingnullpointershe

std::move does not actually move anything, it just converts the object to an rvalue reference, telling the compiler that the object can be used for a move operation. For example, when string assignment, if the class supports moving semantics, the target object can take over the source object resource without copying. Should be used in scenarios where resources need to be transferred and performance-sensitive, such as returning local objects, inserting containers, or exchanging ownership. However, it should not be abused, because it will degenerate into a copy without a moving structure, and the original object status is not specified after the movement. Appropriate use when passing or returning an object can avoid unnecessary copies, but if the function returns a local variable, RVO optimization may already occur, adding std::move may affect the optimization. Prone to errors include misuse on objects that still need to be used, unnecessary movements, and non-movable types
