The analyzer performs checks that are categorized into families or "checkers".
The default set of checkers covers a variety of checks targeted at finding security and API usage bugs, dead code, and other logic errors. See the :ref:`default-checkers` checkers list below.
In addition to these, the analyzer contains a number of :ref:`alpha-checkers` (aka alpha checkers). These checkers are under development and are switched off by default. They may crash or emit a higher number of false positives.
The :ref:`debug-checkers` package contains checkers for analyzer developers for debugging purposes.
Table of Contents
- Default Checkers
- core
- core.BitwiseShift (C, C++)
- core.CallAndMessage (C, C++, ObjC)
- core.DivideZero (C, C++, ObjC)
- core.FixedAddressDereference (C, C++, ObjC)
- core.NonNullParamChecker (C, C++, ObjC)
- core.NullDereference (C, C++, ObjC)
- core.StackAddressEscape (C)
- core.UndefinedBinaryOperatorResult (C)
- core.VLASize (C)
- core.uninitialized.ArraySubscript (C)
- core.uninitialized.Assign (C)
- core.uninitialized.Branch (C)
- core.uninitialized.CapturedBlockVariable (C)
- core.uninitialized.UndefReturn (C)
- core.uninitialized.NewArraySize (C++)
- cplusplus
- deadcode
- nullability
- optin
- optin.core.EnumCastOutOfRange (C, C++)
- optin.cplusplus.UninitializedObject (C++)
- optin.cplusplus.VirtualCall (C++)
- optin.mpi.MPI-Checker (C)
- optin.osx.cocoa.localizability.EmptyLocalizationContextChecker (ObjC)
- optin.osx.cocoa.localizability.NonLocalizedStringChecker (ObjC)
- optin.performance.GCDAntipattern
- optin.performance.Padding (C, C++, ObjC)
- optin.portability.UnixAPI
- optin.taint
- security
- security.ArrayBound (C, C++)
- security.cert.env.InvalidPtr
- security.FloatLoopCounter (C)
- security.insecureAPI.UncheckedReturn (C)
- security.insecureAPI.bcmp (C)
- security.insecureAPI.bcopy (C)
- security.insecureAPI.bzero (C)
- security.insecureAPI.getpw (C)
- security.insecureAPI.gets (C)
- security.insecureAPI.mkstemp (C)
- security.insecureAPI.mktemp (C)
- security.insecureAPI.rand (C)
- security.insecureAPI.strcpy (C)
- security.insecureAPI.vfork (C)
- security.insecureAPI.DeprecatedOrUnsafeBufferHandling (C)
- security.MmapWriteExec (C)
- security.PointerSub (C)
- security.PutenvStackArray (C)
- security.SetgidSetuidOrder (C)
- unix
- osx
- osx.API (C)
- osx.NumberObjectConversion (C, C++, ObjC)
- osx.ObjCProperty (ObjC)
- osx.SecKeychainAPI (C)
- osx.cocoa.AtSync (ObjC)
- osx.cocoa.AutoreleaseWrite
- osx.cocoa.ClassRelease (ObjC)
- osx.cocoa.Dealloc (ObjC)
- osx.cocoa.IncompatibleMethodTypes (ObjC)
- osx.cocoa.Loops
- osx.cocoa.MissingSuperCall (ObjC)
- osx.cocoa.NSAutoreleasePool (ObjC)
- osx.cocoa.NSError (ObjC)
- osx.cocoa.NilArg (ObjC)
- osx.cocoa.NonNilReturnValue
- osx.cocoa.ObjCGenerics (ObjC)
- osx.cocoa.RetainCount (ObjC)
- osx.cocoa.RunLoopAutoreleaseLeak
- osx.cocoa.SelfInit (ObjC)
- osx.cocoa.SuperDealloc (ObjC)
- osx.cocoa.UnusedIvars (ObjC)
- osx.cocoa.VariadicMethodTypes (ObjC)
- osx.coreFoundation.CFError (C)
- osx.coreFoundation.CFNumber (C)
- osx.coreFoundation.CFRetainRelease (C)
- osx.coreFoundation.containers.OutOfBounds (C)
- osx.coreFoundation.containers.PointerSizedValues (C)
- Fuchsia
- WebKit
- core
- Experimental Checkers
- alpha.clone
- alpha.core
- alpha.core.BoolAssignment (ObjC)
- alpha.core.C11Lock
- alpha.core.CastSize (C)
- alpha.core.CastToStruct (C, C++)
- alpha.core.Conversion (C, C++, ObjC)
- alpha.core.DynamicTypeChecker (ObjC)
- alpha.core.FixedAddr (C)
- alpha.core.PointerArithm (C)
- alpha.core.StackAddressAsyncEscape (ObjC)
- alpha.core.StdVariant (C++)
- alpha.core.TestAfterDivZero (C)
- alpha.cplusplus
- alpha.deadcode
- alpha.fuchsia
- alpha.llvm
- alpha.osx
- alpha.security
- alpha.security.cert
- alpha.unix
- alpha.WebKit
- alpha.webkit.ForwardDeclChecker
- alpha.webkit.MemoryUnsafeCastChecker
- alpha.webkit.NoUncheckedPtrMemberChecker
- alpha.webkit.NoUnretainedMemberChecker
- alpha.webkit.UnretainedLambdaCapturesChecker
- alpha.webkit.UncountedCallArgsChecker
- alpha.webkit.UncheckedCallArgsChecker
- alpha.webkit.UnretainedCallArgsChecker
- alpha.webkit.UncountedLocalVarsChecker
- alpha.webkit.UncheckedLocalVarsChecker
- alpha.webkit.UnretainedLocalVarsChecker
- webkit.RetainPtrCtorAdoptChecker
- Debug Checkers
Models core language features and contains general-purpose checkers such as division by zero, null pointer dereference, usage of uninitialized values, etc. These checkers must be always switched on as other checker rely on them.
Finds undefined behavior caused by the bitwise left- and right-shift operator operating on integer types.
By default, this checker only reports situations when the right operand is either negative or larger than the bit width of the type of the left operand; these are logically unsound.
Moreover, if the pedantic mode is activated by -analyzer-config core.BitwiseShift:Pedantic=true
, then this checker also reports situations where the _left_ operand of a shift operator is negative or overflow occurs during the right shift of a signed value. (Most compilers handle these predictably, but the C standard and the C++ standards before C++20 say that they're undefined behavior. In the C++20 standard these constructs are well-defined, so activating pedantic mode in C++20 has no effect.)
Examples
static_assert(sizeof(int) == 4, "assuming 32-bit int") void basic_examples(int a, int b) { if (b < 0) { b = a << b; // warn: right operand is negative in left shift } elseif (b >= 32) { b = a >> b; // warn: right shift overflows the capacity of 'int' } } intpedantic_examples(int a, int b) { if (a < 0) { return a >> b; // warn: left operand is negative in right shift } a = 1000u << 31; // OK, overflow of unsigned value is well-defined, a == 0if (b > 10) { a = b << 31; // this is undefined before C++20, but the checker doesn't// warn because it doesn't know the exact value of b } return1000 << 31; // warn: this overflows the capacity of 'int' }
Solution
Ensure the shift operands are in proper range before shifting.
Check for logical errors for function calls and Objective-C message expressions (e.g., uninitialized arguments, null function pointers).
.. literalinclude:: checkers/callandmessage_example.c :language: objc
Check for division by zero.
.. literalinclude:: checkers/dividezero_example.c :language: c
Check for dereferences of fixed addresses.
A pointer contains a fixed address if it was set to a hard-coded value or it becomes otherwise obvious that at that point it can have only a single fixed numerical value.
voidtest1() { int*p= (int*)0x020; intx=p[0]; // warn } voidtest2(int*p) { if (p== (int*)-1) *p=0; // warn } voidtest3() { int (*p_function)(char, char); p_function= (int (*)(char, char))0x04080; intx= (*p_function)('x', 'y'); // NO warning yet at functon pointer calls }
If the analyzer option suppress-dereferences-from-any-address-space
is set to true (the default value), then this checker never reports dereference of pointers with a specified address space. If the option is set to false, then reports from the specific x86 address spaces 256, 257 and 258 are still suppressed, but fixed address dereferences from other address spaces are reported.
Check for null pointers passed as arguments to a function whose arguments are references or marked with the 'nonnull' attribute.
intf(int *p) __attribute__((nonnull)); voidtest(int *p) { if (!p) f(p); // warn }
Check for dereferences of null pointers.
// Cvoidtest(int *p) { if (p) return; int x = p[0]; // warn } // Cvoidtest(int *p) { if (!p) *p = 0; // warn } // C++ class C { public: int x; }; voidtest() { C *pc = 0; int k = pc->x; // warn } // Objective-C@interfaceMyClass { @publicint x; } @endvoidtest() { MyClass *obj = 0; obj->x = 1; // warn }
Null pointer dereferences of pointers with address spaces are not always defined as error. Specifically on x86/x86-64 target if the pointer address space is 256 (x86 GS Segment), 257 (x86 FS Segment), or 258 (x86 SS Segment), a null dereference is not defined as error. See X86/X86-64 Language Extensions for reference.
If the analyzer option suppress-dereferences-from-any-address-space
is set to true (the default value), then this checker never reports dereference of pointers with a specified address space. If the option is set to false, then reports from the specific x86 address spaces 256, 257 and 258 are still suppressed, but null dereferences from other address spaces are reported.
Check that addresses to stack memory do not escape the function.
charconst*p; voidtest() { charconststr[] ="string"; p=str; // warn } void*test() { return__builtin_alloca(12); // warn } voidtest() { staticint*x; inty; x=&y; // warn }
Check for undefined results of binary operators.
voidtest() { intx; inty=x+1; // warn: left operand is garbage }
Check for declarations of Variable Length Arrays (VLA) of undefined, zero or negative size.
voidtest() { intx; intvla1[x]; // warn: garbage as size } voidtest() { intx=0; intvla2[x]; // warn: zero size }
The checker also gives warning if the TaintPropagation checker is switched on and an unbound, attacker controlled (tainted) value is used to define the size of the VLA.
voidtaintedVLA(void) { intx; scanf("%d", &x); intvla[x]; // Declared variable-length array (VLA) has tainted (attacker controlled) size, that can be 0 or negative } voidtaintedVerfieidVLA(void) { intx; scanf("%d", &x); if (x<1) return; intvla[x]; // no-warning. The analyzer can prove that x must be positive. }
Check for uninitialized values used as array subscripts.
voidtest() { inti, a[10]; intx=a[i]; // warn: array subscript is undefined }
Check for assigning uninitialized values.
voidtest() { intx; x |= 1; // warn: left expression is uninitialized }
Check for uninitialized values used as branch conditions.
voidtest() { intx; if (x) // warnreturn; }
Check for blocks that capture uninitialized values.
voidtest() { intx; ^{ inty=x; }(); // warn }
Check for uninitialized values being returned to the caller.
inttest() { intx; returnx; // warn }
Check if the element count in new[] is garbage or undefined.
voidtest() { int n; int *arr = newint[n]; // warn: Element count in new[] is a garbage valuedelete[] arr; }
C++ Checkers.
Reports destructions of arrays of polymorphic objects that are destructed as their base class. If the dynamic type of the array is different from its static type, calling delete[] is undefined.
This checker corresponds to the SEI CERT rule EXP51-CPP: Do not delete an array through a pointer of the incorrect type.
classBase { public:virtual~Base() {} }; classDerived : publicBase {}; Base *create() { Base *x = new Derived[10]; // note: Casting from 'Derived' to 'Base' herereturn x; } voidfoo() { Base *x = create(); delete[] x; // warn: Deleting an array of 'Derived' objects as their base class 'Base' is undefined }
Limitations
The checker does not emit note tags when casting to and from reference types, even though the pointer values are tracked across references.
voidfoo() { Derived *d = new Derived[10]; Derived &dref = *d; Base &bref = static_cast<Base&>(dref); // no note Base *b = &bref; delete[] b; // warn: Deleting an array of 'Derived' objects as their base class 'Base' is undefined }
Check for inner pointers of C++ containers used after re/deallocation.
Many container methods in the C++ standard library are known to invalidate "references" (including actual references, iterators and raw pointers) to elements of the container. Using such references after they are invalidated causes undefined behavior, which is a common source of memory errors in C++ that this checker is capable of finding.
The checker is currently limited to std::string
objects and doesn't recognize some of the more sophisticated approaches to passing unowned pointers around, such as std::string_view
.
voidderef_after_assignment() { std::string s = "llvm"; constchar *c = s.data(); // note: pointer to inner buffer of 'std::string' obtained here s = "clang"; // note: inner buffer of 'std::string' reallocated by call to 'operator='consume(c); // warn: inner pointer of container used after re/deallocation } constchar *return_temp(int x) { returnstd::to_string(x).c_str(); // warn: inner pointer of container used after re/deallocation// note: pointer to inner buffer of 'std::string' obtained here// note: inner buffer of 'std::string' deallocated by call to destructor }
Find use-after-move bugs in C++. This includes method calls on moved-from objects, assignment of a moved-from object, and repeated move of a moved-from object.
structA { voidfoo() {} }; voidf1() { A a; A b = std::move(a); // note: 'a' became 'moved-from' here a.foo(); // warn: method call on a 'moved-from' object 'a' } voidf2() { A a; A b = std::move(a); A c(std::move(a)); // warn: move of an already moved-from object } voidf3() { A a; A b = std::move(a); b = a; // warn: copy of moved-from object }
The checker option WarnOn
controls on what objects the use-after-move is checked:
- The most strict value is
KnownsOnly
, in this mode only objects are checked whose type is known to be move-unsafe. These include most STL objects (but excluding move-safe ones) and smart pointers. - With option value
KnownsAndLocals
local variables (of any type) are additionally checked. The idea behind this is that local variables are usually not tempting to be re-used so an use after move is more likely a bug than with member variables. - With option value
All
any use-after move condition is checked on all kinds of variables, excluding global variables and known move-safe cases.
Default value is KnownsAndLocals
.
Calls of methods named empty()
or isEmpty()
are allowed on moved-from objects because these methods are considered as move-safe. Functions called reset()
, destroy()
, clear()
, assign
, resize
, shrink
are treated as state-reset functions and are allowed on moved-from objects, these make the object valid again. This applies to any type of object (not only STL ones).
Check for double-free and use-after-free problems. Traces memory managed by new/delete.
Custom allocation/deallocation functions can be defined using :ref:`ownership attributes<analyzer-ownership-attrs>`.
.. literalinclude:: checkers/newdelete_example.cpp :language: cpp
Check for memory leaks. Traces memory managed by new/delete.
Custom allocation/deallocation functions can be defined using :ref:`ownership attributes<analyzer-ownership-attrs>`.
voidtest() { int *p = newint; } // warn
Check if default placement new is provided with pointers to sufficient storage capacity.
#include<new>voidf() { short s; long *lp = ::new (&s) long; // warn }
Checks C++ copy and move assignment operators for self assignment.
Checks std::string operations.
Checks if the cstring pointer from which the std::string
object is constructed is NULL
or not. If the checker cannot reason about the nullness of the pointer it will assume that it was non-null to satisfy the precondition of the constructor.
This checker is capable of checking the SEI CERT C++ coding rule STR51-CPP. Do not attempt to create a std::string from a null pointer.
#include<string>voidf(constchar *p) { if (!p) { std::string msg(p); // warn: The parameter must not be null } }
When virtual methods are called during construction and destruction the polymorphism is restricted to the class that's being constructed or destructed because the more derived contexts are either not yet initialized or already destructed.
This checker reports situations where this restricted polymorphism causes a call to a pure virtual method, which is undefined behavior. (See also the related checker :ref:`optin-cplusplus-VirtualCall` which reports situations where the restricted polymorphism affects a call and the called method is not pure virtual – but may be still surprising for the programmer.)
structA { virtualintgetKind() = 0; A() { // warn: This calls the pure virtual method A::getKind().log << "Constructing " << getKind(); } virtual~A() { releaseResources(); } voidreleaseResources() { // warn: This can call the pure virtual method A::getKind() when this is// called from the destructor.callSomeFunction(getKind()); } };
Dead Code Checkers.
Check for values stored to variables that are never read afterwards.
voidtest() { intx; x=1; // warn }
The WarnForDeadNestedAssignments
option enables the checker to emit warnings for nested dead assignments. You can disable with the -analyzer-config deadcode.DeadStores:WarnForDeadNestedAssignments=false
. Defaults to true.
Would warn for this e.g.: if ((y = make_int())) { }
Checkers (mostly Objective C) that warn for null pointer passing and dereferencing errors.
Warns when a null pointer is passed to a pointer which has a _Nonnull type.
if (name != nil) return; // Warning: nil passed to a callee that requires a non-null 1st parameterNSString *greeting = [@"Hello "stringByAppendingString:name];
Warns when a null pointer is returned from a function that has _Nonnull return type.
- (nonnull id)firstChild { id result = nil; if ([_children count] > 0) result = _children[0]; // Warning: nil returned from a method that is expected// to return a non-null valuereturn result; }
Warns when a null pointer is returned from a function annotated with __attribute__((returns_nonnull))
int global; __attribute__((returns_nonnull)) void* getPtr(void* p); void* getPtr(void* p) { if (p) { // forgot to negate the conditionreturn &global; } // Warning: nullptr returned from a function that is expected// to return a non-null valuereturn p; }
Warns when a nullable pointer is dereferenced.
struct LinkedList { int data; struct LinkedList *next; }; struct LinkedList * _Nullable getNext(struct LinkedList *l); voidupdateNextData(struct LinkedList *list, int newData) { struct LinkedList *next = getNext(list); // Warning: Nullable pointer is dereferenced next->data = 7; }
Warns when a nullable pointer is passed to a pointer which has a _Nonnull type.
typedefstruct Dummy { int val; } Dummy; Dummy *_Nullable returnsNullable(); voidtakesNonnull(Dummy *_Nonnull); voidtest() { Dummy *p = returnsNullable(); takesNonnull(p); // warn }
Warns when a nullable pointer is returned from a function that has _Nonnull return type.
Checkers for portability, performance, optional security and coding style specific rules.
Check for integer to enumeration casts that would produce a value with no corresponding enumerator. This is not necessarily undefined behavior, but can lead to nasty surprises, so projects may decide to use a coding standard that disallows these "unusual" conversions.
Note that no warnings are produced when the enum type (e.g. std::byte) has no enumerators at all.
enum WidgetKind { A=1, B, C, X=99 }; voidfoo() { WidgetKind c = static_cast<WidgetKind>(3); // OK WidgetKind x = static_cast<WidgetKind>(99); // OK WidgetKind d = static_cast<WidgetKind>(4); // warn }
Limitations
This checker does not accept the coding pattern where an enum type is used to store combinations of flag values:
enum AnimalFlags { HasClaws = 1, CanFly = 2, EatsFish = 4, Endangered = 8 }; AnimalFlags operator|(AnimalFlags a, AnimalFlags b) { returnstatic_cast<AnimalFlags>(static_cast<int>(a) | static_cast<int>(b)); } auto flags = HasClaws | CanFly;
Projects that use this pattern should not enable this optin checker.
This checker reports uninitialized fields in objects created after a constructor call. It doesn't only find direct uninitialized fields, but rather makes a deep inspection of the object, analyzing all of its fields' subfields. The checker regards inherited fields as direct fields, so one will receive warnings for uninitialized inherited data members as well.
// With Pedantic and CheckPointeeInitialization set to truestructA { structB { int x; // note: uninitialized field 'this->b.x'// note: uninitialized field 'this->bptr->x'int y; // note: uninitialized field 'this->b.y'// note: uninitialized field 'this->bptr->y' }; int *iptr; // note: uninitialized pointer 'this->iptr' B b; B *bptr; char *cptr; // note: uninitialized pointee 'this->cptr'A (B *bptr, char *cptr) : bptr(bptr), cptr(cptr) {} }; voidf() { A::B b; char c; A a(&b, &c); // warning: 6 uninitialized fields// after the constructor call } // With Pedantic set to false and// CheckPointeeInitialization set to true// (every field is uninitialized)structA { structB { int x; int y; }; int *iptr; B b; B *bptr; char *cptr; A (B *bptr, char *cptr) : bptr(bptr), cptr(cptr) {} }; voidf() { A::B b; char c; A a(&b, &c); // no warning } // With Pedantic set to true and// CheckPointeeInitialization set to false// (pointees are regarded as initialized)structA { structB { int x; // note: uninitialized field 'this->b.x'int y; // note: uninitialized field 'this->b.y' }; int *iptr; // note: uninitialized pointer 'this->iptr' B b; B *bptr; char *cptr; A (B *bptr, char *cptr) : bptr(bptr), cptr(cptr) {} }; voidf() { A::B b; char c; A a(&b, &c); // warning: 3 uninitialized fields// after the constructor call }
Options
This checker has several options which can be set from command line (e.g. -analyzer-config optin.cplusplus.UninitializedObject:Pedantic=true
):
Pedantic
(boolean). If to false, the checker won't emit warnings for objects that don't have at least one initialized field. Defaults to false.NotesAsWarnings
(boolean). If set to true, the checker will emit a warning for each uninitialized field, as opposed to emitting one warning per constructor call, and listing the uninitialized fields that belongs to it in notes. Defaults to false.CheckPointeeInitialization
(boolean). If set to false, the checker will not analyze the pointee of pointer/reference fields, and will only check whether the object itself is initialized. Defaults to false.IgnoreRecordsWithField
(string). If supplied, the checker will not analyze structures that have a field with a name or type name that matches the given pattern. Defaults to "".
When virtual methods are called during construction and destruction the polymorphism is restricted to the class that's being constructed or destructed because the more derived contexts are either not yet initialized or already destructed.
Although this behavior is well-defined, it can surprise the programmer and cause unintended behavior, so this checker reports calls that appear to be virtual calls but can be affected by this restricted polymorphism.
Note that situations where this restricted polymorphism causes a call to a pure virtual method (which is definitely invalid, triggers undefined behavior) are reported by another checker::ref:`cplusplus-PureVirtualCall` and this checker does not report them.
structA { virtualintgetKind(); A() { // warn: This calls A::getKind() even if we are constructing an instance// of a different class that is derived from A.log << "Constructing " << getKind(); } virtual~A() { releaseResources(); } voidreleaseResources() { // warn: This can be called within ~A() and calls A::getKind() even if// we are destructing a class that is derived from A.callSomeFunction(getKind()); } };
Checks MPI code.
voidtest() { doublebuf=0; MPI_RequestsendReq1; MPI_Ireduce(MPI_IN_PLACE, &buf, 1, MPI_DOUBLE, MPI_SUM, 0, MPI_COMM_WORLD, &sendReq1); } // warn: request 'sendReq1' has no matching wait.voidtest() { doublebuf=0; MPI_RequestsendReq; MPI_Isend(&buf, 1, MPI_DOUBLE, 0, 0, MPI_COMM_WORLD, &sendReq); MPI_Irecv(&buf, 1, MPI_DOUBLE, 0, 0, MPI_COMM_WORLD, &sendReq); // warnMPI_Isend(&buf, 1, MPI_DOUBLE, 0, 0, MPI_COMM_WORLD, &sendReq); // warnMPI_Wait(&sendReq, MPI_STATUS_IGNORE); } voidmissingNonBlocking() { intrank=0; MPI_Comm_rank(MPI_COMM_WORLD, &rank); MPI_RequestsendReq1[10][10][10]; MPI_Wait(&sendReq1[1][7][9], MPI_STATUS_IGNORE); // warn }
Check that NSLocalizedString macros include a comment for context.
- (void)test { NSString *string = NSLocalizedString(@"LocalizedString", nil); // warnNSString *string2 = NSLocalizedString(@"LocalizedString", @""); // warnNSString *string3 = NSLocalizedStringWithDefaultValue( @"LocalizedString", nil, [[NSBundlealloc] init], nil,@""); // warn }
Warns about uses of non-localized NSStrings passed to UI methods expecting localized NSStrings.
NSString *alarmText = NSLocalizedString(@"Enabled", @"Indicates alarm is turned on"); if (!isEnabled) { alarmText = @"Disabled"; } UILabel *alarmStateLabel = [[UILabel alloc] init]; // Warning: User-facing text should use localized string macro [alarmStateLabel setText:alarmText];
Check for performance anti-patterns when using Grand Central Dispatch.
Check for excessively padded structs.
This checker detects structs with excessive padding, which can lead to wasted memory thus decreased performance by reducing the effectiveness of the processor cache. Padding bytes are added by compilers to align data accesses as some processors require data to be aligned to certain boundaries. On others, unaligned data access are possible, but impose significantly larger latencies.
To avoid padding bytes, the fields of a struct should be ordered by decreasing by alignment. Usually, its easier to think of the sizeof
of the fields, and ordering the fields by sizeof
would usually also lead to the same optimal layout.
In rare cases, one can use the #pragma pack(1)
directive to enforce a packed layout too, but it can significantly increase the access times, so reordering the fields is usually a better solution.
// warn: Excessive padding in 'struct NonOptimal' (35 padding bytes, where 3 is optimal)structNonOptimal { char c1; // 7 bytes of padding std::int64_t big1; // 8 byteschar c2; // 7 bytes of padding std::int64_t big2; // 8 byteschar c3; // 7 bytes of padding std::int64_t big3; // 8 byteschar c4; // 7 bytes of padding std::int64_t big4; // 8 byteschar c5; // 7 bytes of padding }; static_assert(sizeof(NonOptimal) == 4*8+5+5*7); // no-warning: The fields are nicely aligned to have the minimal amount of padding bytes.structOptimal { std::int64_t big1; // 8 bytes std::int64_t big2; // 8 bytes std::int64_t big3; // 8 bytes std::int64_t big4; // 8 byteschar c1; char c2; char c3; char c4; char c5; // 3 bytes of padding }; static_assert(sizeof(Optimal) == 4*8+5+3); // no-warning: Bit packing representation is also accepted by this checker, but// it can significantly increase access times, so prefer reordering the fields. #pragma pack(1) structBitPacked { char c1; std::int64_t big1; // 8 byteschar c2; std::int64_t big2; // 8 byteschar c3; std::int64_t big3; // 8 byteschar c4; std::int64_t big4; // 8 byteschar c5; }; static_assert(sizeof(BitPacked) == 4*8+5);
The AllowedPad
option can be used to specify a threshold for the number padding bytes raising the warning. If the number of padding bytes of the struct and the optimal number of padding bytes differ by more than the threshold value, a warning will be raised.
By default, the AllowedPad
threshold is 24 bytes.
To override this threshold to e.g. 4 bytes, use the -analyzer-config optin.performance.Padding:AllowedPad=4
option.
Finds implementation-defined behavior in UNIX/Posix functions.
Checkers implementing taint analysis.
Taint analysis identifies potential security vulnerabilities where the attacker can inject malicious data to the program to execute an attack (privilege escalation, command injection, SQL injection etc.).
The malicious data is injected at the taint source (e.g. getenv()
call) which is then propagated through function calls and being used as arguments of sensitive operations, also called as taint sinks (e.g. system()
call).
One can defend against this type of vulnerability by always checking and sanitizing the potentially malicious, untrusted user input.
The goal of the checker is to discover and show to the user these potential taint source-sink pairs and the propagation call chain.
The most notable examples of taint sources are:
- data from network
- files or standard input
- environment variables
- data from databases
Let us examine a practical example of a Command Injection attack.
// Command Injection Vulnerability Exampleintmain(intargc, char**argv) { charcmd[2048] ="/bin/cat "; charfilename[1024]; printf("Filename:"); scanf (" %1023[^\n]", filename); // The attacker can inject a shell escape herestrcat(cmd, filename); system(cmd); // Warning: Untrusted data is passed to a system call }
The program prints the content of any user specified file. Unfortunately the attacker can execute arbitrary commands with shell escapes. For example with the following input the ls command is also executed after the contents of /etc/shadow is printed. Input: /etc/shadow ; ls /
The analysis implemented in this checker points out this problem.
One can protect against such attack by for example checking if the provided input refers to a valid file and removing any invalid user input.
// No vulnerability anymore, but we still get the warningvoidsanitizeFileName(char*filename){ if (access(filename,F_OK)){// Verifying user inputprintf("File does not exist\n"); filename[0]='\0'; } } intmain(intargc, char**argv) { charcmd[2048] ="/bin/cat "; charfilename[1024]; printf("Filename:"); scanf (" %1023[^\n]", filename); // The attacker can inject a shell escape heresanitizeFileName(filename);// filename is safe after this pointif (!filename[0]) return-1; strcat(cmd, filename); system(cmd); // Superfluous Warning: Untrusted data is passed to a system call }
Unfortunately, the checker cannot discover automatically that the programmer have performed data sanitation, so it still emits the warning.
One can get rid of this superfluous warning by telling by specifying the sanitation functions in the taint configuration file (see :doc:`user-docs/TaintAnalysisConfiguration`).
Filters: - Name: sanitizeFileNameArgs: [0]
The clang invocation to pass the configuration file location:
clang --analyze -Xclang -analyzer-config -Xclang optin.taint.TaintPropagation:Config=`pwd`/taint_config.yml ...
If you are validating your inputs instead of sanitizing them, or don't want to mention each sanitizing function in our configuration, you can use a more generic approach.
Introduce a generic no-op csa_mark_sanitized(..) function to tell the Clang Static Analyzer that the variable is safe to be used on that analysis path.
// Marking sanitized variables safe.// No vulnerability anymore, no warning.// User csa_mark_sanitize function is for the analyzer only#ifdef__clang_analyzer__voidcsa_mark_sanitized(constvoid*); #endifintmain(intargc, char**argv) { charcmd[2048] ="/bin/cat "; charfilename[1024]; printf("Filename:"); scanf (" %1023[^\n]", filename); if (access(filename,F_OK)){// Verifying user inputprintf("File does not exist\n"); return-1; } #ifdef__clang_analyzer__csa_mark_sanitized(filename); // Indicating to CSA that filename variable is safe to be used after this point#endifstrcat(cmd, filename); system(cmd); // No warning }
Similarly to the previous example, you need to define a Filter function in a YAML configuration file and add the csa_mark_sanitized function.
Filters: - Name: csa_mark_sanitizedArgs: [0]
Then calling csa_mark_sanitized(X) will tell the analyzer that X is safe to be used after this point, because its contents are verified. It is the responsibility of the programmer to ensure that this verification was indeed correct. Please note that csa_mark_sanitized function is only declared and used during Clang Static Analysis and skipped in (production) builds.
Further examples of injection vulnerabilities this checker can find.
voidtest() { charx=getchar(); // 'x' marked as taintedsystem(&x); // warn: untrusted data is passed to a system call } // note: compiler internally checks if the second param to// sprintf is a string literal or not.// Use -Wno-format-security to suppress compiler warning.voidtest() { chars[10], buf[10]; fscanf(stdin, "%s", s); // 's' marked as taintedsprintf(buf, s); // warn: untrusted data used as a format string }
There are built-in sources, propagations and sinks even if no external taint configuration is provided.
- Default sources:
_IO_getc
,fdopen
,fopen
,freopen
,get_current_dir_name
,getch
,getchar
,getchar_unlocked
,getwd
,getcwd
,getgroups
,gethostname
,getlogin
,getlogin_r
,getnameinfo
,gets
,gets_s
,getseuserbyname
,readlink
,readlinkat
,scanf
,scanf_s
,socket
,wgetch
- Default propagations rules:
atoi
,atol
,atoll
,basename
,dirname
,fgetc
,fgetln
,fgets
,fnmatch
,fread
,fscanf
,fscanf_s
,index
,inflate
,isalnum
,isalpha
,isascii
,isblank
,iscntrl
,isdigit
,isgraph
,islower
,isprint
,ispunct
,isspace
,isupper
,isxdigit
,memchr
,memrchr
,sscanf
,getc
,getc_unlocked
,getdelim
,getline
,getw
,memcmp
,memcpy
,memmem
,memmove
,mbtowc
,pread
,qsort
,qsort_r
,rawmemchr
,read
,recv
,recvfrom
,rindex
,strcasestr
,strchr
,strchrnul
,strcasecmp
,strcmp
,strcspn
,strncasecmp
,strncmp
,strndup
,strndupa
,strpbrk
,strrchr
,strsep
,strspn
,strstr
,strtol
,strtoll
,strtoul
,strtoull
,tolower
,toupper
,ttyname
,ttyname_r
,wctomb
,wcwidth
- Default sinks:
printf
,setproctitle
,system
,popen
,execl
,execle
,execlp
,execv
,execvp
,execvP
,execve
,dlopen
Please note that there are no built-in filter functions.
One can configure their own taint sources, sinks, and propagation rules by providing a configuration file via checker option optin.taint.TaintPropagation:Config
. The configuration file is in YAML format. The taint-related options defined in the config file extend but do not override the built-in sources, rules, sinks. The format of the external taint configuration file is not stable, and could change without any notice even in a non-backward compatible way.
For a more detailed description of configuration options, please see the :doc:`user-docs/TaintAnalysisConfiguration`. For an example see :ref:`clangsa-taint-configuration-example`.
Configuration
- Config Specifies the name of the YAML configuration file. The user can define their own taint sources and sinks.
Related Guidelines
- CWE Data Neutralization Issues
- SEI Cert STR02-C. Sanitize data passed to complex subsystems
- SEI Cert ENV33-C. Do not call system()
- ENV03-C. Sanitize the environment when invoking external programs
Limitations
- The taintedness property is not propagated through function calls which are unknown (or too complex) to the analyzer, unless there is a specific propagation rule built-in to the checker or given in the YAML configuration file. This causes potential true positive findings to be lost.
This checker warns for cases when the size
parameter of the malloc
, calloc
, realloc
, alloca
or the size parameter of the array new C++ operator is tainted (potentially attacker controlled). If an attacker can inject a large value as the size parameter, memory exhaustion denial of service attack can be carried out.
The analyzer emits warning only if it cannot prove that the size parameter is within reasonable bounds (<= SIZE_MAX/4
). This functionality partially covers the SEI Cert coding standard rule INT04-C.
You can silence this warning either by bound checking the size
parameter, or by explicitly marking the size
parameter as sanitized. See the :ref:`optin-taint-GenericTaint` checker for an example.
Custom allocation/deallocation functions can be defined using :ref:`ownership attributes<analyzer-ownership-attrs>`.
voidvulnerable(void) { size_tsize=0; scanf("%zu", &size); int*p=malloc(size); // warn: malloc is called with a tainted (potentially attacker controlled) valuefree(p); } voidnot_vulnerable(void) { size_tsize=0; scanf("%zu", &size); if (1024<size) return; int*p=malloc(size); // No warning expected as the the user input is boundfree(p); } voidvulnerable_cpp(void) { size_tsize=0; scanf("%zu", &size); int*ptr=newint[size];// warn: Memory allocation function is called with a tainted (potentially attacker controlled) valuedelete[] ptr; }
This checker warns when the denominator in a division operation is a tainted (potentially attacker controlled) value. If the attacker can set the denominator to 0, a runtime error can be triggered. The checker warns when the denominator is a tainted value and the analyzer cannot prove that it is not 0. This warning is more pessimistic than the :ref:`core-DivideZero` checker which warns only when it can prove that the denominator is 0.
intvulnerable(intn) { size_tsize=0; scanf("%zu", &size); returnn / size; // warn: Division by a tainted value, possibly zero } intnot_vulnerable(intn) { size_tsize=0; scanf("%zu", &size); if (!size) return0; returnn / size; // no warning }
Security related checkers.
Report out of bounds access to memory that is before the start or after the end of the accessed region (array, heap-allocated region, string literal etc.). This usually means incorrect indexing, but the checker also detects access via the operators *
and ->
.
voidtest_underflow(intx) { intbuf[100][100]; if (x<0) buf[0][x] =1; // warn } voidtest_overflow() { intbuf[100]; int*p=buf+100; *p=1; // warn }
If checkers like :ref:`unix-Malloc` or :ref:`cplusplus-NewDelete` are enabled to model the behavior of malloc()
, operator new
and similar allocators), then this checker can also reports out of bounds access to dynamically allocated memory:
int *test_dynamic() { int *mem = newint[100]; mem[-1] = 42; // warnreturn mem; }
In uncertain situations (when the checker can neither prove nor disprove that overflow occurs), the checker assumes that the the index (more precisely, the memory offeset) is within bounds.
However, if :ref:`optin-taint-GenericTaint` is enabled and the index/offset is tainted (i.e. it is influenced by an untrusted souce), then this checker reports the potential out of bounds access:
voidtest_with_tainted_index() { chars[] ="abc"; intx=getchar(); charc=s[x]; // warn: potential out of bounds access with tainted index }
Note
This checker is an improved and renamed version of the checker that was previously known as alpha.security.ArrayBoundV2
. The old checker alpha.security.ArrayBound
was removed when the (previously "experimental") V2 variant became stable enough for regular use.
Corresponds to SEI CERT Rules ENV31-C and ENV34-C.
- ENV31-C: Rule is about the possible problem with
main
function's third argument, environment pointer, "envp". When environment array is modified using some modification function such asputenv
,setenv
or others, It may happen that memory is reallocated, however "envp" is not updated to reflect the changes and points to old memory region. - ENV34-C: Some functions return a pointer to a statically allocated buffer. Consequently, subsequent call of these functions will invalidate previous pointer. These functions include:
getenv
,localeconv
,asctime
,setlocale
,strerror
intmain(intargc, constchar*argv[], constchar*envp[]) { if (setenv("MY_NEW_VAR", "new_value", 1) !=0) { // setenv call may invalidate 'envp'/* Handle error */ } if (envp!=NULL) { for (size_ti=0; envp[i] !=NULL; ++i) { puts(envp[i]); // envp may no longer point to the current environment// this program has unanticipated behavior, since envp// does not reflect changes made by setenv function. } } return0; } voidprevious_call_invalidation() { char*p, *pp; p=getenv("VAR"); setenv("SOMEVAR", "VALUE", /*overwrite = */1); // call to 'setenv' may invalidate p*p; // dereferencing invalid pointer }
The InvalidatingGetEnv
option is available for treating getenv
calls as invalidating. When enabled, the checker issues a warning if getenv
is called multiple times and their results are used without first creating a copy. This level of strictness might be considered overly pedantic for the commonly used getenv
implementations.
To enable this option, use: -analyzer-config security.cert.env.InvalidPtr:InvalidatingGetEnv=true
.
By default, this option is set to false.
When this option is enabled, warnings will be generated for scenarios like the following:
char*p=getenv("VAR"); char*pp=getenv("VAR2"); // assumes this call can invalidate `env`strlen(p); // warns about accessing invalid ptr
Warn on using a floating point value as a loop counter (CERT: FLP30-C, FLP30-CPP).
voidtest() { for (floatx=0.1f; x <= 1.0f; x+=0.1f) {} // warn }
Warn on uses of functions whose return values must be always checked.
voidtest() { setuid(1); // warn }
Warn on uses of the 'bcmp' function.
voidtest() { bcmp(ptr0, ptr1, n); // warn }
Warn on uses of the 'bcopy' function.
voidtest() { bcopy(src, dst, n); // warn }
Warn on uses of the 'bzero' function.
voidtest() { bzero(ptr, n); // warn }
Warn on uses of the 'getpw' function.
voidtest() { charbuff[1024]; getpw(2, buff); // warn }
Warn on uses of the 'gets' function.
voidtest() { charbuff[1024]; gets(buff); // warn }
Warn when 'mkstemp' is passed fewer than 6 X's in the format string.
voidtest() { mkstemp("XX"); // warn }
Warn on uses of the mktemp
function.
voidtest() { char*x=mktemp("/tmp/zxcv"); // warn: insecure, use mkstemp }
Warn on uses of inferior random number generating functions (only if arc4random function is available): drand48, erand48, jrand48, lcong48, lrand48, mrand48, nrand48, random, rand_r
.
voidtest() { random(); // warn }
Warn on uses of the strcpy
and strcat
functions.
voidtest() { charx[4]; char*y="abcd"; strcpy(x, y); // warn }
Warn on uses of the 'vfork' function.
voidtest() { vfork(); // warn }
Warn on occurrences of unsafe or deprecated buffer handling functions, which now have a secure variant: sprintf, fprintf, vsprintf, scanf, wscanf, fscanf, fwscanf, vscanf, vwscanf, vfscanf, vfwscanf, sscanf, swscanf, vsscanf, vswscanf, swprintf, snprintf, vswprintf, vsnprintf, memcpy, memmove, strncpy, strncat, memset
voidtest() { charbuf [5]; strncpy(buf, "a", 1); // warn }
Warn on mmap()
calls with both writable and executable access.
voidtest(intn) { void*c=mmap(NULL, 32, PROT_READ | PROT_WRITE | PROT_EXEC, MAP_PRIVATE | MAP_ANON, -1, 0); // warn: Both PROT_WRITE and PROT_EXEC flags are set. This can lead to// exploitable memory regions, which could be overwritten with malicious// code }
Check for pointer subtractions on two pointers pointing to different memory chunks. According to the C standard §6.5.6 only subtraction of pointers that point into (or one past the end) the same array object is valid (for this purpose non-array variables are like arrays of size 1). This checker only searches for different memory objects at subtraction, but does not check if the array index is correct. Furthermore, only cases are reported where stack-allocated objects are involved (no warnings on pointers to memory allocated by malloc).
voidtest() { inta, b, c[10], d[10]; intx=&c[3] -&c[1]; x=&d[4] -&c[1]; // warn: 'c' and 'd' are different arraysx= (&a+1) -&a; x=&b-&a; // warn: 'a' and 'b' are different variables } structS { intx[10]; inty[10]; }; voidtest1() { structSa[10]; structSb; intd=&a[4] -&a[6]; d=&a[0].x[3] -&a[0].x[1]; d=a[0].y-a[0].x; // warn: 'S.b' and 'S.a' are different objectsd= (char*)&b.y- (char*)&b.x; // warn: different members of the same objectd= (char*)&b.y- (char*)&b; // warn: object of type S is not the same array as a member of it }
There may be existing applications that use code like above for calculating offsets of members in a structure, using pointer subtractions. This is still undefined behavior according to the standard and code like this can be replaced with the offsetof macro.
Finds calls to the putenv
function which pass a pointer to a stack-allocated (automatic) array as the argument. Function putenv
does not copy the passed string, only a pointer to the data is stored and this data can be read even by other threads. Content of a stack-allocated array is likely to be overwritten after exiting from the function.
The problem can be solved by using a static array variable or dynamically allocated memory. Even better is to avoid using putenv
(it has other problems related to memory leaks) and use setenv
instead.
The check corresponds to CERT rule POS34-C. Do not call putenv() with a pointer to an automatic variable as the argument.
intf() { charenv[] ="NAME=value"; returnputenv(env); // putenv function should not be called with stack-allocated string }
There is one case where the checker can report a false positive. This is when the stack-allocated array is used at putenv in a function or code branch that does not return (process is terminated on all execution paths).
Another special case is if the putenv is called from function main. Here the stack is deallocated at the end of the program and it should be no problem to use the stack-allocated string (a multi-threaded program may require more attention). The checker does not warn for cases when stack space of main is used at the putenv call.
When dropping user-level and group-level privileges in a program by using setuid
and setgid
calls, it is important to reset the group-level privileges (with setgid
) first. Function setgid
will likely fail if the superuser privileges are already dropped.
The checker checks for sequences of setuid(getuid())
and setgid(getgid())
calls (in this order). If such a sequence is found and there is no other privilege-changing function call (seteuid
, setreuid
, setresuid
and the GID versions of these) in between, a warning is generated. The checker finds only exactly setuid(getuid())
calls (and the GID versions), not for example if the result of getuid()
is stored in a variable.
voidtest1() { // ...// end of section with elevated privileges// reset privileges (user and group) to normal userif (setuid(getuid()) !=0) { handle_error(); return; } if (setgid(getgid()) !=0) { // warning: A 'setgid(getgid())' call following a 'setuid(getuid())' call is likely to failhandle_error(); return; } // user-ID and group-ID are reset to normal user now// ... }
In the code above the problem is that setuid(getuid())
removes superuser privileges before setgid(getgid())
is called. To fix the problem the setgid(getgid())
should be called first. Further attention is needed to avoid code like setgid(getuid())
(this checker does not detect bugs like this) and always check the return value of these calls.
This check corresponds to SEI CERT Rule POS36-C.
POSIX/Unix checkers.
Check calls to various UNIX/Posix functions: open, pthread_once, calloc, malloc, realloc, alloca
.
.. literalinclude:: checkers/unix_api_example.c :language: c
Check for calls to blocking functions inside a critical section. Blocking functions detected by this checker: sleep, getc, fgets, read, recv
. Critical section handling functions modeled by this checker: lock, unlock, pthread_mutex_lock, pthread_mutex_trylock, pthread_mutex_unlock, mtx_lock, mtx_timedlock, mtx_trylock, mtx_unlock, lock_guard, unique_lock
.
voidpthread_lock_example(pthread_mutex_t*m) { pthread_mutex_lock(m); // note: entering critical section heresleep(10); // warn: Call to blocking function 'sleep' inside of critical sectionpthread_mutex_unlock(m); }
voidoverlapping_critical_sections(mtx_t *m1, std::mutex &m2) { std::lock_guard lg{m2}; // note: entering critical section heremtx_lock(m1); // note: entering critical section heresleep(10); // warn: Call to blocking function 'sleep' inside of critical sectionmtx_unlock(m1); sleep(10); // warn: Call to blocking function 'sleep' inside of critical section// still inside of the critical section of the std::lock_guard }
Limitations
- The
trylock
andtimedlock
versions of acquiring locks are currently assumed to always succeed. This can lead to false positives.
voidtrylock_example(pthread_mutex_t*m) { if (pthread_mutex_trylock(m) ==0) { // assume trylock always succeedssleep(10); // warn: Call to blocking function 'sleep' inside of critical sectionpthread_mutex_unlock(m); } else { sleep(10); // false positive: Incorrect warning about blocking function inside critical section. } }
Check improper use of chroot described by SEI Cert C recommendation POS05-C. Limit access to files by creating a jail. The checker finds usage patterns where chdir("/")
is not called immediately after a call to chroot(path)
.
voidf(); voidtest_bad() { chroot("/usr/local"); f(); // warn: no call of chdir("/") immediately after chroot } voidtest_bad_path() { chroot("/usr/local"); chdir("/usr"); // warn: no call of chdir("/") immediately after chrootf(); } voidtest_good() { chroot("/usr/local"); chdir("/"); // no warningf(); }
Check for improper use of errno
. This checker implements partially CERT rule ERR30-C. Set errno to zero before calling a library function known to set errno, and check errno only after the function returns a value indicating failure. The checker can find the first read of errno
after successful standard function calls.
The C and POSIX standards often do not define if a standard library function may change value of errno
if the call does not fail. Therefore, errno
should only be used if it is known from the return value of a function that the call has failed. There are exceptions to this rule (for example strtol
) but the affected functions are not yet supported by the checker. The return values for the failure cases are documented in the standard Linux man pages of the functions and in the POSIX standard.
intunsafe_errno_read(intsock, void*data, intdata_size) { if (send(sock, data, data_size, 0) !=data_size) { // 'send' can be successful even if not all data was sentif (errno==1) { // An undefined value may be read from 'errno'return0; } } return1; }
The checker :ref:`unix-StdCLibraryFunctions` must be turned on to get the warnings from this checker. The supported functions are the same as by :ref:`unix-StdCLibraryFunctions`. The ModelPOSIX
option of that checker affects the set of checked functions.
Parameters
The AllowErrnoReadOutsideConditionExpressions
option allows read of the errno value if the value is not used in a condition (in if
statements, loops, conditional expressions, switch
statements). For example errno
can be stored into a variable without getting a warning by the checker.
intunsafe_errno_read(intsock, void*data, intdata_size) { if (send(sock, data, data_size, 0) !=data_size) { interr=errno; // warning if 'AllowErrnoReadOutsideConditionExpressions' is false// no warning if 'AllowErrnoReadOutsideConditionExpressions' is true } return1; }
Default value of this option is true
. This allows save of the errno value for possible later error handling.
Limitations
- Only the very first usage of
errno
is checked after an affected function call. Value oferrno
is not followed when it is stored into a variable or returned from a function.- Documentation of function
lseek
is not clear about what happens if the function returns different value than the expected file position but not -1. To avoid possible false-positiveserrno
is allowed to be used in this case.
Check for memory leaks, double free, and use-after-free problems. Traces memory managed by malloc()/free().
Custom allocation/deallocation functions can be defined using :ref:`ownership attributes<analyzer-ownership-attrs>`.
.. literalinclude:: checkers/unix_malloc_example.c :language: c
Check for dubious malloc
arguments involving sizeof
.
Custom allocation/deallocation functions can be defined using :ref:`ownership attributes<analyzer-ownership-attrs>`.
voidtest() { long*p=malloc(sizeof(short)); // warn: result is converted to 'long *', which is// incompatible with operand type 'short'free(p); }
Check for mismatched deallocators.
Custom allocation/deallocation functions can be defined using :ref:`ownership attributes<analyzer-ownership-attrs>`.
.. literalinclude:: checkers/mismatched_deallocator_example.cpp :language: c
Check for proper usage of vfork
.
inttest(intx) { pid_tpid=vfork(); // warnif (pid!=0) return0; switch (x) { case0: pid=1; execl("", "", 0); _exit(1); break; case1: x=0; // warn: this assignment is prohibitedbreak; case2: foo(); // warn: this function call is prohibitedbreak; default: return0; // warn: return is prohibited } while(1); }
Check the size argument passed into C string functions for common erroneous patterns. Use -Wno-strncat-size
compiler option to mute other strncat
-related compiler warnings.
voidtest() { chardest[3]; strncat(dest, """""""""""""""""""""""""*", sizeof(dest)); // warn: potential buffer overflow }
Check for arguments which are not null-terminated strings; applies to the strlen
, strcpy
, strcat
, strcmp
family of functions.
Only very fundamental cases are detected where the passed memory block is absolutely different from a null-terminated string. This checker does not find if a memory buffer is passed where the terminating zero character is missing.
voidtest1() { intl=strlen((char*)&test1); // warn } voidtest2() { label: intl=strlen((char*)&&label); // warn }
Check for null pointers being passed as arguments to C string functions: strlen, strnlen, strcpy, strncpy, strcat, strncat, strcmp, strncmp, strcasecmp, strncasecmp, wcslen, wcsnlen
.
inttest() { returnstrlen(0); // warn }
Check for calls of standard library functions that violate predefined argument constraints. For example, according to the C standard the behavior of function int isalnum(int ch)
is undefined if the value of ch
is not representable as unsigned char
and is not equal to EOF
.
You can think of this checker as defining restrictions (pre- and postconditions) on standard library functions. Preconditions are checked, and when they are violated, a warning is emitted. Postconditions are added to the analysis, e.g. that the return value of a function is not greater than 255. Preconditions are added to the analysis too, in the case when the affected values are not known before the call.
For example, if an argument to a function must be in between 0 and 255, but the value of the argument is unknown, the analyzer will assume that it is in this interval. Similarly, if a function mustn't be called with a null pointer and the analyzer cannot prove that it is null, then it will assume that it is non-null.
- These are the possible checks on the values passed as function arguments:
- The argument has an allowed range (or multiple ranges) of values. The checker can detect if a passed value is outside of the allowed range and show the actual and allowed values.
- The argument has pointer type and is not allowed to be null pointer. Many (but not all) standard functions can produce undefined behavior if a null pointer is passed, these cases can be detected by the checker.
- The argument is a pointer to a memory block and the minimal size of this buffer is determined by another argument to the function, or by multiplication of two arguments (like at function
fread
), or is a fixed value (for exampleasctime_r
requires at least a buffer of size 26). The checker can detect if the buffer size is too small and in optimal case show the size of the buffer and the values of the corresponding arguments.
#defineEOF -1 voidtest_alnum_concrete(intv) { intret=isalnum(256); // \ // warning: Function argument outside of allowed range (void)ret; } voidbuffer_size_violation(FILE*file) { enum { BUFFER_SIZE=1024 }; wchar_twbuf[BUFFER_SIZE]; constsize_tsize=sizeof(*wbuf); // 4constsize_tnitems=sizeof(wbuf); // 4096// Below we receive a warning because the 3rd parameter should be the// number of elements to read, not the size in bytes. This case is a known// vulnerability described by the ARR38-C SEI-CERT rule.fread(wbuf, size, nitems, file); } inttest_alnum_symbolic(intx) { intret=isalnum(x); // after the call, ret is assumed to be in the range [-1, 255]if (ret>255) // impossible (infeasible branch)if (x==0) returnret / x; // division by zero is not reportedreturnret; }
Additionally to the argument and return value conditions, this checker also adds state of the value errno
if applicable to the analysis. Many system functions set the errno
value only if an error occurs (together with a specific return value of the function), otherwise it becomes undefined. This checker changes the analysis state to contain such information. This data is used by other checkers, for example :ref:`unix-Errno`.
Limitations
The checker can not always provide notes about the values of the arguments. Without this information it is hard to confirm if the constraint is indeed violated. The argument values are shown if they are known constants or the value is determined by previous (not too complicated) assumptions.
The checker can produce false positives in cases such as if the program has invariants not known to the analyzer engine or the bug report path contains calls to unknown functions. In these cases the analyzer fails to detect the real range of the argument.
Parameters
The ModelPOSIX
option controls if functions from the POSIX standard are recognized by the checker.
With ModelPOSIX=true
, many POSIX functions are modeled according to the POSIX standard. This includes ranges of parameters and possible return values. Furthermore the behavior related to errno
in the POSIX case is often that errno
is set only if a function call fails, and it becomes undefined after a successful function call.
With ModelPOSIX=false
, this checker follows the C99 language standard and only models the functions that are described there. It is possible that the same functions are modeled differently in the two cases because differences in the standards. The C standard specifies less aspects of the functions, for example exact errno
behavior is often unspecified (and not modeled by the checker).
Default value of the option is true
.
Check C stream handling functions: fopen, fdopen, freopen, tmpfile, fclose, fread, fwrite, fgetc, fgets, fputc, fputs, fprintf, fscanf, ungetc, getdelim, getline, fseek, fseeko, ftell, ftello, fflush, rewind, fgetpos, fsetpos, clearerr, feof, ferror, fileno
.
The checker maintains information about the C stream objects (FILE *
) and can detect error conditions related to use of streams. The following conditions are detected:
- The
FILE *
pointer passed to the function is NULL (the single exception isfflush
where NULL is allowed). - Use of stream after close.
- Opened stream is not closed.
- Read from a stream after end-of-file. (This is not a fatal error but reported by the checker. Stream remains in EOF state and the read operation fails.)
- Use of stream when the file position is indeterminate after a previous failed operation. Some functions (like
ferror
,clearerr
,fseek
) are allowed in this state. - Invalid 3rd ("
whence
") argument tofseek
.
The stream operations are by this checker usually split into two cases, a success and a failure case. On the success case it also assumes that the current value of stdout
, stderr
, or stdin
can't be equal to the file pointer returned by fopen
. Operations performed on stdout
, stderr
, or stdin
are not checked by this checker in contrast to the streams opened by fopen
.
In the case of write operations (like fwrite
, fprintf
and even fsetpos
) this behavior could produce a large amount of unwanted reports on projects that don't have error checks around the write operations, so by default the checker assumes that write operations always succeed. This behavior can be controlled by the Pedantic
flag: With -analyzer-config unix.Stream:Pedantic=true
the checker will model the cases where a write operation fails and report situations where this leads to erroneous behavior. (The default is Pedantic=false
, where write operations are assumed to succeed.)
voidtest1() { FILE*p=fopen("foo", "r"); } // warn: opened file is never closedvoidtest2() { FILE*p=fopen("foo", "r"); fseek(p, 1, SEEK_SET); // warn: stream pointer might be NULLfclose(p); } voidtest3() { FILE*p=fopen("foo", "r"); if (p) { fseek(p, 1, 3); // warn: third arg should be SEEK_SET, SEEK_END, or SEEK_CURfclose(p); } } voidtest4() { FILE*p=fopen("foo", "r"); if (!p) return; fclose(p); fclose(p); // warn: stream already closed } voidtest5() { FILE*p=fopen("foo", "r"); if (!p) return; fgetc(p); if (!ferror(p)) fgetc(p); // warn: possible read after end-of-filefclose(p); } voidtest6() { FILE*p=fopen("foo", "r"); if (!p) return; fgetc(p); if (!feof(p)) fgetc(p); // warn: file position may be indeterminate after I/O errorfclose(p); }
Limitations
The checker does not track the correspondence between integer file descriptors and FILE *
pointers.
macOS checkers.
Check for proper uses of various Apple APIs.
voidtest() { dispatch_once_t pred = 0; dispatch_once(&pred, ^(){}); // warn: dispatch_once uses local }
Check for erroneous conversions of objects representing numbers into numbers.
NSNumber *photoCount = [albumDescriptor objectForKey:@"PhotoCount"]; // Warning: Comparing a pointer value of type 'NSNumber *'// to a scalar integer valueif (photoCount > 0) { [selfdisplayPhotos]; }
Check for proper uses of Objective-C properties.
NSNumber *photoCount = [albumDescriptor objectForKey:@"PhotoCount"]; // Warning: Comparing a pointer value of type 'NSNumber *'// to a scalar integer valueif (photoCount > 0) { [selfdisplayPhotos]; }
Check for proper uses of Secure Keychain APIs.
.. literalinclude:: checkers/seckeychainapi_example.m :language: objc
Check for nil pointers used as mutexes for @synchronized.
voidtest(id x) { if (!x) @synchronized(x) {} // warn: nil value used as mutex } voidtest() { id y; @synchronized(y) {} // warn: uninitialized value used as mutex }
Warn about potentially crashing writes to autoreleasing objects from different autoreleasing pools in Objective-C.
Check for sending 'retain', 'release', or 'autorelease' directly to a Class.
@interfaceMyClass : NSObject@endvoidtest(void) { [MyClass release]; // warn }
Warn about Objective-C classes that lack a correct implementation of -dealloc
.. literalinclude:: checkers/dealloc_example.m :language: objc
Warn about Objective-C method signatures with type incompatibilities.
@interfaceMyClass1 : NSObject - (int)foo; @end@implementationMyClass1 - (int)foo { return1; } @end@interfaceMyClass2 : MyClass1 - (float)foo; @end@implementationMyClass2 - (float)foo { return1.0; } // warn@end
Improved modeling of loops using Cocoa collection types.
Warn about Objective-C methods that lack a necessary call to super.
@interfaceTest : UIViewController@end@implementationtest - (void)viewDidLoad {} // warn@end
Warn for suboptimal uses of NSAutoreleasePool in Objective-C GC mode.
voidtest() { NSAutoreleasePool *pool = [[NSAutoreleasePoolalloc] init]; [pool release]; // warn }
Check usage of NSError parameters.
@interfaceA : NSObject - (void)foo:(NSError"""""""""""""""""""""""")error; @end@implementationA - (void)foo:(NSError"""""""""""""""""""""""")error { // warn: method accepting NSError"""""""""""""""""""""""" should have a non-void// return value } @end@interfaceA : NSObject - (BOOL)foo:(NSError"""""""""""""""""""""""")error; @end@implementationA - (BOOL)foo:(NSError"""""""""""""""""""""""")error { *error = 0; // warn: potential null dereferencereturn0; } @end
Check for prohibited nil arguments to ObjC method calls.
- caseInsensitiveCompare:
- compare:
- compare:options:
- compare:options:range:
- compare:options:range:locale:
- componentsSeparatedByCharactersInSet:
- initWithFormat:
NSComparisonResulttest(NSString *s) { NSString *aString = nil; return [s caseInsensitiveCompare:aString]; // warn: argument to 'NSString' method// 'caseInsensitiveCompare:' cannot be nil }
Models the APIs that are guaranteed to return a non-nil value.
Check for type errors when using Objective-C generics.
NSMutableArray *names = [NSMutableArrayarray]; NSMutableArray *birthDates = names; // Warning: Conversion from value of type 'NSDate *'// to incompatible type 'NSString *' [birthDates addObject: [NSDatedate]];
Check for leaks and improper reference count management
voidtest() { NSString *s = [[NSStringalloc] init]; // warn } CFStringReftest(char *bytes) { returnCFStringCreateWithCStringNoCopy( 0, bytes, NSNEXTSTEPStringEncoding, 0); // warn }
Check for leaked memory in autorelease pools that will never be drained.
Check that 'self' is properly initialized inside an initializer method.
@interfaceMyObj : NSObject { id x; } - (id)init; @end@implementationMyObj - (id)init { [superinit]; x = 0; // warn: instance variable used while 'self' is not// initializedreturn0; } @end@interfaceMyObj : NSObject - (id)init; @end@implementationMyObj - (id)init { [superinit]; return self; // warn: returning uninitialized 'self' } @end
Warn about improper use of '[super dealloc]' in Objective-C.
@interfaceSuperDeallocThenReleaseIvarClass : NSObject { NSObject *_ivar; } @end@implementationSuperDeallocThenReleaseIvarClass - (void)dealloc { [superdealloc]; [_ivar release]; // warn } @end
Warn about private ivars that are never used.
@interfaceMyObj : NSObject { @privateid x; // warn } @end@implementationMyObj@end
Check for passing non-Objective-C types to variadic collection initialization methods that expect only Objective-C types.
voidtest() { [NSSetsetWithObjects:@"Foo", "Bar", nil]; // warn: argument should be an ObjC pointer type, not 'char *' }
Check usage of CFErrorRef* parameters
voidtest(CFErrorRef*error) { // warn: function accepting CFErrorRef* should have a// non-void return } intfoo(CFErrorRef*error) { *error=0; // warn: potential null dereferencereturn0; }
Check for proper uses of CFNumber APIs.
CFNumberReftest(unsigned charx) { returnCFNumberCreate(0, kCFNumberSInt16Type, &x); // warn: 8 bit integer is used to initialize a 16 bit integer }
Check for null arguments to CFRetain/CFRelease/CFMakeCollectable.
voidtest(CFTypeRefp) { if (!p) CFRetain(p); // warn } voidtest(intx, CFTypeRefp) { if (p) return; CFRelease(p); // warn }
Checks for index out-of-bounds when using 'CFArray' API.
voidtest() { CFArrayRefA=CFArrayCreate(0, 0, 0, &kCFTypeArrayCallBacks); CFArrayGetValueAtIndex(A, 0); // warn }
Warns if 'CFArray', 'CFDictionary', 'CFSet' are created with non-pointer-size values.
voidtest() { intx[] = { 1 }; CFArrayRefA=CFArrayCreate(0, (constvoid"""""""""""""""""""""""")x, 1, &kCFTypeArrayCallBacks); // warn }
Fuchsia is an open source capability-based operating system currently being developed by Google. This section describes checkers that can find various misuses of Fuchsia APIs.
Handles identify resources. Similar to pointers they can be leaked, double freed, or use after freed. This check attempts to find such problems.
voidcheckLeak08(int tag) { zx_handle_t sa, sb; zx_channel_create(0, &sa, &sb); if (tag) zx_handle_close(sa); use(sb); // Warn: Potential leak of handlezx_handle_close(sb); }
WebKit is an open-source web browser engine available for macOS, iOS and Linux. This section describes checkers that can find issues in WebKit codebase.
Most of the checkers focus on memory management for which WebKit uses custom implementation of reference counted smartpointers.
- Checkers are formulated in terms related to ref-counting:
- Ref-counted type is either
Ref<T>
orRefPtr<T>
. - Ref-countable type is any type that implements
ref()
andderef()
methods asRefPtr<>
is a template (i. e. relies on duck typing). - Uncounted type is ref-countable but not ref-counted type.
- Ref-counted type is either
All uncounted types used as base classes must have a virtual destructor.
Ref-counted types hold their ref-countable data by a raw pointer and allow implicit upcasting from ref-counted pointer to derived type to ref-counted pointer to base type. This might lead to an object of (dynamic) derived type being deleted via pointer to the base class type which C++ standard defines as UB in case the base class doesn't have virtual destructor [expr.delete]
.
structRefCntblBase { voidref() {} voidderef() {} }; structDerived : RefCntblBase { }; // warn
Raw pointers and references to uncounted types can't be used as class members. Only ref-counted types are allowed.
structRefCntbl { voidref() {} voidderef() {} }; structFoo { RefCntbl * ptr; // warn RefCntbl & ptr; // warn// ... };
Raw pointers and references to uncounted types can't be captured in lambdas. Only ref-counted types are allowed.
structRefCntbl { voidref() {} voidderef() {} }; voidfoo(RefCntbl* a, RefCntbl& b) { [&, a](){ // warn about 'a'do_something(b); // warn about 'b' }; };
These are checkers with known issues or limitations that keep them from being on by default. They are likely to have false positives. Bug reports and especially patches are welcome.
Reports similar pieces of code.
voidlog(); intmax(inta, intb) { // warnlog(); if (a>b) returna; returnb; } intmaxClone(intx, inty) { // similar code herelog(); if (x>y) returnx; returny; }
Warn about assigning non-{0,1} values to boolean variables.
voidtest() { BOOL b = -1; // warn }
Similarly to :ref:`alpha.unix.PthreadLock <alpha-unix-PthreadLock>`, checks for the locking/unlocking of mtx_t
mutexes.
mtx_t mtx1; voidbad1(void) { mtx_lock(&mtx1); mtx_lock(&mtx1); // warn: This lock has already been acquired }
Check when casting a malloc'ed type T
, whether the size is a multiple of the size of T
.
voidtest() { int*x= (int*) malloc(11); // warn }
Check for cast from non-struct pointer to struct pointer.
// Cstructs {}; voidtest(int *p) { structs *ps = (structs *) p; // warn } // C++classc {}; voidtest(int *p) { c *pc = (c *) p; // warn }
Loss of sign/precision in implicit conversions.
voidtest(unsignedU, signedS) { if (S>10) { if (U<S) { } } if (S<-10) { if (U<S) { // warn (loss of sign) } } } voidtest() { long longA=1LL << 60; shortX=A; // warn (loss of precision) }
Check for cases where the dynamic and the static type of an object are unrelated.
id date = [NSDatedate]; // Warning: Object has a dynamic type 'NSDate *' which is// incompatible with static type 'NSNumber *'"NSNumber *number = date; [number doubleValue];
Check for assignment of a fixed address to a pointer.
voidtest() { int*p; p= (int*) 0x10000; // warn }
Check for pointer arithmetic on locations other than array elements.
voidtest() { intx; int*p; p=&x+1; // warn }
Check that addresses to stack memory do not escape the function that involves dispatch_after or dispatch_async. This checker is a part of core.StackAddressEscape
, but is temporarily disabled until some false positives are fixed.
dispatch_block_ttest_block_inside_block_async_leak() { intx=123; void (^inner)(void) = ^void(void) { inty=x; ++y; }; void (^outer)(void) = ^void(void) { intz=x; ++z; inner(); }; returnouter; // warn: address of stack-allocated block is captured by a// returned block }
Check if a value of active type is retrieved from an std::variant
instance with std::get
. In case of bad variant type access (the accessed type differs from the active type) a warning is emitted. Currently, this checker does not take exception handling into account.
voidtest() { std::variant<int, char> v = 25; char c = stg::get<char>(v); // warn: "int" is the active alternative }
Check for division by variable that is later compared against 0. Either the comparison is useless or there is division by zero.
voidtest(intx) { var=77 / x; if (x==0) { } // warn }
Reports destructions of polymorphic objects with a non-virtual destructor in their base class.
classNonVirtual {}; classNVDerived : publicNonVirtual {}; NonVirtual *create() { NonVirtual *x = newNVDerived(); // note: Casting from 'NVDerived' to// 'NonVirtual' herereturn x; } voidfoo() { NonVirtual *x = create(); delete x; // warn: destruction of a polymorphic object with no virtual// destructor }
Check for use of invalidated iterators.
voidbad_copy_assign_operator_list1(std::list &L1, const std::list &L2) { auto i0 = L1.cbegin(); L1 = L2; *i0; // warn: invalidated iterator accessed }
Check for iterators used outside their valid ranges.
voidsimple_bad_end(const std::vector &v) { auto i = v.end(); *i; // warn: iterator accessed outside of its range }
Check for use of iterators of different containers where iterators of the same container are expected.
voidbad_insert3(std::vector &v1, std::vector &v2) { v2.insert(v1.cbegin(), v2.cbegin(), v2.cend()); // warn: container accessed// using foreign// iterator argument v1.insert(v1.cbegin(), v1.cbegin(), v2.cend()); // warn: iterators of// different containers// used where the same// container is// expected v1.insert(v1.cbegin(), v2.cbegin(), v1.cend()); // warn: iterators of// different containers// used where the same// container is// expected }
Check for dereference of null smart pointers.
voidderef_smart_ptr() { std::unique_ptr<int> P; *P; // warn: dereference of a default constructed smart unique_ptr }
Check unreachable code.
// Cinttest() { int x = 1; while(x); return x; // warn } // C++voidtest() { int a = 2; while (a > 1) a--; if (a > 1) a++; // warn } // Objective-Cvoidtest(id x) { return; [x retain]; // warn }
Similarly to :ref:`alpha.unix.PthreadLock <alpha-unix-PthreadLock>`, checks for the locking/unlocking of fuchsia mutexes.
spin_lock_t mtx1; voidbad1(void) { spin_lock(&mtx1); spin_lock(&mtx1); // warn: This lock has already been acquired }
Check code for LLVM codebase conventions:
- A StringRef should not be bound to a temporary std::string whose lifetime is shorter than the StringRef's.
- Clang AST nodes should not have fields that can allocate memory.
Check for direct assignments to instance variables.
@interfaceMyClass : NSObject {} @property (readonly) id A; - (void) foo; @end@implementationMyClass - (void) foo { _A = 0; // warn } @end
Check for direct assignments to instance variables in the methods annotated with objc_no_direct_instance_variable_assignment
.
@interfaceMyClass : NSObject {} @property (readonly) id A; - (void) fAnnotated __attribute__(( annotate("objc_no_direct_instance_variable_assignment"))); - (void) fNotAnnotated; @end@implementationMyClass - (void) fAnnotated { _A = 0; // warn } - (void) fNotAnnotated { _A = 0; // no warn } @end
Check that the invalidatable instance variables are invalidated in the methods annotated with objc_instance_variable_invalidator.
@protocolInvalidation <NSObject> - (void) invalidate __attribute__((annotate("objc_instance_variable_invalidator"))); @end @interfaceInvalidationImpObj : NSObject <Invalidation> @end@interfaceSubclassInvalidationImpObj : InvalidationImpObj { InvalidationImpObj *var; } - (void)invalidate; @end@implementationSubclassInvalidationImpObj - (void) invalidate {} @end// warn: var needs to be invalidated or set to nil
Check that the invalidation methods are present in classes that contain invalidatable instance variables.
@protocolInvalidation <NSObject> - (void)invalidate __attribute__((annotate("objc_instance_variable_invalidator"))); @end @interfaceNeedInvalidation : NSObject <Invalidation> @end@interfaceMissingInvalidationMethodDecl : NSObject { NeedInvalidation *Var; // warn } @end@implementationMissingInvalidationMethodDecl@end
Warns against using one vs. many plural pattern in code when generating localized strings.
NSString *reminderText = NSLocalizedString(@"None", @"Indicates no reminders"); if (reminderCount == 1) { // Warning: Plural cases are not supported across all languages.// Use a .stringsdict file instead reminderText = NSLocalizedString(@"1 Reminder", @"Indicates single reminder"); } elseif (reminderCount >= 2) { // Warning: Plural cases are not supported across all languages.// Use a .stringsdict file instead reminderText = [NSStringstringWithFormat:NSLocalizedString(@"%@ Reminders", @"Indicates multiple reminders"), reminderCount]; }
Check for an out-of-bound pointer being returned to callers.
staticintA[10]; int*test() { int*p=A+10; returnp; // warn } inttest(void) { intx; returnx; // warn: undefined or garbage returned }
SEI CERT checkers which tries to find errors based on their C coding rules.
Simple lock -> unlock checker. Applies to: pthread_mutex_lock, pthread_rwlock_rdlock, pthread_rwlock_wrlock, lck_mtx_lock, lck_rw_lock_exclusive
lck_rw_lock_shared, pthread_mutex_trylock, pthread_rwlock_tryrdlock, pthread_rwlock_tryrwlock, lck_mtx_try_lock, lck_rw_try_lock_exclusive, lck_rw_try_lock_shared, pthread_mutex_unlock, pthread_rwlock_unlock, lck_mtx_unlock, lck_rw_done
.
pthread_mutex_tmtx; voidtest() { pthread_mutex_lock(&mtx); pthread_mutex_lock(&mtx); // warn: this lock has already been acquired } lck_mtx_tlck1, lck2; voidtest() { lck_mtx_lock(&lck1); lck_mtx_lock(&lck2); lck_mtx_unlock(&lck1); // warn: this was not the most recently acquired lock } lck_mtx_tlck1, lck2; voidtest() { if (lck_mtx_try_lock(&lck1) ==0) return; lck_mtx_lock(&lck2); lck_mtx_unlock(&lck1); // warn: this was not the most recently acquired lock }
Check for misuses of stream APIs. Check for misuses of stream APIs: fopen, fclose
(demo checker, the subject of the demo (Slides , Video) by Anna Zaks and Jordan Rose presented at the 2012 LLVM Developers' Meeting).
voidtest() { FILE*F=fopen("myfile.txt", "w"); } // warn: opened file is never closedvoidtest() { FILE*F=fopen("myfile.txt", "w"); if (F) fclose(F); fclose(F); // warn: closing a previously closed file stream }
Checks for overlap in two buffer arguments. Applies to: memcpy, mempcpy, wmemcpy, wmempcpy
.
voidtest() { inta[4] = {0}; memcpy(a+2, a+1, 8); // warn }
Check for out-of-bounds access in string functions, such as: memcpy, bcopy, strcpy, strncpy, strcat, strncat, memmove, memcmp, memset
and more.
This check also works with string literals, except there is a known bug in that the analyzer cannot detect embedded NULL characters when determining the string length.
voidtest1() { constcharstr[] ="Hello world"; charbuffer[] ="Hello world"; memcpy(buffer, str, sizeof(str) +1); // warn } voidtest2() { constcharstr[] ="Hello world"; charbuffer[] ="Helloworld"; memcpy(buffer, str, sizeof(str)); // warn }
- Check for uninitialized reads from common memory copy/manipulation functions such as:
memcpy, mempcpy, memmove, memcmp, strcmp, strncmp, strcpy, strlen, strsep
and many more.
voidtest() { charsrc[10]; chardst[5]; memcpy(dst,src,sizeof(dst)); // warn: Bytes string function accesses uninitialized/garbage values }
Limitations:
Due to limitations of the memory modeling in the analyzer, one can likely observe a lot of false-positive reports like this:
voidfalse_positive() { intsrc[] = {1, 2, 3, 4}; intdst[5] = {0}; memcpy(dst, src, 4*sizeof(int)); // false-positive:// The 'src' buffer was correctly initialized, yet we cannot conclude// that since the analyzer could not see a direct initialization of the// very last byte of the source buffer. }More details at the corresponding GitHub issue.
Check for local variables, member variables, and function arguments that are forward declared.
structObj; Obj* provide(); structFoo { Obj* ptr; // warn }; voidfoo() { Obj* obj = provide(); // warnconsume(obj); // warn }
Check for all casts from a base type to its derived type as these might be memory-unsafe.
Example:
classBase { }; classDerived : publicBase { }; voidf(Base* base) { Derived* derived = static_cast<Derived*>(base); // ERROR }
For all cast operations (C-style casts, static_cast, reinterpret_cast, dynamic_cast), if the source type a Base* and the destination type is Derived*, where Derived inherits from Base, the static analyzer should signal an error.
This applies to:
- C structs, C++ structs and classes, and Objective-C classes and protocols.
- Pointers and references.
- Inside template instantiations and macro expansions that are visible to the compiler.
For types like this, instead of using built in casts, the programmer will use helper functions that internally perform the appropriate type check and disable static analysis.
Raw pointers and references to an object which supports CheckedPtr or CheckedRef can't be used as class members. Only CheckedPtr, CheckedRef, RefPtr, or Ref are allowed.
structCheckableObj { voidincrementCheckedPtrCount() {} voiddecrementCheckedPtrCount() {} }; structFoo { CheckableObj* ptr; // warn CheckableObj& ptr; // warn// ... };
See WebKit Guidelines for Safer C++ Programming for details.
Raw pointers and references to a NS or CF object can't be used as class members or ivars. Only RetainPtr is allowed for CF types regardless of whether ARC is enabled or disabled. Only RetainPtr is allowed for NS types when ARC is disabled.
structFoo { NSObject *ptr; // warn// ... };
See WebKit Guidelines for Safer C++ Programming for details.
Raw pointers and references to NS or CF types can't be captured in lambdas. Only RetainPtr is allowed for CF types regardless of whether ARC is enabled or disabled, and only RetainPtr is allowed for NS types when ARC is disabled.
voidfoo(NSObject *a, NSObject *b) { [&, a](){ // warn about 'a'do_something(b); // warn about 'b' }; };
The goal of this rule is to make sure that lifetime of any dynamically allocated ref-countable object passed as a call argument spans past the end of the call. This applies to call to any function, method, lambda, function pointer or functor. Ref-countable types aren't supposed to be allocated on stack so we check arguments for parameters of raw pointers and references to uncounted types.
Here are some examples of situations that we warn about as they might be potentially unsafe. The logic is that either we're able to guarantee that an argument is safe or it's considered if not a bug then bug-prone.
RefCountable* provide_uncounted(); voidconsume(RefCountable*); // In these cases we can't make sure callee won't directly or indirectly call `deref()` on the argument which could make it unsafe from such point until the end of the call.voidfoo1() { consume(provide_uncounted()); // warn } voidfoo2() { RefCountable* uncounted = provide_uncounted(); consume(uncounted); // warn }
Although we are enforcing member variables to be ref-counted by webkit.NoUncountedMemberChecker any method of the same class still has unrestricted access to these. Since from a caller's perspective we can't guarantee a particular member won't get modified by callee (directly or indirectly) we don't consider values obtained from members safe.
Note: It's likely this heuristic could be made more precise with fewer false positives - for example calls to free functions that don't have any parameter other than the pointer should be safe as the callee won't be able to tamper with the member unless it's a global variable.
structFoo { RefPtr<RefCountable> member; voidconsume(RefCountable*) { /* ... */ } voidbugprone() { consume(member.get()); // warn } };
The implementation of this rule is a heuristic - we define a whitelist of kinds of values that are considered safe to be passed as arguments. If we can't prove an argument is safe it's considered an error.
Allowed kinds of arguments:
values obtained from ref-counted objects (including temporaries as those survive the call too)
RefCountable* provide_uncounted(); voidconsume(RefCountable*); voidfoo() { RefPtr<RefCountable> rc = makeRef(provide_uncounted()); consume(rc.get()); // okconsume(makeRef(provide_uncounted()).get()); // ok }
forwarding uncounted arguments from caller to callee
voidfoo(RefCountable& a) { bar(a); // ok }
Caller of
foo()
is responsible fora
's lifetime.this
pointervoidFoo::foo() { baz(this); // ok }
Caller of
foo()
is responsible for keeping the memory pointed to bythis
pointer safe.constants
foo(nullptr, NULL, 0); // ok
We also define a set of safe transformations which if passed a safe value as an input provide (usually it's the return value) a safe value (or an object that provides safe values). This is also a heuristic.
- constructors of ref-counted types (including factory methods)
- getters of ref-counted types
- member overloaded operators
- casts
- unary operators like
&
or*
The goal of this rule is to make sure that lifetime of any dynamically allocated CheckedPtr capable object passed as a call argument keeps its memory region past the end of the call. This applies to call to any function, method, lambda, function pointer or functor. CheckedPtr capable objects aren't supposed to be allocated on stack so we check arguments for parameters of raw pointers and references to unchecked types.
The rules of when to use and not to use CheckedPtr / CheckedRef are same as alpha.webkit.UncountedCallArgsChecker for ref-counted objects.
The goal of this rule is to make sure that lifetime of any dynamically allocated NS or CF objects passed as a call argument keeps its memory region past the end of the call. This applies to call to any function, method, lambda, function pointer or functor. NS or CF objects aren't supposed to be allocated on stack so we check arguments for parameters of raw pointers and references to unretained types.
The rules of when to use and not to use RetainPtr are same as alpha.webkit.UncountedCallArgsChecker for ref-counted objects.
The goal of this rule is to make sure that any uncounted local variable is backed by a ref-counted object with lifetime that is strictly larger than the scope of the uncounted local variable. To be on the safe side we require the scope of an uncounted variable to be embedded in the scope of ref-counted object that backs it.
These are examples of cases that we consider safe:
voidfoo1() { RefPtr<RefCountable> counted; // The scope of uncounted is EMBEDDED in the scope of counted. { RefCountable* uncounted = counted.get(); // ok } } voidfoo2(RefPtr<RefCountable> counted_param) { RefCountable* uncounted = counted_param.get(); // ok } voidFooClass::foo_method() { RefCountable* uncounted = this; // ok }
Here are some examples of situations that we warn about as they might be potentially unsafe. The logic is that either we're able to guarantee that a local variable is safe or it's considered unsafe.
voidfoo1() { RefCountable* uncounted = new RefCountable; // warn } RefCountable* global_uncounted; voidfoo2() { RefCountable* uncounted = global_uncounted; // warn } voidfoo3() { RefPtr<RefCountable> counted; // The scope of uncounted is not EMBEDDED in the scope of counted. RefCountable* uncounted = counted.get(); // warn }
The goal of this rule is to make sure that any unchecked local variable is backed by a CheckedPtr or CheckedRef with lifetime that is strictly larger than the scope of the unchecked local variable. To be on the safe side we require the scope of an unchecked variable to be embedded in the scope of CheckedPtr/CheckRef object that backs it.
These are examples of cases that we consider safe:
voidfoo1() { CheckedPtr<RefCountable> counted; // The scope of uncounted is EMBEDDED in the scope of counted. { RefCountable* uncounted = counted.get(); // ok } } voidfoo2(CheckedPtr<RefCountable> counted_param) { RefCountable* uncounted = counted_param.get(); // ok } voidFooClass::foo_method() { RefCountable* uncounted = this; // ok }
Here are some examples of situations that we warn about as they might be potentially unsafe. The logic is that either we're able to guarantee that a local variable is safe or it's considered unsafe.
voidfoo1() { RefCountable* uncounted = new RefCountable; // warn } RefCountable* global_uncounted; voidfoo2() { RefCountable* uncounted = global_uncounted; // warn } voidfoo3() { RefPtr<RefCountable> counted; // The scope of uncounted is not EMBEDDED in the scope of counted. RefCountable* uncounted = counted.get(); // warn }
The goal of this rule is to make sure that any NS or CF local variable is backed by a RetainPtr with lifetime that is strictly larger than the scope of the unretained local variable. To be on the safe side we require the scope of an unretained variable to be embedded in the scope of Retainptr object that backs it.
The rules of when to use and not to use RetainPtr are same as alpha.webkit.UncountedCallArgsChecker for ref-counted objects.
These are examples of cases that we consider safe:
voidfoo1() { RetainPtr<NSObject> retained; // The scope of unretained is EMBEDDED in the scope of retained. { NSObject* unretained = retained.get(); // ok } } voidfoo2(RetainPtr<NSObject> retained_param) { NSObject* unretained = retained_param.get(); // ok } voidFooClass::foo_method() { NSObject* unretained = this; // ok }
Here are some examples of situations that we warn about as they might be potentially unsafe. The logic is that either we're able to guarantee that a local variable is safe or it's considered unsafe.
voidfoo1() { NSObject* unretained = [[NSObject alloc] init]; // warn } NSObject* global_unretained; voidfoo2() { NSObject* unretained = global_unretained; // warn } voidfoo3() { RetainPtr<NSObject> retained; // The scope of unretained is not EMBEDDED in the scope of retained. NSObject* unretained = retained.get(); // warn }
The goal of this rule is to make sure the constructor of RetainPtr as well as adoptNS and adoptCF are used correctly. When creating a RetainPtr with +1 semantics, adoptNS or adoptCF should be used, and in +0 semantics, RetainPtr constructor should be used. Warn otherwise.
These are examples of cases that we consider correct:
RetainPtr ptr = adoptNS([[NSObject alloc] init]); // ok RetainPtr ptr = CGImageGetColorSpace(image); // ok
Here are some examples of cases that we consider incorrect use of RetainPtr constructor and adoptCF
RetainPtr ptr = [[NSObject alloc] init]; // warnauto ptr = adoptCF(CGImageGetColorSpace(image)); // warn
Checkers used for debugging the analyzer. :doc:`developer-docs/DebugChecks` page contains a detailed description.
Print callbacks that are called during analysis in order.
Dump config table.
Control-Flow Graphs.
Display Call Graph.
Print calls as they are traversed by the engine.
Print the dominance tree for a given CFG.
Print results of live variable analysis.
Print branch conditions as they are traversed by the engine.
Check the analyzer's understanding of expressions.
Emit warnings with analyzer statistics.
Mark tainted symbols as such.
View Control-Flow Graphs using GraphViz.
View Call Graph using GraphViz.
View Exploded Graphs using GraphViz.