Deserialization of untrusted data

Last revision (mm/dd/yy): //

Vulnerabilities Table of Contents

Description
Data which is untrusted cannot be trusted to be well formed. Malformed data or unexpected data could be used to abuse application logic, deny service, or execute arbitrary code, when deserialized.

Consequences


 * Availability: The logic of deserialization could be abused to create recursive object graphs or never provide data expected to terminate reading.
 * Authorization: Potentially code could make assumptions that information in the deserialized object about the data is valid. Functions which make this dangerous assumption could be exploited.
 * Access control (instruction processing): malicious objects can abuse the logic of custom deserializers in order to affect code execution.

Exposure period


 * Requirements specification: A deserialization library could be used which provides a cryptographic framework to seal serialized data.
 * Implementation: Not using the safe deserialization/serializing data features of a language can create data integrity problems.
 * Implementation: Not using the protection accessor functions of an object can cause data integrity problems
 * Implementation: Not protecting your objects from default overloaded functions - which may provide for raw output streams of objects - may cause data confidentiality problems.
 * Implementation: Not making fields transient can often cause data confidentiality problems.

Platform


 * Languages: C, C++, Java, Python, Ruby (and probably others)
 * Operating platforms: Any

Required resources

Any

Severity

High

Likelihood of exploit

Medium

It is often convenient to serialize objects for convenient communication or to save them for later use. However, deserialized data or code can often be modified without using the provided accessor functions if it does not use cryptography to protect itself. Furthermore, any cryptography would still be client-side security - which is of course a dangerous security assumption.

An attempt to serialize and then deserialize a class containing transient fields will result in NULLs where the non-transient data should be. This is an excellent way to prevent time, environment-based, or sensitive variables from being carried over and used improperly.

Risk Factors

 * Does the deserialization take place before authentication?
 * Does the deserialization limit which types can be deserialized?
 * Does the deserialization host have types available which can be repurposed towards malicious ends? Sometimes, these types are called "gadgets", considering their similarity to abusable bits of code that already exist in machine code in Return-Oriented-Programming attacks.

Examples
The following is an example from Adobe's BlazeDS AMF deserialization vulnerability (CVE-2011-2092). You can specify arbitrary classes and properties for a BlazeDS application to deserialize. This particular payload creates an instance of a JFrame object on the target server. The created JFrame object will have a "defaultCloseOperation" of value 3 -- which indicates that the JVM should exit when this JFrame window is closed.

[RemoteClass(alias="javax.swing.JFrame")] public class JFrame { public var title:String = "Gotcha!"; public var defaultCloseOperation:int = 3; public var visible:Boolean = true; }

The next example is one that is much more likely to be seen in custom code. This code reads an object from an untrusted source, and then casts it to an AcmeObject:

InputStream is = request.getInputStream; ObjectInputStream ois = new ObjectInputStream(is); AcmeObject acme = (AcmeObject)ois.readObject;

Unfortunately, the casting operation to AcmeObject occurs after the deserialization process ends. Therefore, it's not useful in preventing any attacks that happen during deserialization from occurring. It's possible that behavior in custom deserialization protocols (for instance, by overriding Serializable#readObject in Java) can be re-purposed towards malicious ends. Researchers have found complex object graphs which, when deserialized, can lead to remote code execution in most Java software.

The final example is a denial-of-service attack against any Java application that allows deserialization. The HashSet called "root" in the following code sample has members that are recursively linked to each other. When deserializing this "root" object, the JVM will begin creating a recursive object graph. It will never complete, and consume CPU indefinitely.

Set root = new HashSet; Set s1 = root; Set s2 = new HashSet; for (int i = 0; i < 100; i++) { Set t1 = new HashSet; Set t2 = new HashSet; t1.add("foo"); // make it not equal to t2 s1.add(t1); s1.add(t2); s2.add(t1); s2.add(t2); s1 = t1; s2 = t2; }

Related Controls

 * Requirements specification: A deserialization library could be used which provides a cryptographic framework to seal serialized data.
 * Implementation: Use the signing features of a language to assure that deserialized data has not been tainted.
 * Implementation: When deserializing data, populate a new object rather than just deserializing. The result is that the data flows through safe input validation and that the functions are safe.
 * Implementation: Explicitly define final readObject to prevent deserialization.

An example of this is:

private final void readObject(ObjectInputStream in) throws java.io.IOException { throw new java.io.IOException("Cannot be deserialized"); }


 * Implementation: Make fields transient to protect them from deserialization.
 * Implementation: In your code, override the ObjectInputStream#resolveClass method to prevent arbitrary classes from being deserialized. This safe behavior can be wrapped in a library like SerialKiller.
 * Implementation: Use a safe replacement for the generic readObject method as seen here. Note that this addresses "billion laughs" type attacks by checking input length and number of objects deserialized.
 * Implementation: Use a Java agent to override the internals of ObjectInputStream to prevent exploitation of known dangerous types as seen in rO0 and NotSoSerial