Cedar policy validation against schema

Cedar policies are code, and as with all code it is possible to make mistakes that mean the code will not behave as expected. For example, the following is a well-formed Cedar policy according to syntax rules, but with a number of typos and type errors.

permit (
    principal == ExampleCo::Uzer::"12345",   // should be "User", not "Uzer"
    action == ExampleCo::Action::"ReadFile", // should be "readFile", not "ReadFile"
    resource == ExampleCo::User::"67890"     // "readFile" isn't a valid operation on a User
)
when {
    principal.isAcctive           // should be "isActive", not "isAcctive"
      &&
    principal.username > 2        // comparing a string against a number
};

Cedar can’t know whether this policy is right or wrong by examining it in isolation. For example, Cedar does not know if the policy author meant Uzer or User because both are well-formed names. If the policy were subsequently evaluated during an authorization decision, it either would not match at all due to the mistakes in the policy scope (e.g., if there are no entities of type Uzer then the first comparison would always be false), or if the scope’s errors were corrected the evaluator would return diagnostics about accessing undefined attributes and performing invalid comparisons of strings and integers. Ultimately, the policy will have no impact: policies that always evaluate to false or exhibit errors during evaluation are ignored.
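For comparison, here is a sketch of how the policy above might look once the mistakes flagged in its comments are fixed. The File entity type and the username comparison are assumptions made for illustration only; the original policy does not say which resource type or which test the author intended.

```cedar
permit (
    principal == ExampleCo::User::"12345",   // "User", not "Uzer"
    action == ExampleCo::Action::"readFile", // "readFile", not "ReadFile"
    resource == ExampleCo::File::"67890"     // hypothetical File type that supports readFile
)
when {
    principal.isActive                // "isActive", not "isAcctive"
      &&
    principal.username == "alice"     // hypothetical: compares a String against a String
};
```

Both versions are equally well-formed syntactically, which is why Cedar needs a schema to tell them apart.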

If no policy grants access, Cedar returns a default decision of DENY. However, it can be frustrating when a policy isn’t behaving as expected. To avoid this frustration, it is better to learn that a policy is invalid when you’re creating it, so mistakes can be fixed before they have an impact on your application’s operation.

This capability is provided by Cedar validation. To validate a policy, Cedar needs information about the application. It needs to know the correct names of entity types, the attributes they possess, and the allowed parent/child relationships. It also needs to know which actions are allowed, and the expected types of the principal, resource, and context components that are part of requests made with this action. All of this information is provided to Cedar by defining a schema.

This topic provides a brief overview of schemas and how they work to provide validation. For a full description of the schema format with a sample schema, see Cedar schema format.

If you change your schema, any policies that you validated before the change might no longer be valid. Those policies can then generate errors during authorization queries if you include entities that match the updated schema in your request. Policies that result in errors aren’t included in the authorization decision, possibly leading to unexpected results. Therefore, we strongly recommend that you review your policies to see which might be affected by the schema change, and edit those policies so that they accurately reflect the entities that you now include in your evaluation requests.

Topics on this page

Example of schema-based validation
Supported validation checks
Request validation expectations
Benefits of validation and schemas

Example of schema-based validation

The following is an example of a basic Cedar schema.

{
    "ExampleCo::Personnel": {
        "entityTypes": {
            "Employee": {
                "shape": {
                    "type": "Record",
                    "attributes": {
                        "name": { "type": "String" },
                        "jobLevel": { "type": "Long" },
                        "numberOfLaptops": {
                            "type": "Long",
                            "required": false
                        }
                    }
                }
            },
            "System": {}
        },
        "actions": {
            "remoteAccess": {
                "appliesTo": {
                    "principalTypes": ["Employee"],
                    "resourceTypes": ["System"]
                }
            }
        }
    }
}

This schema specifies the following:

The entities defined in this schema exist in the namespace ExampleCo::Personnel. References to those entities within policies require the namespace prefix, e.g., ExampleCo::Personnel::Employee. References within the schema, inside the same namespace declaration, need no prefix; for example, the principalTypes list contains just Employee rather than ExampleCo::Personnel::Employee.
Every entity of type Employee in the store has an attribute name with a value that is a Cedar String, an attribute jobLevel with a value that is a Cedar Long, and an optional attribute numberOfLaptops that is also a Cedar Long.
Entities of type System are expected to have no attributes.
Any authorization request from the application that specifies action Action::"remoteAccess" is expected to specify only principals that are of type Employee and resources that are of type System.

The schema can also specify the expected format of the context record for each Action. Making this specification lets Cedar also flag errors on references to context.

Consider the following policy.

permit (
    principal,
    action == ExampleCo::Personnel::Action::"remoteAccess",
    resource
)
when {
    principal.numberOfLatpops < 5 &&        // (1)
    principal.name > 3 &&                   // (2)
    principal.jobLevel == "somethingelse"   // (3)
};

The Cedar validator knows that any request that triggers evaluation of this policy must have action Action::"remoteAccess". According to our example schema, such a request must have a principal of type Employee. With this knowledge, validation will report an error or warning on each of the comparisons 1 through 3 in the when clause for the following reasons:

(1) Validation error – The policy tries to access an attribute that isn’t defined for Employee types. In this case, the error is because of a typo (numberOfLatpops instead of numberOfLaptops).
(2) Validation error – The left operand of > is of type String. However, > only accepts operands of type Long, so this comparison always raises a runtime error when it is evaluated.
(3) Validation warning – The left operand of == is always of type Long and the right operand is always a String. Because the == operator always returns false if its operands have different runtime types, this comparison always returns false. Although this comparison won’t raise a runtime error during evaluation, it probably isn’t what the policy author intended and so is flagged as a validation warning.
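As a sketch, a version of this policy that should pass validation against the example schema could look like the following. The literal values ("alice" and 5) are placeholders chosen for illustration, not values taken from the original policy. Note that fixing the typo in comparison 1 is not enough on its own: numberOfLaptops is optional, so the access also needs a has guard.

```cedar
permit (
    principal,
    action == ExampleCo::Personnel::Action::"remoteAccess",
    resource
)
when {
    principal has numberOfLaptops &&       // guard the optional attribute
    principal.numberOfLaptops < 5 &&       // (1) typo fixed; Long compared with Long
    principal.name == "alice" &&           // (2) String compared with String (placeholder value)
    principal.jobLevel >= 5                // (3) Long compared with Long (placeholder value)
};
```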

Supported validation checks

The validator compares a policy with a schema to look for inconsistencies. From these inconsistencies, the validator detects the following errors:

Unrecognized entity types – For example, misspelling File as Filee.
Unrecognized actions – For example, misspelling Action::"viewFile" as Action::"viewFiel".
Action applied to unsupported principal or resource – For example, saying a File can View a User.
Improper use of in or == – For example, stating principal == Folder::"folder-name" when a principal can’t be a Folder and you meant to write in.
Unrecognized attributes – For example, principal.jobbLevel has a typo and should be jobLevel.
Unsafe access to optional attributes – For example, principal.numberOfLaptops where numberOfLaptops is an optional attribute declared with "required": false. Such accesses should be guarded by a has check on the left side of a short-circuiting && expression. For example, principal has numberOfLaptops && principal.numberOfLaptops > 1.
Type mismatch in operators – For example, principal.jobLevel > "14" is an invalid comparison with a String.
Invalid entity literals of enumerated entity types – For example, Application::"TinyTODO" is an invalid entity literal if entity type Application is an enumerated type and TinyTodo is the only allowed EID.

The validator also looks for some suspicious situations that, while not runtime errors, are likely to be incorrect code. These are reported as the following warnings:

Cases that always evaluate to false, and thus never apply – For example, when { principal has manager && principal.manager == User::"Ethel" } always evaluates to false when the type of principal will never have the manager attribute, as made clear in the schema, so the policy can never apply. Similarly, principal is ExampleCo::Personnel::Admin always evaluates to false when the principal is always a User, and not an Admin.
Mixed script strings and identifiers – When a single string or identifier contains multiple Unicode scripts (different writing systems), it is possible for the string to appear to say something it doesn’t. For example, the Latin and Cyrillic “a” characters may appear identical in some fonts.
Bidirectional text control characters in strings and identifiers – These Unicode characters can be used to craft strings that obfuscate true control flow.
Unexpected characters in entity identifiers – While Cedar can support any string as an entity identifier, we recommend limiting them to printable ASCII characters (including spaces, but excluding tabs and new lines) and characters inside the Unicode General Security Profile for Identifiers.
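To make the first warning concrete, the following hypothetical policy is written against the example schema from earlier on this page. Because that schema gives Employee no manager attribute, the has test can never succeed, so the validator should report that the policy can never apply. The entity ID "Ethel" is illustrative only.

```cedar
permit (
    principal,
    action == ExampleCo::Personnel::Action::"remoteAccess",
    resource
)
when {
    // Employee has no "manager" attribute in the example schema,
    // so this condition is always false.
    principal has manager &&
    principal.manager == ExampleCo::Personnel::Employee::"Ethel"
};
```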

Request validation expectations

As implied by the discussion above, we expect validation to be performed before a policy is used by the authorization engine to decide authorization requests. Indeed, the Cedar authorization APIs do not perform validation at the same time that a request is evaluated. Rather, validation is an entirely separate API which can be invoked when policies are loaded or created.

We expect that all authorization requests adhere to the rules given in the schema used to validate the policies. In particular:

For a request with components PARC (principal, action, resource, context), the A component must be an action enumerated in the actions part of the schema, and the PRC components will have the types given with A in the schema. Our example schema above states that A must always be ExampleCo::Personnel::Action::"remoteAccess" (since it’s the only action given in the schema), and for this action P must be an entity of type ExampleCo::Personnel::Employee, R must be an entity of type ExampleCo::Personnel::System, and C must be the empty record {} (since no information about the context is given).
The entities used when evaluating the request must have the structure given in the entityTypes part of the schema. Our example schema above states that ExampleCo::Personnel::Employee entities have at least two attributes (name and jobLevel) and optionally have a third (numberOfLaptops), each with the types given (String, Long, and Long, respectively). Schemas may also specify the expected hierarchical relationships among entities (not shown in the example).

If these expectations are not met then a policy that the validator accepts as valid may fail with an error when evaluated, causing it to be ignored. To see why, consider the following policy, which passes validation when using our example schema.

permit (principal, action, resource)
when {
    principal.name == "superuser" ||
    principal.jobLevel > 8
};

This policy states that any principal whose name is "superuser" or whose jobLevel is greater than 8 can perform any action on any resource. According to our example schema, all principals are expected to have type Employee, which is the only principal type given for the sole action listed.

Now suppose we submitted the following authorization request:

P = ExampleCo::Personnel::Employee::"Rick"
A = ExampleCo::Personnel::Action::"remoteAccess"
R = ExampleCo::Personnel::System::"dev"
C = {}
The attributes of entity ExampleCo::Personnel::Employee::"Rick" are the record { "firstName": "Rick", "jobLevel" : "admin" }

For this request the PARC components conform to the schema, but the attributes of entity ExampleCo::Personnel::Employee::"Rick" do not: the schema prescribes that attributes name and jobLevel must be present, and that the latter is mapped to a value of type Long, but neither is true of the entity given in the request. If we evaluated the policy on this request, the policy’s when-condition expression principal.name == "superuser" would fail with a message like ExampleCo::Personnel::Employee::"Rick" does not have the required attribute: name. If we changed the entity in the request so that firstName was instead name as required by the schema, evaluation would fail on principal.jobLevel > 8 with a message like type error: expected long, got string.

By default, it is entirely up to the application to make sure that authorization requests are well-formed according to the schema’s expectations. However, Cedar provides utilities to optionally validate that a PARC request adheres to the expectations given in the schema. The following are the currently available validation utilities:

Cedar CLI
Request::new() in the Rust API

Applications can also choose to use schema-based parsing to ensure that JSON data used to describe entities and/or a request’s context C match the prescriptions of the schema. For example, schema-based parsing would catch the issue above by flagging { "firstName": "Rick", "jobLevel" : "admin" } as an invalid entity of type Employee (assuming C was created by parsing a JSON representation of the context data). If an application writer is sure that requests will always match the schema’s expectations by construction, they can elect to skip these steps.

You can think of a schema as a contract between the application and the policies: If the application provides requests and data that follow the prescriptions in the schema, then evaluating policies validated against that schema will surely avoid several classes of error. (The end of this section discusses in detail what errors are and are not precluded by validation.)

Note that this contract implies that if an application’s schema changes then so has its authorization model, i.e., the actions and/or entities it may submit to the Cedar authorization engine, and their structure. Policies still in effect may need to be revalidated to make sure they are consistent with these changes.

Benefits of validation and schemas

Performing validation before using your policies gives you a significant benefit, called validation soundness: If your policies are deemed valid, they are sure not to exhibit most errors that could arise during request evaluation, for requests that adhere to the expectations defined by the schema. We have formally proved validation soundness as part of the novel verification-guided development process we used to build Cedar. In particular, we implemented a version of the validator in the Lean programming language and theorem prover, and used automated reasoning to prove the validation soundness property. Then we performed extensive differential testing to make sure that our Rust implementation of the validator behaves the same as the Lean version does.

Validation soundness ensures the absence of most, but not all, errors that could arise during policy evaluation. The only errors that are not precluded are the following:

Errors due to integer overflow. In Cedar when you add two large Long numbers together the result may be too big to fit in 64 bits. Rather than wrap around (e.g., producing a negative number) as in many languages, Cedar throws an error. Validation does not currently attempt to detect this possibility.
Errors due to missing entities. If a policy references an entity that does not exist in the entities used to evaluate the policy, any attempt to access that entity’s attributes will fail. This could happen with an entity literal (e.g., User::"Rick".name == "rick") or with an entity passed in as a principal or resource (e.g., principal.name == "rick", or principal.manager.name == "Vijay" where principal.manager should be an entity). Request validation (RFC 11) and schema-based JSON parsing do not confirm the existence of entities.
Errors due to incorrect extension values (in non-strict mode). Extension values are constructed by calling a constructor with a string. For example, IP address values are constructed with ip(). For policies that pass non-literal strings to these functions, there is a risk that the string is not well-formed, and thus evaluating it will produce an error. For example, if we had a policy with the expression ip(principal.IPAddr) and principal.IPAddr happened to be the string "XYZ" then evaluating the policy would fail with an error. However, by default the validator runs in a strict mode that forbids passing non-literal strings to extension function constructors; in this mode, the expression above will fail to validate with the error extension constructors may not be called with non-literal expressions. An expression like ip("XYZ") in a policy will fail to validate (regardless of mode).

All other evaluation errors (as enumerated earlier) cannot occur for policies that pass validation, provided that requests and entities conform to the schema.
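One way to stay within strict mode while still testing an IP address stored on an entity is to store the value as an extension value instead of a String. The sketch below assumes a schema, not shown on this page, in which the principal type has an IPAddr attribute declared with the ipaddr extension type; the CIDR range is illustrative.

```cedar
// Assumption: the schema declares principal.IPAddr with the ipaddr
// extension type, so no constructor is called on a non-literal string.
permit (principal, action, resource)
when {
    // ip() is called only with a string literal, which the validator can check.
    principal.IPAddr.isInRange(ip("10.0.0.0/8"))
};
```

With this modeling, a malformed address is caught when the entity data is parsed against the schema, before any policy is evaluated.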

We close by noting that defining a schema is useful for purposes other than validation.

Because a schema describes the properties of an authorization system, it can serve as an input to other tooling, such as documentation generators.
Schemas can be used to generate policy editor interfaces in situations where end-users manage fine-grained rules through point-and-click selections.
Analytics engines that query a body of policies to answer questions might rely on the existence of a schema to produce the most accurate reports.
Sample solutions for authorization scenarios are typically expressed using a schema file. For example, you can answer the question “How can I model situation XYZ?” with a schema that describes the modeling approach.

Although you can get started in Cedar without using a schema, we encourage you to define and use one, especially as your project moves beyond initial prototyping and toward a production release.