Schema.org is a col­lab­o­ra­tive project between the search engine providers Google, Microsoft, Yahoo, and Yandex. The goal of the ini­tia­tive is to stan­dard­ize semantic an­no­ta­tion for website markup. Its af­fil­i­at­ed website contains the Schema.org markup: a com­pre­hen­sive vo­cab­u­lary that allows ad­di­tion­al in­for­ma­tion to be entered in HTML code. This machine-readable content makes it possible for semantic contexts to then be read out by programs like web crawlers. This concept, known as struc­tured data, comprises the basis of the semantic web. Schema.org’s markup language supports the formats RDFa, Microdata, and JSON-LD.

Three reasons for Schema.org

Struc­tured data is becoming more and more important for web owners. Unlike human visitors, computer programs need a cor­re­spond­ing markup language in order to process a website’s semantic contexts. Struc­tured data plays a crucial rule when it comes to indexing, but website owners can also benefit from semantic notation. Content that is labeled in a unified matter enables search engines to tar­get­ed­ly extract in­for­ma­tion and display this in their own formats on search engine result pages (SERPs). The most important aspect for this process is a feature called Rich Snippets. This name refers to the bits of in­for­ma­tion that expand search results, gen­er­at­ing more attention in the process. In order to guarantee uni­for­mi­ty for these semantic an­no­ta­tions (which use many different data types), Schema.org supplies users with stan­dard­ized schemata, known as ‘types’. These types have the ability to then be further defined with a specified set of ‘prop­er­ties’. Website owners have three major ad­van­tages when they use Schema.org’s markup.

  • Stan­dard­iza­tion: Schema.org’s markup is based on an agreement between the world’s largest search engine providers. Content that is annotated according to Schema.org is es­sen­tial­ly being labeled with a markup that is largely con­sid­ered to be today’s de facto standard.
  • Com­pat­i­bil­i­ty: stan­dard­ized types and support from the most commonly used data formats–RDFa, Microdata, and JSON-LD –help ensure for maximum com­pat­i­bil­i­ty of semantic an­no­ta­tion. Content marked up according to Schema.org is ready to be processed by the internet’s most common web clients.
  • Val­i­da­tion: a unified standard offers the advantage of quick, un­com­pli­cat­ed error analyses. Tagging according to Schema.org with Microdata, RDFa, and JSON-LD can be easily validated with Google’s free struc­tured data testing tool.

Schema.org type hi­er­ar­chies

Schema.org’s markup is a system composed of types and prop­er­ties. While types are able to assign content elements in order to label these entities as clearly defined items, prop­er­ties are used to further define these types with certain values. Schema.org provides a hi­er­ar­chi­cal­ly struc­tured set of types specif­i­cal­ly for this purpose; the site also lays out which prop­er­ties are able to be assigned to which types.

The spectrum of types that items can be assigned to ranges from broadly defined de­f­i­n­i­tions, like ‘Thing’, to more specified types, like ‘Movie’. Each type, with the exception of the un­spec­i­fied category, ‘Thing’, has one or more parent types.

Thing > Cre­ative­Work > Movie

While ‘Thing’ labels an un­spe­cif­ic item, the type, ‘Cre­ative­Work’ labels a more specific aspect related to the category its name describes (e.g. books, pho­tog­ra­phy, or movies). The type, ‘Movie’, is a sub­cat­e­go­ry of the type ‘Cre­ative­Work’, which in turn is a sub­cat­e­go­ry of ‘Thing’. This hierarchy shows that specific types are inherent to the prop­er­ties of their parent types. An item labeled as ‘Movie’ can be paired with the specific prop­er­ties of a given movie: movies have actors, directors, trailers, etc. These are all pos­si­bil­i­ties for further defining ‘Movie’.

Prop­er­ties of the parent type ‘Cre­ative­Work’ also apply to the type ‘Movie’. Like all forms of sto­ry­telling, movies have an author, a genre, and reviews. The type ‘Movie’ also receives the prop­er­ties of the parent type ‘Thing’: name, de­scrip­tion, URL, and image. All prop­er­ties can be labeled with in­di­vid­ual values according to the item that’s been assigned to them.

Types and prop­er­ties defined on Schema.org are equally used by RDFa, Microdata, and JSON-LD. However, the way in which the markup is in­cor­po­rat­ed into a website varies greatly depending on which data format is used.

Choosing a data format

Once you’ve decided to use Schema.org’s markup vo­cab­u­lary, the next step is to choose a data format. A detailed an­no­ta­tion de­scrip­tion with RDFa, Microdata, and JSON-LD based on the stan­dard­ized types and prop­er­ties of Schema.org markups is laid out for you in the following articles:

Go to Main Menu