Generating Semantic Convention libraries
The code for OpenTelemetry Semantic Conventions defined in this repository can be auto-generated.
OpenTelemetry Language SIGs can generate Semantic Conventions code in the form that’s idiomatic for their language and may (or may not) ship it as a stand-alone library.
This document outlines common patterns and provides non-normative guidance on how to structure semantic conventions artifacts and generate the code.
Stability and Versioning
Semantic Conventions contain a mix of stability levels. Language SIGs that ship a semantic conventions library may decide to ship a stable artifact with the stable part of the Semantic Conventions, a preview artifact with all Semantic Conventions, or another combination that’s idiomatic for the language and provides SemVer 2.0 stability guarantees.
Possible solutions include:
- Generate all Semantic Conventions for a given version in a specific folder while keeping old versions intact. This approach is used by opentelemetry-go but could be problematic if the artifact size is a concern.
- Follow language-specific conventions to annotate experimental parts. For example, Semantic Conventions in Python puts experimental attributes in the opentelemetry.semconv._incubating import path, which is considered (following the Python underscore convention) to be internal and subject to change.
- Ship two different artifacts: one that contains stable Semantic Conventions and another one with all available conventions. For example, semantic-conventions in Java are shipped in two artifacts: opentelemetry-semconv and opentelemetry-semconv-incubating.
Note: Shipping two versions of the same artifact (stable and preview) could be problematic due to diamond-dependency problems. For example, if a user application depends on semconv v1.0.0-preview and some library brings a transitive dependency on semconv v1.1.0 that does not contain experimental conventions, the latter would be resolved, leading to compilation or runtime issues in the application.
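The resolution problem described in the note can be illustrated with a small sketch. It assumes a resolver that picks the highest requested version (actual strategies vary by build tool) and uses a simplified SemVer ordering. The exported names and the `precedence` and `resolve` helpers are hypothetical and exist only for illustration.

```python
# Hypothetical contents of two semconv artifact versions.
EXPORTS = {
    "1.0.0-preview": {"HTTP_REQUEST_METHOD", "EXPERIMENTAL_ATTRIBUTE"},
    "1.1.0": {"HTTP_REQUEST_METHOD"},  # stable only, no experimental conventions
}


def precedence(version):
    """Simplified SemVer ordering: a pre-release sorts before its release."""
    core, _, pre = version.partition("-")
    major, minor, patch = (int(part) for part in core.split("."))
    return (major, minor, patch, pre == "", pre)


def resolve(requested):
    """Assumed 'highest version wins' strategy; real resolvers differ."""
    return max(requested, key=precedence)


# The application asks for the preview; a library transitively asks for 1.1.0.
resolved = resolve(["1.0.0-preview", "1.1.0"])

# Names the application may rely on that the resolved artifact does not provide.
missing = EXPORTS["1.0.0-preview"] - EXPORTS[resolved]
print(resolved, sorted(missing))
```

Any name left in `missing` is a symbol the application compiled against but cannot find at runtime, which is the failure mode the note warns about.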
Instrumentation libraries should depend on the stable (part of) semantic convention artifact or copy relevant definitions into their own code base. Experimental semantic conventions are intended for end-user applications.
Deprecated Conventions
It’s recommended to generate code for deprecated attributes, metrics, and other conventions. Use appropriate annotations to mark them as deprecated.
Conventions have a stability property, which provides the stability level at deprecation time (experimental or stable), and a deprecated property, which describes the deprecation reason and can be used to generate documentation.
- Deprecated conventions that reached stability should not be removed without a major version update according to SemVer.
- Conventions that were deprecated while being experimental should still be generated and kept in the preview (part of) semantic conventions artifact. This minimizes runtime issues and breaking changes in user applications.
Keep stable convention definitions inside the preview (part of) semantic conventions artifact. This prevents user code from breaking when a semantic convention stabilizes. Deprecate stable definitions inside the preview artifact and point users to the stable location in generated documentation.
For example, in Java, the attribute http.request.method is defined as deprecated in the preview artifact (io.opentelemetry.semconv.incubating.HttpIncubatingAttributes.HTTP_REQUEST_METHOD) and as stable in the stable artifact (io.opentelemetry.semconv.HttpAttributes.HTTP_REQUEST_METHOD).
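As a rough illustration of the same idea in Python, the sketch below keeps a stable definition and a deprecated copy that points users to the stable location. The class names mirror the Java example, but the `_DeprecatedConstant` descriptor and the warning text are assumptions made for this sketch, not the output of any actual generator.

```python
import warnings


class HttpAttributes:
    """Stand-in for the stable artifact."""

    HTTP_REQUEST_METHOD = "http.request.method"


class _DeprecatedConstant:
    """Hypothetical descriptor: returns the value and warns on every access."""

    def __init__(self, value, reason):
        self._value = value
        self._reason = reason

    def __get__(self, instance, owner):
        warnings.warn(self._reason, DeprecationWarning, stacklevel=2)
        return self._value


class HttpIncubatingAttributes:
    """Stand-in for the preview artifact: the stabilized name stays, deprecated."""

    HTTP_REQUEST_METHOD = _DeprecatedConstant(
        HttpAttributes.HTTP_REQUEST_METHOD,
        "Deprecated in the preview artifact: use HttpAttributes.HTTP_REQUEST_METHOD.",
    )


# Existing user code keeps working and is told where the stable definition lives.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    value = HttpIncubatingAttributes.HTTP_REQUEST_METHOD

print(value, [str(w.message) for w in caught])
```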
Semantic Conventions Artifact Structure
This section contains suggestions on how to structure semantic convention artifact(s).
- Artifact name:
  - opentelemetry-semconv - stable conventions
  - opentelemetry-semconv-incubating - (if applicable) the preview artifact containing all (stable and experimental) conventions
- Namespace: opentelemetry.semconv and opentelemetry.semconv.incubating
- All supported Schema URLs should be listed to allow different instrumentations in the same application to provide the exact version of conventions they follow.
- Attributes, metrics, and other convention definitions should be grouped by the convention type and the root namespace. See the example below:
├── SchemaUrls.code
├── attributes
│   ├── ClientAttributes.code
│   ├── HttpAttributes.code
│   └── ...
├── metrics
│   ├── HttpMetrics.code
│   └── ...
└── events
    └── ...
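To make the layout above more concrete, here is a minimal Python sketch of what the generated definitions might contain. The class names follow the tree above. The enum member names and the specific versions listed are illustrative assumptions, and the comments indicate which hypothetical file each definition would live in.

```python
from enum import Enum


# SchemaUrls: every supported Schema URL is listed (versions are illustrative).
class SchemaUrls(Enum):
    V1_25_0 = "https://opentelemetry.io/schemas/1.25.0"
    V1_26_0 = "https://opentelemetry.io/schemas/1.26.0"


# attributes/HttpAttributes: grouped by convention type and root namespace.
class HttpAttributes:
    HTTP_REQUEST_METHOD = "http.request.method"


# attributes/ClientAttributes
class ClientAttributes:
    CLIENT_ADDRESS = "client.address"


# metrics/HttpMetrics
class HttpMetrics:
    HTTP_SERVER_REQUEST_DURATION = "http.server.request.duration"


# An instrumentation can state exactly which version of the conventions it follows.
instrumentation_schema_url = SchemaUrls.V1_26_0.value
print(instrumentation_schema_url)
```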
Generating semantic conventions
This section describes how to do code-generation with weaver.
[!IMPORTANT] We’re transitioning from build-tools to opentelemetry-weaver to generate code for semantic conventions. All new code-generation should be done using weaver; build-tools may become incompatible with future versions of semantic conventions.
Code-generation is based on YAML definitions in a specific version of semantic conventions. Usually, it involves several steps, some of which can be semi-automated:
1. Manually update the Semantic Conventions version in config
2. Add the new Schema URL to the list of supported versions
   - If it’s not automated, then it can, at least, be automatically checked.
3. Check out (or download) the new version of Semantic Conventions
4. Run code-generation script (see below for the details)
5. Fix lint violations in the auto-generated code (if any)
6. Send the PR with new code to the corresponding repository
Here are examples of how steps 2-5 are implemented for Python and Erlang.
Step 4 (running code generation) depends on language-specific customizations. It’s also the only step that’s affected by tooling migration.
Check out the weaver code-generation documentation for more details.
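The automated check mentioned for the Schema URL step could look roughly like the following sketch. The config format (a `semconv_version = ...` line) and the `missing_schema_url` helper are assumptions made for illustration; each repository stores its version and Schema URL list differently.

```python
import re
from typing import List, Optional

SCHEMA_URL_PREFIX = "https://opentelemetry.io/schemas/"


def missing_schema_url(config_text: str, supported_urls: List[str]) -> Optional[str]:
    """Return the Schema URL that still needs to be added, or None if it is listed.

    Assumes the config contains a line such as 'semconv_version = 1.26.0'.
    """
    match = re.search(
        r"^semconv_version\s*=\s*v?(\d+\.\d+\.\d+)\s*$", config_text, re.MULTILINE
    )
    if match is None:
        raise ValueError("semconv_version not found in config")
    expected = SCHEMA_URL_PREFIX + match.group(1)
    return None if expected in supported_urls else expected


config = "semconv_version = 1.26.0\n"
supported = ["https://opentelemetry.io/schemas/1.25.0"]
print(missing_schema_url(config, supported))
```

A check like this could run in CI so that a version bump without the matching Schema URL fails early.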
Migrating from build-tools
Migration from build-tools involves changing Jinja templates and adding a weaver config file.
Weaver config
Here’s a simplified example of this file that generates all attributes.
params:
  excluded_namespaces: [ios, aspnetcore, signalr, android, dotnet, jvm, kestrel]

templates:
  - pattern: semantic_attributes.j2
    filter: >
      semconv_grouped_attributes({
        "exclude_root_namespace": $excluded_namespaces
      })
      | map({
          root_namespace: .root_namespace,
          attributes: .attributes,
          output: $output + "attributes/"
        })
    application_mode: each
You can configure language-specific parameters in the params section of the config or pass them with -DparamName=value arguments when running the weaver command from the code generation script (similarly to build-tools).
Weaver is able to run code-generation for multiple templates (defined in the corresponding section) at once.
Before executing Jinja, weaver lets you filter or process semantic convention definitions in the filter section of each template. In this example, it uses the semconv_grouped_attributes filter, a helper method that groups attribute definitions by root namespace and excludes attributes not relevant to this language. You can write alternative or additional filters and massage semantic conventions data using JQ.
In certain cases, calling semconv_grouped_attributes with namespace exclusion and stability filters may be enough and post-processing is not necessary.
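To give an intuition for what the grouping filter produces, the following Python sketch approximates its effect on a flat list of attribute names. It is not weaver's implementation; `group_attributes` is a hypothetical helper, and the real filter operates on full attribute definitions rather than bare names.

```python
from collections import defaultdict


def group_attributes(attribute_names, excluded_namespaces):
    """Group attribute names by root namespace, skipping excluded namespaces."""
    groups = defaultdict(list)
    for name in attribute_names:
        root = name.split(".", 1)[0]  # "http.request.method" -> "http"
        if root in excluded_namespaces:
            continue
        groups[root].append(name)
    return [
        {"root_namespace": root, "attributes": sorted(names)}
        for root, names in sorted(groups.items())
    ]


grouped = group_attributes(
    ["http.request.method", "jvm.memory.type", "client.address", "http.route"],
    excluded_namespaces={"jvm"},
)
for group in grouped:
    print(group["root_namespace"], group["attributes"])
```

Each resulting group carries the `root_namespace` and `attributes` fields that the `map` step in the config above passes on to the template.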
The application_mode: each setting configures weaver to run code generation for each semantic convention group and, as a consequence, to generate code for each group in a different file. The application mode single is also supported and applies the template to all groups at once.
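The difference between the two application modes can be pictured with a short sketch. The `render` function below is a stand-in for a Jinja template, and the output file names are hypothetical; only the mode names `each` and `single` come from the config.

```python
def render(groups):
    """Stand-in for a template: emits one constant per attribute."""
    lines = []
    for group in groups:
        for name in group["attributes"]:
            lines.append(f'{name.upper().replace(".", "_")} = "{name}"')
    return "\n".join(lines)


def generate(groups, application_mode):
    """Return a mapping of output file name to generated text."""
    if application_mode == "each":
        # The template runs once per group, so each group gets its own file.
        return {
            f"{group['root_namespace']}_attributes.py": render([group])
            for group in groups
        }
    if application_mode == "single":
        # The template runs once and sees all groups together.
        return {"attributes.py": render(groups)}
    raise ValueError(f"unsupported application mode: {application_mode}")


groups = [
    {"root_namespace": "client", "attributes": ["client.address"]},
    {"root_namespace": "http", "attributes": ["http.request.method", "http.route"]},
]
print(sorted(generate(groups, "each")))
print(sorted(generate(groups, "single")))
```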
See weaver code-generation docs for the details on the config, data schema, JQ filters, and more.
Jinja templates
Jinja templates need to be changed to leverage the (better) data structure and helper methods.
The first key difference is that each Jinja template can define how to name the corresponding file(s). If you don’t specify the name of the output file via the set_file_name method, Weaver will use the relative path and the name of the template itself to determine the output file.
For example, the following template builds the file name from the root namespace and places it in the subfolder provided in the output parameter.
{% set file_name = ctx.output + (ctx.root_namespace | snake_case ) ~ "_attributes.py" -%}
{{- template.set_file_name(file_name) -}}
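The file name computed by the template above can be traced in plain Python. The `snake_case` function here is a simplified stand-in for weaver's filter of the same name (its exact behavior on edge cases is not reproduced), and `ctx` is modeled as a dictionary for illustration.

```python
import re


def snake_case(text):
    """Simplified stand-in for weaver's snake_case filter."""
    # Insert a separator at lower/digit-to-upper boundaries, e.g. "FeatureFlag".
    text = re.sub(r"(?<=[a-z0-9])(?=[A-Z])", "_", text)
    # Collapse any run of non-alphanumeric characters into one underscore.
    return re.sub(r"[^A-Za-z0-9]+", "_", text).strip("_").lower()


def output_file_name(ctx):
    """Mirror of: ctx.output + (ctx.root_namespace | snake_case) ~ "_attributes.py"."""
    return ctx["output"] + snake_case(ctx["root_namespace"]) + "_attributes.py"


print(output_file_name({"output": "attributes/", "root_namespace": "http"}))
```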
Notable changes on data structure:
- attributes_and_templates -> ctx.attributes
- enum_attributes -> ctx.attributes | select("enum")
- metrics -> ctx.metrics
- root_namespace -> ctx.root_namespace (only available if using semconv_grouped_attributes or a similar filter)
- all custom parameters are provided as properties under the ctx variable.
- attribute.fqn -> attribute.name
- attribute.type | instantiated_type (gets underlying type of enum values)
- attribute.attr_type.members -> attribute.type.members (gets members of enum type)
- member.member_id -> member.id (gets id of the enum member)
Notable changes on helper methods:
- attr.fqn | to_const_name -> attr.name | screaming_snake_case
- attr.fqn | to_camelcase(True) -> attr.name | pascal_case
- attr.brief | to_doc_brief | indent -> attr.brief | comment_with_prefix(" ") (prefix is used to indent)
- stability/deprecation checks:
  - attribute is stable if checking one attribute, attributes | select("stable") to filter stable attributes
  - attribute is experimental if checking one attribute, attributes | select("experimental") to filter experimental attributes
  - attribute is deprecated if checking one attribute, attributes | select("deprecated") to filter deprecated attributes
- check if attribute is a template: attribute.type is template_type
- print_member_value - no replacement at this time, use something like {%- if type == "string" -%}"{{value}}"{%-else-%}{{value}}{%-endif-%}
- new way to simplify switch-like logic: key | map_text("map_name"). Maps can be defined in the weaver config. It can be very useful to convert semantic convention attribute types to language-specific types.
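For readers unfamiliar with the new helpers, the sketch below shows in plain Python roughly what a few of them do to a dotted attribute name. These are simplified approximations written for this document, not weaver's implementations, and the `TYPE_MAP` contents are a hypothetical example of a type mapping.

```python
import re


def screaming_snake_case(name):
    """Approximates the filter: "http.request.method" -> "HTTP_REQUEST_METHOD"."""
    return re.sub(r"[^A-Za-z0-9]+", "_", name).upper()


def pascal_case(name):
    """Approximates the filter: "http.request.method" -> "HttpRequestMethod"."""
    parts = re.split(r"[^A-Za-z0-9]+", name)
    return "".join(part.capitalize() for part in parts if part)


# Hypothetical map from semantic convention types to Python types.
TYPE_MAP = {"string": "str", "int": "int", "double": "float", "boolean": "bool"}


def map_text(key, mapping):
    """Switch-like lookup, similar in spirit to key | map_text("map_name")."""
    return mapping[key]


def print_member_value(member_type, value):
    """Same logic as the inline Jinja replacement: quote only string values."""
    return f'"{value}"' if member_type == "string" else str(value)


print(screaming_snake_case("http.request.method"))
print(pascal_case("http.request.method"))
print(map_text("double", TYPE_MAP))
print(print_member_value("string", "GET"), print_member_value("int", 4))
```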