ICH M11 establishes a harmonised template for clinical study protocol content and — more significantly — introduces the concept of a digital protocol: a machine-readable protocol representation that can be processed by regulatory agencies, shared between sponsors and sites, and linked to clinical data systems without manual transcription. The organisations that implement M11 as a semantic data management exercise, rather than as a document formatting exercise, position themselves for the full efficiency dividend that the standard was designed to deliver.
The Digital Protocol Concept
The M11 digital protocol is based on a data model in which protocol elements — eligibility criteria, endpoints, assessments, visit schedules — are expressed as structured data objects rather than narrative text. Each eligibility criterion is represented as a logical condition operating on patient characteristics expressed in standardised terminology. Each endpoint is linked to the assessment procedure that generates it, the statistical analysis that will evaluate it, and the ontological concept that defines its clinical meaning. This structure makes the protocol machine-readable in a way that a word-processed document, however well-formatted, can never be.
Semantic Modelling of Eligibility Criteria
Eligibility criteria are among the most knowledge-intensive elements of a clinical protocol. An inclusion criterion such as "patients with a confirmed diagnosis of HER2-positive breast cancer, defined as IHC 3+ or ISH amplification ratio ≥ 2.0" contains multiple layered conditions that must be expressed in structured form: the disease concept (breast cancer), the biomarker status (HER2-positive), and the specific measurement thresholds that define the biomarker status. Representing these conditions using ontological concept identifiers — the disease concept linked to a disease ontology, the biomarker linked to a molecular entities ontology, the measurement threshold linked to a units ontology — makes the criterion directly translatable into database queries for cohort identification, without human interpretation at the point of execution.
Cross-Protocol Analytics
When protocols are authored as structured semantic data, cross-protocol analytics become possible that are not achievable from document-based protocols. Programme directors can query the full protocol portfolio for: all studies that include patients with a specific eligibility condition, all studies using a specific endpoint definition, all studies that share a baseline assessment procedure that could be harmonised. This portfolio-level visibility into protocol structure supports programme strategy decisions, regulatory negotiation preparation, and the identification of cross-study analysis opportunities that would otherwise go unrecognised.