Implement Model Parsing

This guide explains how to implement the ExtractModelInformation method to validate simulator model files and optionally extract metadata.

What is Model Parsing?

Model parsing is the process of validating/parsing if the file associated with a given simulator model revision is valid. Besides checking if the file is valid, you can also extract useful information from a model file, such as the flowsheet structure (nodes, edges, thermodynamics) and arbitrary metadata (info).

Minimum Requirements

At a minimum, your ExtractModelInformation implementation must:

Validate the file exists and can be opened
Set success or failure status

Basic Implementation

Here's a minimal implementation that just validates the file:

public async Task ExtractModelInformation(
    DefaultModelFilestate state,
    CancellationToken token)
{
    await semaphore.WaitAsync(token).ConfigureAwait(false);
    dynamic? workbook = null;

    try
    {
        Initialize();

        logger.LogInformation($"Validating model: {state.FilePath}");

        // Just try to open the file
        workbook = OpenBook(state.FilePath);

        if (workbook == null)
        {
            state.ParsingInfo.SetFailure("Failed to open model file");
            return;
        }

        // File is valid - report success with no extracted data
        state.ParsingInfo.SetSuccess();
        logger.LogInformation("Model validation successful");
    }
    catch (Exception ex)
    {
        logger.LogError(ex, "Error validating model");
        state.ParsingInfo.SetFailure(ex.Message);
    }
    finally
    {
        if (workbook != null)
        {
            try
            {
                workbook.Close(false);
            }
            catch (Exception ex)
            {
                logger.LogWarning(ex, "Error closing workbook");
            }
        }
        Shutdown();
        semaphore.Release();
    }
}

Optional: Extract Flowsheets

If your model contains a flowsheet, you can optionally extract flowsheet information from the model. This creates a browsable structure in CDF that users can reference when creating routines.

Flowsheet Structure

The API supports saving:

Flowsheets - Hierarchical structure of simulator objects
- Nodes - Objects in the flowsheet (streams, unit operations, etc.)
- Edges - Connections between nodes
- Thermodynamics - Thermodynamic package information

When to Extract Flowsheets

Extract flowsheets when:

Your simulator has a clear object model (streams, units, operations)
Users would benefit from browsing model structure in CDF
You want to enable validation of routine references
The simulator API makes extraction straightforward

Optional: Extract Info

You can also optionally extract arbitrary metadata about the model using the info field. This is a key-value structure that can hold any JSON-serializable data.

When to Use Info

Use info for:

Model metadata that doesn't fit the flowsheet structure
Version information
Author/creation details
Custom simulator-specific data
Any JSON-serializable information

Summary

For the Excel connector tutorial, our basic implementation simply validates that the workbook can be opened. This is sufficient for most use cases.

Next: Continue to Implement Routines to add simulation execution capabilities.

Table of Contents