Submission Templates

Model Card Template - Metadata

### Model Card Template - Metadata

# How to fill in this template:
#
# 1. For simple fields, add your text within the quotations:
#      compute_requirement: "GPU"
#
# 2. For fields that can have multiple entries, add a block for each one:
#      authors:
#        - name: "Jane Doe"
#          type: "individual"
#          affiliates: ["Chan Zuckerberg Initiative", "Stanford University"]
#        - name: "Chan Zuckerberg Initiative"
#          type: "organization"
#
# 3. For fields with suggested options, uncomment the ones that apply to your model:
#      tasks_performed_by_model:
#         - Cell Clustering
#      #  - Cell Labeling
#      #  - Cross Species Integration
#
#    You can also add your own items to some lists:
#      tasks_performed_by_model:
#        - Cell Clustering
#        - My Custom Task
#
# 4. For optional fields:
#    If a field doesn't apply to your model, make sure the entire entry
#    is commented out (or deleted) rather than leaving it blank:
#
#    Instead of this:
#      finetuned_from:   # (optional) If applicable. Add one or more.
#        - model_name: ""
#          model_url: ""
#
#    Do this:
#     # finetuned_from:   # (optional) If applicable. Add one or more.
#     #  - model_name: ""
#     #    model_url: ""

### Basic Info
model_display_name: ""
model_version: ""                      # vX or vX.X.X or YYYY-MM-DD format
primary_contact_email: ""
repository_link: ""
publication_preprint_link: ""
release_date: ""                       # YYYY-MM-DD
model_modality:                        # can be "Imaging" or "Transcriptomic" or "Reasoning" or "Genomic" (or more than one of those)
#  - Imaging
#  - Transcriptomic
#  - Reasoning
#  - Genomic
model_description: ""
short_description: ""                  # Max 160 characters
authors:                               # List one or more individuals or organizations
  - name: ""
    type: ""                           # one of "individual" or "organization"
#    affiliates: [""]                  # (optional) if it is an "individual", add one or more affiliated organization name
licenses:                              # Add one or more licenses
  - type: ""                           # e.g "CC BY 4.0"
    url: ""                            # e.g "https://creativecommons.org/licenses/by/4.0/deed.en"
compute_requirement: ""                # Minimum compute requirement: one of "GPU" (requires GPU) or "CPU" (runs on CPU or GPU)
# system_requirements: ""              # (optional) Any other system constraints or requirements you would like to describe in a few sentences.

### Model Details
model_architecture_type:               # Choose from any architectures or packages below and/or add onto the list
#  - PyTorch
#  - TensorFlow
#  - JAX
#  - Safetensors
#  - Transformers
#  - PEFT
#  - TensorBoard
#  - GGUF
#  - Diffusers
#  - ONNX
#  - stable-baselines3
#  - sentence-transformers
#  - ml-agents
#  - MLX
#  - TF-Keras
#  - Adapters
#  - Keras
#  - setfit
#  - timm
#  - sample-factory
#  - Transformers.js
tasks_performed_by_model:               # Choose from any tasks below and/or add onto the list
#  - Batch Correction
#  - Causal Inference
#  - Cell Clustering
#  - Cell Labeling
#  - Cell Morphology Profiling
#  - Cell Type Annotation
#  - Cell Type Classification
#  - Contrast Transfer Function (CTF) Estimation
#  - Cross Species Integration
#  - CryoET Particle Picking
#  - Data Integration
#  - Differential Expression
#  - Feature Extraction
#  - Frame Alignment & Motion Correction
#  - Gene Co-expression Prediction
#  - Gene Network Inference
#  - Hypothesis Generation
#  - Image Segmentation
#  - Imputation
#  - Perturbation Detection
#  - Perturbation Prediction
#  - Protein Localization
#  - Synthetic Data Generation
#  - Tomogram Alignment
#  - Tomogram Reconstruction
#  - Virtual Staining
# finetuned_from:                       # (optional) If applicable. Add one or more.
#  - model_name: ""
#    model_url: ""
# model_variants:                       # (optional) If applicable. Add one or more.
#  - variant_name: ""
#    variant_description: ""
#    variant_url: ""

### Training Details
# training_date: ""                     # (optional) YYYY-MM-DD
uses_synthetic_data: ""                 # one of "Yes" or "No". Was synthetic data used in developing the model?
uses_purchased_licensed_data: ""        # one of "Yes" or "No". Were any datasets used for model training purchased or licensed?
# flops_used: ""                        # (optional) If available, please share the approximate number of FLOPs used for training your model.
input_data_type:                        # Choose from any data types below and/or add onto the list
#  - pdf
#  - svg
#  - png
#  - jpg
#  - zarr
#  - czi
#  - tiff
#  - mrc
#  - hdf5
#  - csv
#  - docx
#  - txt
output_data_type:                       # Choose from any data types below and/or add onto the list
#  - embeddings
#  - cell types
#  - images
dataset_sources:                        # Add one or more datasets used to develop this model
  - dataset_name: ""
    dataset_version: ""
    dataset_url: ""                     # If the dataset has no public url, please fill out the dataset_ingestion_template.yaml file and add the name of the file here. e.g. dataset_ingestion_myModel.yaml
    usage:
#      - training                       # can be "training" | "evaluation" | "pre-training" or more than one of those
#      - evaluation
#      - pre-training

### Other Resources
# related_models:                       # (optional) Link any other models your model should be associated with.
#  - model_name: ""
#    model_url: ""
# related_datasets:                     # (optional) Link any other datasets your model should be associated with.
#  - dataset_name: ""
#    dataset_url: ""

# See the Model Contribution Docs for detailed instructions on providing a Quickstart and Tutorial link: https://chanzuckerberg.github.io/virtual-cells-platform/
quickstart_link: ""                     # template -> https://colab.research.google.com/drive/1VfrAM-BxXwUveDqhdwViD8BOH5yk_FU6
# tutorial_link: ""                     # (optional but encouraged) template -> https://colab.research.google.com/drive/1DrRY_mJyYkx3Bg-Y9Ev9X209jGwt8wVF?usp=sharing
model_download_link: ""                 # (optional if model is packaged in MLFlow) e.g. s3://..

Model Card Template - Details

Save template

# Model Card Template - Details
<!-- You can use standard [Markdown](https://www.markdownguide.org/basic-syntax/) in this file to format your responses including lists, tables, links, and headings.

To include images in your model card, place them in the `model_card_images/` folder and reference them like so:

![Descriptive alt text that can also serve as a caption](./model_card_images/your_image.png)

Write descriptive alt text that explains what's in the image for screen readers. This is crucial for users with visual impairments. For guidance:
- [WebAIM Alt Text Guide](https://webaim.org/techniques/alttext/)
- [W3C Image Decision Tree](https://www.w3.org/WAI/tutorials/images/decision-tree/) -->

<!-- MODEL DETAILS SECTION -->

## Model Details

### Model Architecture
<!-- Brief description of the model architecture (e.g., number of layers and attention heads, embedding dimensions, input size or context length) and rationale behind it -->

...

### Parameters
<!-- Number of parameters (e.g., 15 million) -->

...

### Citation
<!-- Provide citation information for users of the model -->

...

<!-- INTENDED USE SECTION -->

## Intended Use
<!-- This section addresses questions around how the model is intended to be used in different applied contexts, discusses the foreseeable users of the model (including those affected by the model), and describes uses that are considered out of scope or misuse of the model. -->

### Primary Use Cases
<!-- List primary use cases (e.g., cell type classification, perturbation prediction, protein localization, cell morphology profiling). You can include the 'tasks_performed_by_model' that you provided in model_card_details.yaml as a starting point.  -->

...

### Out-of-Scope or Unauthorized Use Cases
<!-- Suggested Text:

"Do not use the model for the following purposes:
 - Use that violates applicable laws, regulations (including trade compliance laws), or third party rights such as
   privacy or intellectual property rights.
 - Any use that is prohibited by the [link to model license] license.
 - Any use that is prohibited by the Acceptable Use Policy."

[Please include other specific out-of-scope use cases that may be relevant for this model, as applicable]
-->

...

<!-- TRAINING DETAILS SECTION -->
<!-- This section provides information to describe and replicate training, including the training data and the speed and size of training elements. -->

## Training Data
<!-- Brief description of the training data, including type of data and dataset size (e.g., 30M cells, 1M cell images with annotations in 35 organelles and subcellular structures), and if possible, include links to data and/or pre-processed data. -->

...

### Training Procedure
<!-- Briefly describe the training approach including data pre-processing steps (e.g., steps taken to clean and preprocess the data, detail tokenization, modality dependent resizing/rewriting) -->

...

### Training Code
<!-- (optional, but strongly encouraged) Provide links to training scripts -->

...

### Speeds, Sizes, Times
<!-- (optional, include if available) Provide information about throughput, start/end time, checkpoint size if relevant, etc. (optional, include if available) -->

...

### Training Hyperparameters
<!-- (optional, include if available) Examples: fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision  -->

...

<!-- PERFORMANCE METRICS SECTION -->

## Performance Metrics
<!-- This section describes the evaluation protocols, what is being measured in the evaluation, and provides the results. -->

### Metrics
<!-- List evaluation metrics used and the rationale for using them along with links where applicable. If this model was benchmarked against existing models, list them here and explain the rationale for comparison.

For example:
"The model was evaluated using a range of benchmarks to measure its performance.
Key metrics include: [metrics]." -->

...

### Evaluation Datasets
<!-- List the evaluation datasets along with links to the data where possible. Link to evaluation datasets processing code if available. -->

...

### Evaluation Results
<!-- Provide table and/or figures summarizing evaluation results -->

...

<!-- BIASES, RISKS, AND LIMITATIONS SECTION -->

## Biases, Risks, and Limitations
<!-- This section identifies potential harms, misunderstandings, and technical and limitations. It also provides
information on warnings and potential mitigations. Suggestions are provided below. -->

### Potential Biases
<!-- Suggested Text:

"- The model may reflect biases present in the training data.
 - Certain demographic groups may be underrepresented."

[Please include other specific biases that may be relevant for this model, as applicable] -->

...

### Risks
<!-- Suggested Text:

"Areas of risk may include but are not limited to:
 - Inaccurate outputs or hallucinations
 - Potential misuse for incorrect biological interpretations."

[Please include other specific risks that may be relevant for this model, as applicable] -->

...

### Limitations
<!--
Suggested Text:

"- The model may not perform well on general tasks."

[Please include other specific limitations that may be relevant for this model, as applicable]
-->

...

### Caveats and Recommendations
<!-- (optional)
Suggested Text:

"- Review and validate outputs generated by the model.
 - We are committed to advancing the responsible development and use of artificial intelligence. Please follow our Acceptable Use Policy when using the model."

For CZI and CZ Biohub models:
"- Should you have any security or privacy issues or questions related to the model, please reach out to our team at security@chanzuckerberg.com or privacy@chanzuckerberg.com, respectively."

[Please include other recommendations that may be relevant for users of this model, as applicable]
-->

...

<!-- ACKNOWLEDGEMENTS -->

## Acknowledgements
<!-- (optional) This section is for providing acknowledgement of other contributors or supporting organizations. -->

...