Skip to content

Class: File

A single file within a dataset or file collection. Represents an individual data file, code file, documentation file, etc. Maps to RO-Crate File entities.

URI: schema:MediaObject

classDiagram class File click File href "../File/" Information <|-- File click Information href "../Information/" File : bytes File : compression File --> "0..1" CompressionEnum : compression click CompressionEnum href "../CompressionEnum/" File : conforms_to File : conforms_to_class File : conforms_to_schema File : created_by File : created_on File : description File : dialect File : doi File : download_url File : encoding File --> "0..1" EncodingEnum : encoding click EncodingEnum href "../EncodingEnum/" File : file_type File --> "0..1" FileTypeEnum : file_type click FileTypeEnum href "../FileTypeEnum/" File : format File --> "0..1" FormatEnum : format click FormatEnum href "../FormatEnum/" File : hash File : id File : issued File : keywords File : language File : last_updated_on File : license File : md5 File : media_type File --> "0..1" MediaTypeEnum : media_type click MediaTypeEnum href "../MediaTypeEnum/" File : modified_by File : name File : page File : path File : publisher File : sha256 File : status File : title File : version File : was_derived_from

Inheritance

Slots

Name Cardinality and Range Description Inheritance
bytes 0..1
Integer
Size of the data in bytes direct
path 0..1
String
The file path or URL where the content is located direct
format 0..1
FormatEnum
The file format, physical medium, or dimensions of a resource direct
encoding 0..1
EncodingEnum
The character encoding of the data direct
compression 0..1
CompressionEnum
Compression format used, if any (e direct
media_type 0..1
MediaTypeEnum
The media type of the data direct
hash 0..1
String
Cryptographic hash value of the data for integrity verification (e direct
md5 0..1
String
MD5 hash value of the data (128-bit cryptographic hash) direct
sha256 0..1
String
SHA-256 hash value of the data (256-bit cryptographic hash, recommended) direct
dialect 0..1
String
Specific format dialect or variation (e direct
file_type 0..1
FileTypeEnum
Semantic type or purpose of this file (e direct
conforms_to 0..1
String
An established standard, specification, or schema to which the resource confo... Information
conforms_to_class 0..1
String
The specific class or type within a schema to which the resource conforms Information
conforms_to_schema 0..1
String
The schema or data model to which the resource conforms Information
created_by 0..1
String
The person or organization primarily responsible for creating the resource Information
created_on 0..1
Datetime
The date and time when the resource was created Information
doi 0..1
String
Digital Object Identifier (DOI) in format 10 Information
download_url 0..1
Uri
URL from which the data can be downloaded Information
issued 0..1
Datetime
Date of formal issuance or publication of the resource Information
keywords *
String
Keywords or tags describing the resource for discovery and classification Information
language 0..1
String
Language in which the information is expressed Information
last_updated_on 0..1
Datetime
The date and time when the resource was most recently modified or updated Information
license 0..1
String
The legal license under which the resource is made available (e Information
modified_by 0..1
String
A person or organization that contributed to modifying or updating the resour... Information
page 0..1
String
A landing page or web page providing access to or information about the resou... Information
publisher 0..1
Uriorcurie
The organization or entity responsible for making the resource available Information
status 0..1
String
The status of the resource (e Information
title 0..1
String
The official title of the element Information
version 0..1
String
The version identifier of the resource (e Information
was_derived_from 0..1
String
A resource from which this resource was derived, in whole or in part Information
id 1
Uriorcurie
A unique identifier for a thing NamedThing
name 0..1
String
A human-readable name for a thing NamedThing
description 0..1
String
A human-readable description for a thing NamedThing

Usages

used by used in type used
FileCollection resources any_of[range] File

Aliases

  • data file
  • file
  • file object

Identifier and Mapping Information

Schema Source

  • from schema: https://w3id.org/bridge2ai/data-sheets-schema

Mappings

Mapping Type Mapped Value
self schema:MediaObject
native data_sheets_schema:File
exact schema:DigitalDocument

LinkML Source

Direct

name: File
description: A single file within a dataset or file collection. Represents an individual
  data file, code file, documentation file, etc. Maps to RO-Crate File entities.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
aliases:
- data file
- file
- file object
exact_mappings:
- schema:DigitalDocument
is_a: Information
slots:
- bytes
- path
- format
- encoding
- compression
- media_type
- hash
- md5
- sha256
- dialect
attributes:
  file_type:
    name: file_type
    description: Semantic type or purpose of this file (e.g., data_file, code_file,
      documentation_file, metadata_file).
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema/file-collection
    rank: 1000
    slot_uri: d4d:fileType
    domain_of:
    - File
    range: FileTypeEnum
class_uri: schema:MediaObject

Induced

name: File
description: A single file within a dataset or file collection. Represents an individual
  data file, code file, documentation file, etc. Maps to RO-Crate File entities.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
aliases:
- data file
- file
- file object
exact_mappings:
- schema:DigitalDocument
is_a: Information
attributes:
  file_type:
    name: file_type
    description: Semantic type or purpose of this file (e.g., data_file, code_file,
      documentation_file, metadata_file).
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema/file-collection
    rank: 1000
    slot_uri: d4d:fileType
    alias: file_type
    owner: File
    domain_of:
    - File
    range: FileTypeEnum
  bytes:
    name: bytes
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: '10485760'
    description: Size of the data in bytes.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: dcat:byteSize
    alias: bytes
    owner: File
    domain_of:
    - File
    range: integer
  path:
    name: path
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: data/ai_readi/participants.csv
    description: The file path or URL where the content is located.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: schema:contentUrl
    alias: path
    owner: File
    domain_of:
    - File
    - FileCollection
    range: string
  format:
    name: format
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: CSV
    description: The file format, physical medium, or dimensions of a resource. This
      should be a file extension or MIME type.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: dcterms:format
    alias: format
    owner: File
    domain_of:
    - File
    range: FormatEnum
  encoding:
    name: encoding
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: UTF-8
    description: The character encoding of the data.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: d4d:characterEncoding
    alias: encoding
    owner: File
    domain_of:
    - File
    range: EncodingEnum
  compression:
    name: compression
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: zip
    description: Compression format used, if any (e.g., gzip, bzip2, zip).
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: dcat:compressFormat
    alias: compression
    owner: File
    domain_of:
    - Information
    - File
    - FileCollection
    range: CompressionEnum
  media_type:
    name: media_type
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: text/csv
    description: The media type of the data. This should be a MIME type.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    exact_mappings:
    - schema:encodingFormat
    rank: 1000
    slot_uri: dcat:mediaType
    alias: media_type
    owner: File
    domain_of:
    - File
    range: MediaTypeEnum
  hash:
    name: hash
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
    description: 'Cryptographic hash value of the data for integrity verification
      (e.g., SHA-256: ''e3b0c44298fc1c149afb...'', MD5: ''d41d8cd98f00b204e9800998ecf8427e'').'
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    broad_mappings:
    - dcterms:identifier
    rank: 1000
    slot_uri: d4d:hashValue
    alias: hash
    owner: File
    domain_of:
    - File
    range: string
  md5:
    name: md5
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: d41d8cd98f00b204e9800998ecf8427e
    description: MD5 hash value of the data (128-bit cryptographic hash).
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    broad_mappings:
    - dcterms:identifier
    rank: 1000
    slot_uri: d4d:md5Checksum
    alias: md5
    owner: File
    domain_of:
    - File
    range: string
  sha256:
    name: sha256
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
    description: SHA-256 hash value of the data (256-bit cryptographic hash, recommended).
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: schema:sha256
    alias: sha256
    owner: File
    domain_of:
    - File
    range: string
  dialect:
    name: dialect
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: RFC4180
    description: Specific format dialect or variation (e.g., CSV dialect, JSON-LD
      profile).
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: schema:encodingFormat
    alias: dialect
    owner: File
    domain_of:
    - File
    range: string
  conforms_to:
    name: conforms_to
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: https://www.w3.org/TR/vocab-dcat-3/
    description: An established standard, specification, or schema to which the resource
      conforms.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: dcterms:conformsTo
    alias: conforms_to
    owner: File
    domain_of:
    - Information
    range: string
  conforms_to_class:
    name: conforms_to_class
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: Dataset
    description: The specific class or type within a schema to which the resource
      conforms.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    broad_mappings:
    - dcterms:conformsTo
    rank: 1000
    slot_uri: d4d:conformsToClass
    alias: conforms_to_class
    owner: File
    domain_of:
    - Information
    range: string
  conforms_to_schema:
    name: conforms_to_schema
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: https://w3id.org/bridge2ai/data-sheets-schema
    description: The schema or data model to which the resource conforms.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    broad_mappings:
    - dcterms:conformsTo
    rank: 1000
    slot_uri: d4d:conformsToSchema
    alias: conforms_to_schema
    owner: File
    domain_of:
    - Information
    range: string
  created_by:
    name: created_by
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: orcid:0000-0002-1234-5678
    description: The person or organization primarily responsible for creating the
      resource.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: dcterms:creator
    alias: created_by
    owner: File
    domain_of:
    - Information
    range: string
  created_on:
    name: created_on
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: '2023-07-18T00:00:00'
    description: The date and time when the resource was created.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: dcterms:created
    alias: created_on
    owner: File
    domain_of:
    - Information
    range: datetime
  doi:
    name: doi
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: 10.5281/zenodo.10642459
    description: Digital Object Identifier (DOI) in format 10.xxxx/xxxxx providing
      persistent identification (e.g., '10.1038/s41586-020-2649-2', '10.5281/zenodo.1234567').
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    exact_mappings:
    - schema:identifier
    broad_mappings:
    - dcterms:identifier
    rank: 1000
    slot_uri: d4d:doiIdentifier
    alias: doi
    owner: File
    domain_of:
    - Information
    range: string
    pattern: 10\.\d{4,}\/.+
  download_url:
    name: download_url
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: https://fairhub.io/datasets/2/download
    description: URL from which the data can be downloaded. This is not the same as
      the landing page, which is a page that describes the dataset. Rather, this URL
      points directly to the data itself.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    exact_mappings:
    - schema:url
    rank: 1000
    slot_uri: dcat:downloadURL
    alias: download_url
    owner: File
    domain_of:
    - Information
    range: uri
  issued:
    name: issued
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: '2024-11-15T00:00:00'
    description: Date of formal issuance or publication of the resource.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: dcterms:issued
    alias: issued
    owner: File
    domain_of:
    - Information
    range: datetime
  keywords:
    name: keywords
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: diabetes, retinal imaging, multimodal, clinical data
    description: Keywords or tags describing the resource for discovery and classification.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: dcat:keyword
    alias: keywords
    owner: File
    domain_of:
    - Information
    range: string
    multivalued: true
  language:
    name: language
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: en
    description: Language in which the information is expressed.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    exact_mappings:
    - schema:inLanguage
    rank: 1000
    slot_uri: dcterms:language
    alias: language
    owner: File
    domain_of:
    - Information
    range: string
  last_updated_on:
    name: last_updated_on
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: '2024-11-15T00:00:00'
    description: The date and time when the resource was most recently modified or
      updated.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: dcterms:modified
    alias: last_updated_on
    owner: File
    domain_of:
    - Information
    range: datetime
  license:
    name: license
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: CC-BY-NC-4.0
    description: The legal license under which the resource is made available (e.g.,
      "MIT", "CC-BY-4.0").
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: dcterms:license
    alias: license
    owner: File
    domain_of:
    - Software
    - Information
    range: string
  modified_by:
    name: modified_by
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: orcid:0000-0002-9876-5432
    description: A person or organization that contributed to modifying or updating
      the resource.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: dcterms:contributor
    alias: modified_by
    owner: File
    domain_of:
    - Information
    range: string
  page:
    name: page
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: https://fairhub.io/datasets/2
    description: A landing page or web page providing access to or information about
      the resource.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: dcat:landingPage
    alias: page
    owner: File
    domain_of:
    - Information
    range: string
  publisher:
    name: publisher
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: 'ror:04t3en479  # use a ROR ID, DOI, or URL — not a plain name'
    description: The organization or entity responsible for making the resource available.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: dcterms:publisher
    alias: publisher
    owner: File
    domain_of:
    - Information
    range: uriorcurie
  status:
    name: status
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: published
    description: The status of the resource (e.g., draft, published, deprecated).
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: d4d:publicationStatus
    alias: status
    owner: File
    domain_of:
    - Information
    range: string
  title:
    name: title
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: 'AI-READI: Salutogenesis Study of Type 2 Diabetes'
    description: The official title of the element.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: dcterms:title
    alias: title
    owner: File
    domain_of:
    - Information
    range: string
  version:
    name: version
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: 2.0.0
    description: The version identifier of the resource (e.g., "1.0", "2.3.1").
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: schema:version
    alias: version
    owner: File
    domain_of:
    - Software
    - Information
    range: string
  was_derived_from:
    name: was_derived_from
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: https://fairhub.io/datasets/2/versions/1
    description: A resource from which this resource was derived, in whole or in part.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    exact_mappings:
    - dcterms:source
    rank: 1000
    slot_uri: prov:wasDerivedFrom
    alias: was_derived_from
    owner: File
    domain_of:
    - Information
    range: string
  id:
    name: id
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: https://example.org/dataset/my-dataset-001
    description: A unique identifier for a thing.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema/base
    rank: 1000
    slot_uri: schema:identifier
    identifier: true
    alias: id
    owner: File
    domain_of:
    - NamedThing
    - DatasetProperty
    range: uriorcurie
    required: true
  name:
    name: name
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: AI-READI Dataset
    description: A human-readable name for a thing.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema/base
    rank: 1000
    slot_uri: schema:name
    alias: name
    owner: File
    domain_of:
    - NamedThing
    - DatasetProperty
    range: string
  description:
    name: description
    annotations:
      d4d:docExample:
        tag: d4d:docExample
        value: A multimodal dataset of 4,000 participants with Type 2 Diabetes.
    description: A human-readable description for a thing.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema/base
    rank: 1000
    slot_uri: schema:description
    alias: description
    owner: File
    domain_of:
    - NamedThing
    - DatasetProperty
    - DatasetRelationship
    range: string
class_uri: schema:MediaObject