Skip to content

Class: DataSet

Represents a data set by its metadata. This may or may not be produced by a group in the Bridge2AI consortium.

URI: https://w3id.org/bridge2ai/standards-schema-all/DataSet

classDiagram class DataSet click DataSet href "../DataSet/" NamedThing <|-- DataSet click NamedThing href "../NamedThing/" DataSet : availability_description DataSet : category DataSet : contribution_date DataSet : contributor_github_name DataSet : contributor_name DataSet : contributor_orcid DataSet : data_url DataSet : datasheet_url DataSet : description DataSet : documentation_url DataSet : has_application DataSet --> "*" Application : has_application click Application href "../Application/" DataSet : has_files DataSet : has_parts DataSet --> "*" DataSet : has_parts click DataSet href "../DataSet/" DataSet : id DataSet : is_bridge2ai_data DataSet : is_public DataSet : name DataSet : produced_by DataSet --> "*" Organization : produced_by click Organization href "../Organization/" DataSet : related_to DataSet --> "*" NamedThing : related_to click NamedThing href "../NamedThing/" DataSet : subclass_of DataSet --> "*" NamedThing : subclass_of click NamedThing href "../NamedThing/" DataSet : substrates DataSet --> "*" DataSubstrate : substrates click DataSubstrate href "../DataSubstrate/" DataSet : topics DataSet --> "*" DataTopic : topics click DataTopic href "../DataTopic/" DataSet : used_in_bridge2ai

Inheritance

Slots

Name Cardinality and Range Description Inheritance
has_files *
String
Subject data set has the file(s) listed in this slot as parts direct
has_parts *
DataSet
Subject data set has the data set(s) listed in this slot as parts direct
produced_by *
Organization
Subject data set was produced by the organization(s) listed in this slot direct
data_url 0..1
Uriorcurie
URL where the data set can be accessed direct
documentation_url 0..1
Uriorcurie
URL where documentation for the data set can be accessed direct
datasheet_url 0..1
Uriorcurie
URL where the datasheet for the data set can be accessed direct
is_public 0..1
Boolean
True if the data set is publicly available direct
substrates *
DataSubstrate
Subject data set is relevant to the substrate(s) listed in this slot direct
topics *
DataTopic
Subject data set is relevant to the topic(s) listed in this slot direct
is_bridge2ai_data 0..1
Boolean
True if the data set is produced by a Bridge2AI consortium group direct
availability_description 0..1
String
A description of the availability of the data set, including any restrictions... direct
id 1
Uriorcurie
A unique identifier for a thing NamedThing
category 0..1
CategoryType
CURIE for the high level ontology class in which this entity is categorized NamedThing
name 0..1
String
A human-readable name for a thing NamedThing
description 0..1
String
A human-readable description for a thing NamedThing
subclass_of *
NamedThing
Holds between two classes where the domain class is a specialization of the r... NamedThing
related_to *
NamedThing
A relationship that is asserted between two named things NamedThing
contributor_name 0..1
String
The name of the person who added this node NamedThing
contributor_github_name 0..1
String
The name of the github user who added this node NamedThing
contributor_orcid 0..1
Uriorcurie
The ORCiD of the person who added this node NamedThing
contribution_date 0..1
Date
The date on which the node was added NamedThing
used_in_bridge2ai 0..1
Boolean
True if the entity is used, developed, or otherwise related to work in the Br... NamedThing
has_application *
Application
A list of one or more specific applications of this entity to a specific purp... NamedThing

Usages

used by used in type used
DataSet has_files domain DataSet
DataSet has_parts domain DataSet
DataSet has_parts range DataSet
DataSet produced_by domain DataSet
DataSet data_url domain DataSet
DataSet documentation_url domain DataSet
DataSet datasheet_url domain DataSet
DataSet is_public domain DataSet
DataSet substrates domain DataSet
DataSet topics domain DataSet
DataSet is_bridge2ai_data domain DataSet
DataSet availability_description domain DataSet
DataSetContainer data_collection range DataSet
Manifest datasets range DataSet

Identifier and Mapping Information

Schema Source

  • from schema: https://w3id.org/bridge2ai/standards-schema-all

Mappings

Mapping Type Mapped Value
self https://w3id.org/bridge2ai/standards-schema-all/DataSet
native https://w3id.org/bridge2ai/standards-schema-all/DataSet

LinkML Source

Direct

name: DataSet
description: Represents a data set by its metadata. This may or may not be produced
  by a group in the Bridge2AI consortium.
from_schema: https://w3id.org/bridge2ai/standards-schema-all
is_a: NamedThing
slots:
- has_files
- has_parts
- produced_by
- data_url
- documentation_url
- datasheet_url
- is_public
- substrates
- topics
- is_bridge2ai_data
- availability_description

Induced

name: DataSet
description: Represents a data set by its metadata. This may or may not be produced
  by a group in the Bridge2AI consortium.
from_schema: https://w3id.org/bridge2ai/standards-schema-all
is_a: NamedThing
attributes:
  has_files:
    name: has_files
    description: Subject data set has the file(s) listed in this slot as parts. Note
      that each file is not a data set and does not have its own ID in the registry.
      For data sets that are parts of this data set, use the `has_parts` slot.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    is_a: related_to
    domain: DataSet
    inherited: true
    alias: has_files
    owner: DataSet
    domain_of:
    - DataSet
    range: string
    multivalued: true
  has_parts:
    name: has_parts
    description: Subject data set has the data set(s) listed in this slot as parts.
      Note that each part is itself a data set, with its own ID in the registry. For
      specific files, use the `has_files` slot.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    is_a: related_to
    domain: DataSet
    inherited: true
    alias: has_parts
    owner: DataSet
    domain_of:
    - DataSet
    range: DataSet
    multivalued: true
  produced_by:
    name: produced_by
    description: Subject data set was produced by the organization(s) listed in this
      slot. Must be an Organization object, referenced with its B2AI_ORG ID.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    is_a: related_to
    domain: DataSet
    inherited: true
    alias: produced_by
    owner: DataSet
    domain_of:
    - DataSet
    range: Organization
    multivalued: true
  data_url:
    name: data_url
    description: URL where the data set can be accessed.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    is_a: node_property
    domain: DataSet
    alias: data_url
    owner: DataSet
    domain_of:
    - DataSet
    range: uriorcurie
  documentation_url:
    name: documentation_url
    description: URL where documentation for the data set can be accessed.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    is_a: node_property
    domain: DataSet
    alias: documentation_url
    owner: DataSet
    domain_of:
    - DataSet
    range: uriorcurie
  datasheet_url:
    name: datasheet_url
    description: URL where the datasheet for the data set can be accessed.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    is_a: node_property
    domain: DataSet
    alias: datasheet_url
    owner: DataSet
    domain_of:
    - DataSet
    range: uriorcurie
  is_public:
    name: is_public
    description: True if the data set is publicly available. False if the data set
      is private or requires special permissions to access.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    is_a: node_property
    domain: DataSet
    alias: is_public
    owner: DataSet
    domain_of:
    - DataSet
    range: boolean
  substrates:
    name: substrates
    description: Subject data set is relevant to the substrate(s) listed in this slot.
      Must be a DataSubstrate object, referenced with its B2AI_SUBSTRATE ID.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    is_a: related_to
    domain: DataSet
    inherited: true
    alias: substrates
    owner: DataSet
    domain_of:
    - DataSet
    range: DataSubstrate
    multivalued: true
  topics:
    name: topics
    description: Subject data set is relevant to the topic(s) listed in this slot.
      Must be a DataTopic object, referenced with its B2AI_TOPIC ID.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    is_a: related_to
    domain: DataSet
    inherited: true
    alias: topics
    owner: DataSet
    domain_of:
    - DataSet
    range: DataTopic
    multivalued: true
  is_bridge2ai_data:
    name: is_bridge2ai_data
    description: True if the data set is produced by a Bridge2AI consortium group.
      False if the data set is not produced by a Bridge2AI consortium group.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    is_a: node_property
    domain: DataSet
    alias: is_bridge2ai_data
    owner: DataSet
    domain_of:
    - DataSet
    range: boolean
    required: false
  availability_description:
    name: availability_description
    description: A description of the availability of the data set, including any
      restrictions on access or use.
    examples:
    - value: '"Datasets require additional permissions. Please visit https://example.com
        for more information."'
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    is_a: node_property
    domain: DataSet
    alias: availability_description
    owner: DataSet
    domain_of:
    - DataSet
    range: string
    required: false
  id:
    name: id
    description: A unique identifier for a thing.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    slot_uri: schema:identifier
    identifier: true
    alias: id
    owner: DataSet
    domain_of:
    - NamedThing
    range: uriorcurie
    required: true
  category:
    name: category
    description: CURIE for the high level ontology class in which this entity is categorized.
      Corresponds to the label for the entity type class, e.g., "B2AI_STANDARD:DataStandard".
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    is_a: type
    domain: NamedThing
    designates_type: true
    alias: category
    owner: DataSet
    domain_of:
    - NamedThing
    range: category_type
  name:
    name: name
    description: A human-readable name for a thing.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    slot_uri: schema:name
    alias: name
    owner: DataSet
    domain_of:
    - NamedThing
    range: string
  description:
    name: description
    description: A human-readable description for a thing.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    slot_uri: schema:description
    alias: description
    owner: DataSet
    domain_of:
    - NamedThing
    range: string
  subclass_of:
    name: subclass_of
    description: Holds between two classes where the domain class is a specialization
      of the range class.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    exact_mappings:
    - rdfs:subClassOf
    - MESH:isa
    narrow_mappings:
    - rdfs:subPropertyOf
    rank: 1000
    is_a: related_to
    domain: NamedThing
    inherited: true
    alias: subclass_of
    owner: DataSet
    domain_of:
    - NamedThing
    range: NamedThing
    multivalued: true
  related_to:
    name: related_to
    description: A relationship that is asserted between two named things.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    domain: NamedThing
    inherited: true
    alias: related_to
    owner: DataSet
    domain_of:
    - NamedThing
    - Organization
    symmetric: true
    range: NamedThing
    multivalued: true
  contributor_name:
    name: contributor_name
    description: The name of the person who added this node.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    is_a: node_property
    domain: NamedThing
    alias: contributor_name
    owner: DataSet
    domain_of:
    - NamedThing
    range: string
  contributor_github_name:
    name: contributor_github_name
    description: The name of the github user who added this node.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    is_a: node_property
    domain: NamedThing
    alias: contributor_github_name
    owner: DataSet
    domain_of:
    - NamedThing
    range: string
  contributor_orcid:
    name: contributor_orcid
    description: The ORCiD of the person who added this node.
    examples:
    - value: ORCID:0000-0001-1234-5678
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    is_a: node_property
    domain: NamedThing
    alias: contributor_orcid
    owner: DataSet
    domain_of:
    - NamedThing
    range: uriorcurie
  contribution_date:
    name: contribution_date
    description: The date on which the node was added.
    examples:
    - value: '2023-03-20'
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    is_a: node_property
    domain: NamedThing
    alias: contribution_date
    owner: DataSet
    domain_of:
    - NamedThing
    range: date
  used_in_bridge2ai:
    name: used_in_bridge2ai
    description: True if the entity is used, developed, or otherwise related to work
      in the Bridge2AI consortium. If false, the entity is not explicitly related
      to Bridge2AI. If not specified, it is not known if the entity is related to
      Bridge2AI.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    is_a: node_property
    domain: NamedThing
    alias: used_in_bridge2ai
    owner: DataSet
    domain_of:
    - NamedThing
    range: boolean
  has_application:
    name: has_application
    description: A list of one or more specific applications of this entity to a specific
      purpose.
    from_schema: https://w3id.org/bridge2ai/standards-schema-all
    rank: 1000
    domain: NamedThing
    alias: has_application
    owner: DataSet
    domain_of:
    - NamedThing
    range: Application
    multivalued: true
    inlined: true
    inlined_as_list: true