
Class: SamplingStrategy

Does the dataset contain all possible instances or is it a sample (not necessarily random) of instances from a larger set? If the dataset is a sample, then what is the larger set? Is the sample representative of the larger set (e.g., geographic coverage)? If so, please describe how this representativeness was validated/verified. If it is not representative of the larger set, please describe why not (e.g., to cover a more diverse range of instances, because instances were withheld or unavailable).

URI: data_sheets_schema:SamplingStrategy

classDiagram
    class SamplingStrategy
    DatasetProperty <|-- SamplingStrategy
    SamplingStrategy : category
    SamplingStrategy : contribution_date
    SamplingStrategy : contributor_github_name
    SamplingStrategy : contributor_name
    SamplingStrategy : contributor_orcid
    SamplingStrategy : description
    SamplingStrategy : id
    SamplingStrategy : is_random
    SamplingStrategy : is_representative
    SamplingStrategy : is_sample
    SamplingStrategy : name
    SamplingStrategy : related_to
    SamplingStrategy --|> NamedThing : related_to
    SamplingStrategy : representative_verification
    SamplingStrategy : source_data
    SamplingStrategy : strategies
    SamplingStrategy : subclass_of
    SamplingStrategy --|> NamedThing : subclass_of
    SamplingStrategy : used_software
    SamplingStrategy --|> Software : used_software
    SamplingStrategy : why_not_representative

Inheritance

Slots

| Name | Cardinality and Range | Description | Inheritance |
| --- | --- | --- | --- |
| is_sample | 0..* Boolean | | direct |
| is_random | 0..* Boolean | | direct |
| source_data | 0..* String | | direct |
| is_representative | 0..* Boolean | | direct |
| representative_verification | 0..* String | | direct |
| why_not_representative | 0..* String | | direct |
| strategies | 0..* String | If the dataset is a sample from a larger set, what was the sampling strategy ... | direct |
| used_software | 0..* Software | What software was used as part of this dataset property? | DatasetProperty |
| id | 1..1 Uriorcurie | A unique identifier for a thing | NamedThing |
| category | 0..1 CategoryType | Name of the high level ontology class in which this entity is categorized | NamedThing |
| name | 0..1 String | A human-readable name for a thing | NamedThing |
| description | 0..1 String | A human-readable description for a thing | NamedThing |
| subclass_of | 0..* NamedThing | Holds between two classes where the domain class is a specialization of the r... | NamedThing |
| related_to | 0..* NamedThing | A relationship that is asserted between two named things | NamedThing |
| contributor_name | 0..1 String | The name of the person who added this node | NamedThing |
| contributor_github_name | 0..1 String | The name of the github user who added this node | NamedThing |
| contributor_orcid | 0..1 Uriorcurie | The ORCiD of the person who added this node | NamedThing |
| contribution_date | 0..1 Date | The date on which the node was added | NamedThing |
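
To illustrate how the direct slots relate to the questions in the class description, here is a minimal Python sketch of a SamplingStrategy record with one consistency check. The dataclass, the `check` helper, the example identifier, and the example values are all hypothetical. Only the slot names, ranges, and multivalued cardinalities are taken from the slot listing above. The warning rules are an assumption drawn from the prose description; the schema itself does not enforce them.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SamplingStrategy:
    """Hypothetical stand-in for the schema class (direct slots plus id)."""
    id: str  # required (1..1); a URI or CURIE
    is_sample: List[bool] = field(default_factory=list)
    is_random: List[bool] = field(default_factory=list)
    source_data: List[str] = field(default_factory=list)
    is_representative: List[bool] = field(default_factory=list)
    representative_verification: List[str] = field(default_factory=list)
    why_not_representative: List[str] = field(default_factory=list)
    strategies: List[str] = field(default_factory=list)


def check(s: SamplingStrategy) -> List[str]:
    """Return warnings for follow-up answers the description asks for but that are empty.

    Assumed rules, not schema constraints.
    """
    warnings = []
    if True in s.is_sample and not s.source_data:
        warnings.append("is_sample is true but source_data (the larger set) is empty")
    if True in s.is_representative and not s.representative_verification:
        warnings.append("is_representative is true but representative_verification is empty")
    if False in s.is_representative and not s.why_not_representative:
        warnings.append("is_representative is false but why_not_representative is empty")
    return warnings


example = SamplingStrategy(
    id="example:sampling-1",  # made-up identifier
    is_sample=[True],
    is_random=[False],
    source_data=["All records held by a hypothetical source registry"],
    is_representative=[False],
    strategies=["deterministic"],
)
print(check(example))
```

Because every direct slot is multivalued, the sketch stores each answer as a list, including the boolean ones.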

Usages

| used by | used in | type | used |
| --- | --- | --- | --- |
| Dataset | sampling_strategies | range | SamplingStrategy |
| DataSubset | sampling_strategies | range | SamplingStrategy |
| Instance | sampling_strategies | range | SamplingStrategy |

Identifier and Mapping Information

Schema Source

  • from schema: https://w3id.org/bridge2ai/data-sheets-schema

Mappings

| Mapping Type | Mapped Value |
| --- | --- |
| self | data_sheets_schema:SamplingStrategy |
| native | data_sheets_schema:SamplingStrategy |

LinkML Source

Direct

name: SamplingStrategy
description: Does the dataset contain all possible instances or is it a sample (not
  necessarily random) of instances from a larger set? If the dataset is a sample,
  then what is the larger set? Is the sample representative of the larger set (e.g.,
  geographic coverage)? If so, please describe how this representativeness was validated/verified.
  If it is not representative of the larger set, please describe why not (e.g., to
  cover a more diverse range of instances, because instances were withheld or unavailable).
in_subset:
- Composition
- Collection
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
is_a: DatasetProperty
attributes:
  is_sample:
    name: is_sample
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    range: boolean
  is_random:
    name: is_random
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    range: boolean
  source_data:
    name: source_data
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    range: string
  is_representative:
    name: is_representative
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    range: boolean
  representative_verification:
    name: representative_verification
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    range: string
  why_not_representative:
    name: why_not_representative
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    range: string
  strategies:
    name: strategies
    description: If the dataset is a sample from a larger set, what was the sampling
      strategy (e.g., deterministic, probabilistic with specific sampling probabilities)?
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    range: string
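
The cardinalities listed under Slots appear to follow from the `multivalued`, `required`, and `identifier` keys in these attribute definitions. The following Python sketch shows that mapping as inferred from this page. The `cardinality` function is a hypothetical helper, not LinkML's documentation generator, and the attribute dictionaries are trimmed copies of definitions shown on this page.

```python
def cardinality(attr: dict) -> str:
    """Derive a 'min..max' string from LinkML-style attribute metadata.

    Assumed mapping: required or identifier slots have a lower bound of 1,
    and multivalued slots have an unbounded upper limit.
    """
    lower = "1" if attr.get("required") or attr.get("identifier") else "0"
    upper = "*" if attr.get("multivalued") else "1"
    return f"{lower}..{upper}"


# Trimmed copies of attribute definitions that appear on this page.
attributes = {
    "is_sample": {"multivalued": True, "range": "boolean"},
    "strategies": {"multivalued": True, "range": "string"},
    "id": {"identifier": True, "required": True, "range": "uriorcurie"},
    "name": {"range": "string"},
}

for slot_name, attr in attributes.items():
    print(slot_name, cardinality(attr), attr["range"])
```

Under this reading, `is_sample` and `strategies` come out as 0..*, `id` as 1..1, and `name` as 0..1, which matches the slot listing.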

Induced

name: SamplingStrategy
description: Does the dataset contain all possible instances or is it a sample (not
  necessarily random) of instances from a larger set? If the dataset is a sample,
  then what is the larger set? Is the sample representative of the larger set (e.g.,
  geographic coverage)? If so, please describe how this representativeness was validated/verified.
  If it is not representative of the larger set, please describe why not (e.g., to
  cover a more diverse range of instances, because instances were withheld or unavailable).
in_subset:
- Composition
- Collection
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
is_a: DatasetProperty
attributes:
  is_sample:
    name: is_sample
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    alias: is_sample
    owner: SamplingStrategy
    domain_of:
    - SamplingStrategy
    range: boolean
  is_random:
    name: is_random
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    alias: is_random
    owner: SamplingStrategy
    domain_of:
    - SamplingStrategy
    range: boolean
  source_data:
    name: source_data
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    alias: source_data
    owner: SamplingStrategy
    domain_of:
    - SamplingStrategy
    range: string
  is_representative:
    name: is_representative
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    alias: is_representative
    owner: SamplingStrategy
    domain_of:
    - SamplingStrategy
    range: boolean
  representative_verification:
    name: representative_verification
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    alias: representative_verification
    owner: SamplingStrategy
    domain_of:
    - SamplingStrategy
    range: string
  why_not_representative:
    name: why_not_representative
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    alias: why_not_representative
    owner: SamplingStrategy
    domain_of:
    - SamplingStrategy
    range: string
  strategies:
    name: strategies
    description: If the dataset is a sample from a larger set, what was the sampling
      strategy (e.g., deterministic, probabilistic with specific sampling probabilities)?
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    alias: strategies
    owner: SamplingStrategy
    domain_of:
    - SamplingStrategy
    range: string
  used_software:
    name: used_software
    description: What software was used as part of this dataset property?
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    alias: used_software
    owner: SamplingStrategy
    domain_of:
    - DatasetProperty
    range: Software
  id:
    name: id
    description: A unique identifier for a thing.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: schema:identifier
    identifier: true
    alias: id
    owner: SamplingStrategy
    domain_of:
    - Information
    - FormatDialect
    - NamedThing
    range: uriorcurie
    required: true
  category:
    name: category
    description: Name of the high level ontology class in which this entity is categorized.
      Corresponds to the label for the entity type class, e.g., "BiomedicalStandard".
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    is_a: type
    domain: NamedThing
    designates_type: true
    alias: category
    owner: SamplingStrategy
    domain_of:
    - NamedThing
    range: category_type
  name:
    name: name
    description: A human-readable name for a thing.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: schema:name
    alias: name
    owner: SamplingStrategy
    domain_of:
    - NamedThing
    range: string
  description:
    name: description
    description: A human-readable description for a thing.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: schema:description
    alias: description
    owner: SamplingStrategy
    domain_of:
    - Information
    - Relationships
    - Splits
    - DataAnomaly
    - Confidentiality
    - Deidentification
    - SensitiveElement
    - InstanceAcquisition
    - CollectionMechanism
    - DataCollector
    - CollectionTimeframe
    - EthicalReview
    - DirectCollection
    - CollectionNotification
    - CollectionConsent
    - ConsentRevocation
    - DataProtectionImpact
    - PreprocessingStrategy
    - CleaningStrategy
    - LabelingStrategy
    - RawData
    - ExistingUse
    - UseRepository
    - OtherTask
    - FutureUseImpact
    - DiscouragedUse
    - ThirdPartySharing
    - DistributionFormat
    - DistributionDate
    - LicenseAndUseTerms
    - IPRestrictions
    - ExportControlRegulatoryRestrictions
    - Maintainer
    - Erratum
    - UpdatePlan
    - RetentionLimits
    - VersionAccess
    - ExtensionMechanism
    - NamedThing
    range: string
  subclass_of:
    name: subclass_of
    description: Holds between two classes where the domain class is a specialization
      of the range class.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    exact_mappings:
    - rdfs:subClassOf
    - MESH:isa
    narrow_mappings:
    - rdfs:subPropertyOf
    rank: 1000
    is_a: related_to
    domain: NamedThing
    multivalued: true
    inherited: true
    alias: subclass_of
    owner: SamplingStrategy
    domain_of:
    - NamedThing
    range: NamedThing
  related_to:
    name: related_to
    description: A relationship that is asserted between two named things.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    domain: NamedThing
    multivalued: true
    inherited: true
    alias: related_to
    owner: SamplingStrategy
    domain_of:
    - Organization
    - NamedThing
    symmetric: true
    range: NamedThing
  contributor_name:
    name: contributor_name
    description: The name of the person who added this node.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    is_a: node_property
    domain: NamedThing
    alias: contributor_name
    owner: SamplingStrategy
    domain_of:
    - NamedThing
    range: string
  contributor_github_name:
    name: contributor_github_name
    description: The name of the github user who added this node.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    is_a: node_property
    domain: NamedThing
    alias: contributor_github_name
    owner: SamplingStrategy
    domain_of:
    - NamedThing
    range: string
  contributor_orcid:
    name: contributor_orcid
    description: The ORCiD of the person who added this node.
    examples:
    - value: ORCID:0000-0001-1234-5678
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    is_a: node_property
    domain: NamedThing
    alias: contributor_orcid
    owner: SamplingStrategy
    domain_of:
    - NamedThing
    range: uriorcurie
  contribution_date:
    name: contribution_date
    description: The date on which the node was added.
    examples:
    - value: '2023-03-20'
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    is_a: node_property
    domain: NamedThing
    alias: contribution_date
    owner: SamplingStrategy
    domain_of:
    - NamedThing
    range: date
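
Comparing the two listings, the Induced form adds the inherited slots (`used_software`, `id`, `name`, and so on) and stamps every attribute with `alias`, `owner`, and `domain_of`. The Python sketch below imitates that difference on plain dictionaries. The `induce` function is a hypothetical illustration of the observed pattern, not LinkML's actual algorithm, and the inputs are trimmed copies of definitions from this page.

```python
from copy import deepcopy


def induce(class_name: str, direct: dict, inherited: dict) -> dict:
    """Merge direct and inherited attributes into an induced-style view."""
    induced = {}
    for slot_name, attr in direct.items():
        entry = deepcopy(attr)
        # Direct attributes are declared on this class only.
        entry.setdefault("domain_of", [class_name])
        induced[slot_name] = entry
    for slot_name, attr in inherited.items():
        # Assumption: a direct definition takes precedence over an inherited one.
        induced.setdefault(slot_name, deepcopy(attr))
    for slot_name, entry in induced.items():
        entry["alias"] = slot_name
        entry["owner"] = class_name
    return induced


direct = {"strategies": {"multivalued": True, "range": "string"}}
inherited = {
    "used_software": {
        "multivalued": True,
        "range": "Software",
        "domain_of": ["DatasetProperty"],
    },
}

view = induce("SamplingStrategy", direct, inherited)
for slot_name, entry in view.items():
    print(slot_name, entry["owner"], entry["domain_of"])
```

As in the Induced listing, the inherited `used_software` keeps `DatasetProperty` in `domain_of` while its `owner` becomes `SamplingStrategy`.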