Class: FileCollection
A collection of files with shared characteristics (format, purpose, structure). Represents a logical grouping of related files within a dataset, such as all training data files, all image files, or all raw data files. Maps to RO-Crate Dataset entities via schema:hasPart relationships.
URI: dcat:Dataset
classDiagram
class FileCollection
click FileCollection href "../FileCollection/"
Information <|-- FileCollection
click Information href "../Information/"
FileCollection : collection_type
FileCollection --> "*" FileCollectionTypeEnum : collection_type
click FileCollectionTypeEnum href "../FileCollectionTypeEnum/"
FileCollection : compression
FileCollection --> "0..1" CompressionEnum : compression
click CompressionEnum href "../CompressionEnum/"
FileCollection : conforms_to
FileCollection : conforms_to_class
FileCollection : conforms_to_schema
FileCollection : created_by
FileCollection : created_on
FileCollection : description
FileCollection : doi
FileCollection : download_url
FileCollection : external_resources
FileCollection --> "*" ExternalResource : external_resources
click ExternalResource href "../ExternalResource/"
FileCollection : file_count
FileCollection : id
FileCollection : issued
FileCollection : keywords
FileCollection : language
FileCollection : last_updated_on
FileCollection : license
FileCollection : modified_by
FileCollection : name
FileCollection : page
FileCollection : path
FileCollection : publisher
FileCollection : resources
FileCollection --> "*" Dataset : resources
click Dataset href "../Dataset/"
FileCollection : status
FileCollection : title
FileCollection : total_bytes
FileCollection : version
FileCollection : was_derived_from
Inheritance
- NamedThing
- Information
- FileCollection
- Information
Slots
| Name | Cardinality and Range | Description | Inheritance |
|---|---|---|---|
| path | 0..1 String |
Path or URL to the FileCollection | direct |
| compression | 0..1 CompressionEnum |
Compression format if the collection is packaged as a compressed archive (e | direct |
| external_resources | * ExternalResource |
External files or URLs referenced by this file collection | direct |
| resources | * Dataset or File or FileCollection |
Individual files or nested file collections within this collection | direct |
| collection_type | * FileCollectionTypeEnum |
Type(s) of content in this file collection | direct |
| file_count | 0..1 Integer |
Number of files in this collection | direct |
| total_bytes | 0..1 Integer |
Total size of all files in bytes | direct |
| conforms_to | 0..1 String |
Information | |
| conforms_to_class | 0..1 String |
Information | |
| conforms_to_schema | 0..1 String |
Information | |
| created_by | 0..1 String |
Information | |
| created_on | 0..1 Datetime |
Information | |
| doi | 0..1 String |
digital object identifier | Information |
| download_url | 0..1 Uri |
URL from which the data can be downloaded | Information |
| issued | 0..1 Datetime |
Information | |
| keywords | * String |
Information | |
| language | 0..1 String |
language in which the information is expressed | Information |
| last_updated_on | 0..1 Datetime |
Information | |
| license | 0..1 String |
Information | |
| modified_by | 0..1 String |
Information | |
| page | 0..1 String |
Information | |
| publisher | 0..1 Uriorcurie |
Information | |
| status | 0..1 String |
Information | |
| title | 0..1 String |
the official title of the element | Information |
| version | 0..1 String |
Information | |
| was_derived_from | 0..1 String |
Information | |
| id | 1 Uriorcurie |
A unique identifier for a thing | NamedThing |
| name | 0..1 String |
A human-readable name for a thing | NamedThing |
| description | 0..1 String |
A human-readable description for a thing | NamedThing |
Usages
| used by | used in | type | used |
|---|---|---|---|
| Dataset | file_collections | range | FileCollection |
| DataSubset | file_collections | range | FileCollection |
| FileCollection | resources | any_of[range] | FileCollection |
Aliases
- file collection
- data files
- file group
Identifier and Mapping Information
Schema Source
- from schema: https://w3id.org/bridge2ai/data-sheets-schema
Mappings
| Mapping Type | Mapped Value |
|---|---|
| self | dcat:Dataset |
| native | data_sheets_schema:FileCollection |
| exact | schema:Dataset |
| close | dcat:Distribution |
LinkML Source
Direct
name: FileCollection
description: A collection of files with shared characteristics (format, purpose, structure).
Represents a logical grouping of related files within a dataset, such as all training
data files, all image files, or all raw data files. Maps to RO-Crate Dataset entities
via schema:hasPart relationships.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
aliases:
- file collection
- data files
- file group
exact_mappings:
- schema:Dataset
close_mappings:
- dcat:Distribution
is_a: Information
slots:
- path
- compression
- external_resources
- resources
slot_usage:
path:
name: path
description: Path or URL to the FileCollection. May be a directory path, archive
file path, or download URL depending on how the collection is distributed.
compression:
name: compression
description: Compression format if the collection is packaged as a compressed
archive (e.g., gzip, zip, bzip2). Omit this field for uncompressed collections
or purely logical groupings.
external_resources:
name: external_resources
description: External files or URLs referenced by this file collection.
range: ExternalResource
multivalued: true
inlined_as_list: true
resources:
name: resources
description: Individual files or nested file collections within this collection.
Allows hierarchical file organization with both File objects and nested FileCollection
objects.
multivalued: true
inlined_as_list: true
any_of:
- range: File
- range: FileCollection
attributes:
collection_type:
name: collection_type
description: Type(s) of content in this file collection. A collection may have
multiple types, for example a collection containing both raw_data and documentation
files would have both types listed.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema/file-collection
rank: 1000
slot_uri: d4d:collectionType
domain_of:
- FileCollection
range: FileCollectionTypeEnum
multivalued: true
file_count:
name: file_count
description: Number of files in this collection.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema/file-collection
rank: 1000
slot_uri: d4d:fileCount
domain_of:
- FileCollection
range: integer
total_bytes:
name: total_bytes
description: Total size of all files in bytes.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema/file-collection
rank: 1000
slot_uri: dcat:byteSize
domain_of:
- FileCollection
range: integer
class_uri: dcat:Dataset
Induced
name: FileCollection
description: A collection of files with shared characteristics (format, purpose, structure).
Represents a logical grouping of related files within a dataset, such as all training
data files, all image files, or all raw data files. Maps to RO-Crate Dataset entities
via schema:hasPart relationships.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
aliases:
- file collection
- data files
- file group
exact_mappings:
- schema:Dataset
close_mappings:
- dcat:Distribution
is_a: Information
slot_usage:
path:
name: path
description: Path or URL to the FileCollection. May be a directory path, archive
file path, or download URL depending on how the collection is distributed.
compression:
name: compression
description: Compression format if the collection is packaged as a compressed
archive (e.g., gzip, zip, bzip2). Omit this field for uncompressed collections
or purely logical groupings.
external_resources:
name: external_resources
description: External files or URLs referenced by this file collection.
range: ExternalResource
multivalued: true
inlined_as_list: true
resources:
name: resources
description: Individual files or nested file collections within this collection.
Allows hierarchical file organization with both File objects and nested FileCollection
objects.
multivalued: true
inlined_as_list: true
any_of:
- range: File
- range: FileCollection
attributes:
collection_type:
name: collection_type
description: Type(s) of content in this file collection. A collection may have
multiple types, for example a collection containing both raw_data and documentation
files would have both types listed.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema/file-collection
rank: 1000
slot_uri: d4d:collectionType
alias: collection_type
owner: FileCollection
domain_of:
- FileCollection
range: FileCollectionTypeEnum
multivalued: true
file_count:
name: file_count
description: Number of files in this collection.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema/file-collection
rank: 1000
slot_uri: d4d:fileCount
alias: file_count
owner: FileCollection
domain_of:
- FileCollection
range: integer
total_bytes:
name: total_bytes
description: Total size of all files in bytes.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema/file-collection
rank: 1000
slot_uri: dcat:byteSize
alias: total_bytes
owner: FileCollection
domain_of:
- FileCollection
range: integer
path:
name: path
description: Path or URL to the FileCollection. May be a directory path, archive
file path, or download URL depending on how the collection is distributed.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: schema:contentUrl
alias: path
owner: FileCollection
domain_of:
- File
- FileCollection
range: string
compression:
name: compression
description: Compression format if the collection is packaged as a compressed
archive (e.g., gzip, zip, bzip2). Omit this field for uncompressed collections
or purely logical groupings.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcat:compressFormat
alias: compression
owner: FileCollection
domain_of:
- Information
- File
- FileCollection
range: CompressionEnum
external_resources:
name: external_resources
description: External files or URLs referenced by this file collection.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcterms:references
alias: external_resources
owner: FileCollection
domain_of:
- Dataset
- ExternalResource
- FileCollection
range: ExternalResource
multivalued: true
inlined_as_list: true
resources:
name: resources
description: Individual files or nested file collections within this collection.
Allows hierarchical file organization with both File objects and nested FileCollection
objects.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: schema:hasPart
alias: resources
owner: FileCollection
domain_of:
- DatasetCollection
- Dataset
- FileCollection
range: Dataset
multivalued: true
inlined_as_list: true
any_of:
- range: File
- range: FileCollection
conforms_to:
name: conforms_to
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcterms:conformsTo
alias: conforms_to
owner: FileCollection
domain_of:
- Information
range: string
conforms_to_class:
name: conforms_to_class
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcterms:conformsTo
alias: conforms_to_class
owner: FileCollection
domain_of:
- Information
range: string
conforms_to_schema:
name: conforms_to_schema
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcterms:conformsTo
alias: conforms_to_schema
owner: FileCollection
domain_of:
- Information
range: string
created_by:
name: created_by
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcterms:creator
alias: created_by
owner: FileCollection
domain_of:
- Information
range: string
created_on:
name: created_on
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcterms:created
alias: created_on
owner: FileCollection
domain_of:
- Information
range: datetime
doi:
name: doi
description: digital object identifier
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcterms:identifier
alias: doi
owner: FileCollection
domain_of:
- Information
range: string
pattern: 10\.\d{4,}\/.+
download_url:
name: download_url
description: URL from which the data can be downloaded. This is not the same as
the landing page, which is a page that describes the dataset. Rather, this URL
points directly to the data itself.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
exact_mappings:
- schema:url
rank: 1000
slot_uri: dcat:downloadURL
alias: download_url
owner: FileCollection
domain_of:
- Information
range: uri
issued:
name: issued
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcterms:issued
alias: issued
owner: FileCollection
domain_of:
- Information
range: datetime
keywords:
name: keywords
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcat:keyword
alias: keywords
owner: FileCollection
domain_of:
- Information
range: string
multivalued: true
language:
name: language
description: language in which the information is expressed
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
exact_mappings:
- schema:inLanguage
rank: 1000
slot_uri: dcterms:language
alias: language
owner: FileCollection
domain_of:
- Information
range: string
last_updated_on:
name: last_updated_on
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcterms:modified
alias: last_updated_on
owner: FileCollection
domain_of:
- Information
range: datetime
license:
name: license
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcterms:license
alias: license
owner: FileCollection
domain_of:
- Software
- Information
range: string
modified_by:
name: modified_by
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcterms:contributor
alias: modified_by
owner: FileCollection
domain_of:
- Information
range: string
page:
name: page
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcat:landingPage
alias: page
owner: FileCollection
domain_of:
- Information
range: string
publisher:
name: publisher
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcterms:publisher
alias: publisher
owner: FileCollection
domain_of:
- Information
range: uriorcurie
status:
name: status
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcterms:type
alias: status
owner: FileCollection
domain_of:
- Information
range: string
title:
name: title
description: the official title of the element
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcterms:title
alias: title
owner: FileCollection
domain_of:
- Information
range: string
version:
name: version
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
rank: 1000
slot_uri: dcterms:hasVersion
alias: version
owner: FileCollection
domain_of:
- Software
- Information
range: string
was_derived_from:
name: was_derived_from
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
exact_mappings:
- dcterms:source
rank: 1000
slot_uri: prov:wasDerivedFrom
alias: was_derived_from
owner: FileCollection
domain_of:
- Information
range: string
id:
name: id
description: A unique identifier for a thing.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema/base
rank: 1000
slot_uri: schema:identifier
identifier: true
alias: id
owner: FileCollection
domain_of:
- NamedThing
- DatasetProperty
range: uriorcurie
required: true
name:
name: name
description: A human-readable name for a thing.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema/base
rank: 1000
slot_uri: schema:name
alias: name
owner: FileCollection
domain_of:
- NamedThing
- DatasetProperty
range: string
description:
name: description
description: A human-readable description for a thing.
from_schema: https://w3id.org/bridge2ai/data-sheets-schema/base
rank: 1000
slot_uri: schema:description
alias: description
owner: FileCollection
domain_of:
- NamedThing
- DatasetProperty
- DatasetRelationship
range: string
class_uri: dcat:Dataset