Matches in Nanopublications for { ?s <http://www.w3.org/2000/01/rdf-schema#comment> ?o ?g. }
- FAIRsharing.acd824 comment "The European Language Social Science Thesaurus (ELSST) is a broad-based, multilingual thesaurus for the social sciences. It is owned and published by the Consortium of European Social Science Data Archives (CESSDA) and its national Service Providers. The thesaurus consists of over 3,300 concepts and covers the core social science disciplines: politics, sociology, economics, education, law, crime, demography, health, employment, information, communication technology, and environmental science. ELSST is used for data discovery within CESSDA and facilitates access to data resources across Europe, independent of domain, resource, language, or vocabulary. ELSST is currently available in 16 languages: Danish, Dutch, Czech, English, Finnish, French, German, Greek, Hungarian, Icelandic, Lithuanian, Norwegian, Romanian, Slovenian, Spanish, and Swedish" assertion.
- FAIRsharing.rYgXhw comment "This document defines the core components of the Observation data model that are necessary to perform data discovery when querying data centers for astronomical observations of interest. It exposes use-cases to be carried out, explains the model and provides guidelines for its implementation as a data access service based on the Table Access Protocol (TAP). It aims at providing a simple model easy to understand and to implement by data providers that wish to publish their data into the Virtual Observatory. This interface integrates data modeling and data access aspects in a single service and is named ObsTAP. In this document, the Observation Data Model Core Components (ObsCoreDM) defines the core components of queryable metadata required for global discovery of observational data. It is meant to allow a single query to be posed to TAP services at multiple sites to perform global data discovery without having to understand the details of the services present at each site. It defines a minimal set of basic metadata and thus allows for a reasonable cost of implementation by data providers." assertion.
- assertion comment "testing language tags" assertion.
- research-question comment "Current geospatial systems face challenges with multi-scale global data due to projection distortions and indexing limitations. DGGS promises to address these issues, but no comprehensive synthesis exists comparing DGGS performance against established approaches. This review will provide evidence-based guidance for DGGS adoption." assertion.
- research-question comment "Current geospatial systems face challenges with multi-scale global data due to projection distortions and indexing limitations. DGGS promises to address these issues, but no comprehensive synthesis exists comparing DGGS performance against established approaches. This review will provide evidence-based guidance for DGGS adoption." assertion.
- research-question comment "Systematic mapping of Portugal's climate change literature to identify sector coverage, geographic distribution, and knowledge gaps for supporting national adaptation planning" assertion.
- research-question comment "The European Agroecology Partnership aims to create resilient and sustainable farming systems by 2050. LifeWatch ERIC leads WP5 on developing comprehensive frameworks, methodologies, indicators, and data management tools to monitor agroecology transition. This review supports evidence-based assessment of transition progress across EU living labs and research infrastructures." assertion.
- research-question comment "Quantum computing is emerging as a potential paradigm shift for ecology (Sherley et al. 2023, Clenet et al. 2025). Applications include conservation planning optimization (Microsoft/Marxan), species distribution modeling, population genetics, and ecological network analysis. This scoping review maps the current landscape to identify promising directions and gaps for the biodiversity research community." assertion.
- research-question comment "Multi-source Earth observation data cannot be directly fed to AI algorithms without costly spatial harmonization — including reprojection, resampling, and vector-raster conversion. This preprocessing bottleneck limits the scalability and reproducibility of machine learning workflows in EO. DGGS offers a potential solution by providing a standardized spatial index where heterogeneous datasets become directly associable via zone IDs, potentially eliminating traditional harmonization steps. However, no systematic synthesis exists evaluating DGGS effectiveness specifically for AI-ready data preparation. This review will assess whether DGGS can serve as a scalable, interoperable framework that enables direct ingestion of multi-source EO data into AI pipelines." assertion.
- systematic-review comment "TITLE-ABS-KEY ( ( DGGS OR HEALPix ) ) AND PUBYEAR = 2024 AND ( LIMIT-TO ( LANGUAGE , "English" ) )" assertion.
- PubChemCDB comment "The PubChem Compound Database contains validated chemical depiction information provided to describe substances in PubChem Substance. Structures stored within PubChem Compounds are pre-clustered and cross-referenced by identity and similarity groups." assertion.
- PubChemCDB comment "The PubChem Compound Database contains validated chemical depiction information provided to describe substances in PubChem Substance. Structures stored within PubChem Compounds are pre-clustered and cross-referenced by identity and similarity groups." assertion.
- PubChemCDB comment "The PubChem Compound Database contains validated chemical depiction information provided to describe substances in PubChem Substance. Structures stored within PubChem Compounds are pre-clustered and cross-referenced by identity and similarity groups." assertion.
- systematic-review comment "bla bla" assertion.
- DANS-DataStationSSH comment "This FIP is created by the managers of the DANS Data Station Social Sciences and Humanities (SSH) to provide an overview of the FAIR Implementation Choices of the DANS Data Station SSH." assertion.
- Manager-DANS-DataStationSSH comment "This FIP is created by the managers of the DANS Data Station Social Sciences and Humanities (SSH) to provide an overview of the FAIR Implementation Choices of the DANS Data Station SSH." assertion.
- Managers-DANS-DataStationSSH comment "This FIP is created by the managers of the DANS Data Station Social Sciences and Humanities (SSH) to provide an overview of the FAIR Implementation Choices of the DANS Data Station SSH." assertion.
- _2 comment "Oral history." assertion.
- _2 comment "Personal experience as a former chruch goer and some one who attends live music events" assertion.
- _2 comment "https://www.asiliaafrica.com/blog/the-meaning-in-maasai-beading/" assertion.
- _2 comment "https://beninhistory.org/origin-of-benin-art/f/edo-symbols-of-power-and-authority-part-3-ukhure" assertion.
- _2 comment "https://www.woutersgallery.com/artists/cyprien-tokoudagba-(1939-2012) https://fr.wikipedia.org/wiki/Cyprien_Tokoudagba" assertion.
- _2 comment "https://www.asiliaafrica.com/blog/the-meaning-in-maasai-beading/" assertion.
- assertion comment "Moved to a Space." assertion.
- assertion comment "was just for testing" assertion.
- _2 comment "I received one when I first went to my grandfather's village in Biak" assertion.
- assertion comment "Nanopublication is republished under "Making a statement with Wikidata Vocabulary" template." assertion.
- assertion comment "retract to create a new introduction to connect with new publications." assertion.
- RAmnA8VFfVGVI0aeveyoFKujwsJNlj_KPpcaZkBm5qL0I comment "I've been watching his animated films since I was a kid." assertion.
- assertion comment "test" assertion.
- assertion comment "wrong" assertion.
- assertion comment "wrong" assertion.
- assertion comment "example" assertion.
- RAuWOCSXWqleEF6BM3-FGPrU4SNvHtz2HPpcmkJZgK4bo comment "Ataturk, occupation" assertion.
- assertion comment "Wrong Template" assertion.
- proceedings2019038011 comment "This paper concludes that modelling of agricultural and forestry biomass productivity are needed." assertion.
- assertion comment "The paper explains that most tools do not have sufficient metadata and advocate for the usage of ontologies such as the EDAM Ontology to annotate tools and workflows." assertion.
- feart.2023.1245815 comment "This paper uses HEALPix to regrid datasets with different resolutions and argues that it is the most suitable approach for enabling proper comparison." assertion.
- Continuous_Integration_101 comment "An introduction to Continuous Integration" assertion.
- assertion comment "The members of this community are referred to in the singular (manager) when it should be plural (managers). Therefore, the wording of this FIC is retracted." assertion.
- credit comment "Testing the awarding of Science Live credits... :)" assertion.
- assertion comment "Replace this with your assertion content." assertion.
- N-Triples comment "N-Triples was introduced as part of the W3C RDF Working Group’s efforts to create a simpler, more machine-friendly format for RDF data. It was developed alongside Turtle and became a W3C standard in 2014. NTriples is a line-based, plain text serialization format for RDF graphs. It was designed to be easier for software to parse and generate, but less readable for humans due to its minimal syntax. It lacks some of the shortcuts that are present in other RDF serializations such as Turtle (e.g., prefixes). Being line-based means that each RDF triple – a subject, predicate and object – is written on a separate line, with no additional structure or nesting. This design allows machines to parse the data efficiently, making the format ideal for large-scale data processing and data dumps where efficiency matters more than human readability. NTriples is a preferred serialization for RDF data." assertion.
- CLARIAH-Media-Suite comment "The CLARIAH Media Suite is the central digital access point for AV collections and search and analysis tools in the Netherlands for students and researchers. It is part of the Dutch Infrastructure for Digital Humanities and Social Science developed in the CLARIAH project. The full version of the Media Suite is accessible to researchers and students from Dutch universities, higher education institutions and a select number of Dutch research institutions." assertion.
- REFI-QDA comment "The purpose of the REFI-QDA (Qualitative Data Analysis) standard is to enable the exchange of qualitative data between QDA programs. The standard has been developed by QDA software companies ATLAS.TI, Dedoose, f4 analysis, MAXQDA, NVivo, QDAMiner, Quirkos and Transana. The standard consists of two parts: REFI-QDA Codebook, for exchanging codebooks. REFI-QDA Project for the exchange of information about projects. The specifications of the REFI-QDA standard are openly available and the format is based on XML. See: https://www.qdasoftware.org. The REFI-QDA (Qualitative Data Analysis) standard is a preferred format for file type Computer Assisted Qualitative Data Analysis (CAQDAS)." assertion.
- TriG comment "TriG is a plain text serialization format designed for storing RDF datasets containing multiple graphs within a single file. It’s a compact and readable alternative to other XML-based syntaxes. It’s an extension of the Turtle syntax, adding functionality to allow graph separation, which is useful for representing datasets with different contexts or versions. TriG has been standardized by the W3C. TriG is a preferred serialization for RDF data." assertion.
- IFC comment "Industry Foundation Classes (.ifc) is an open, XML-based data model for the exchange of building information model data: data about buildings, construction and maintenance. It can describe all kinds of details of building structures. It was also designed to support CAD data. Various software packages support IFC, and there are also several free viewers available. IFC is a preferred format for file types 3D and CAD." assertion.
- COLLADA comment "COLLADA is an open-source XML format for storing interactive 3D applications. It was designed by Sony in collaboration with various large graphic design organizations. Today, COLLADA is managed by the non-profit organization Khronos Group. COLLADA .dae has been adopted by ISO as a publicly available specification. COLLADA is supported by popular software packages such as (GIS application) ArcGIS, (CAD application) Autodesk Infrastructure Modeller, Google Earth v1.4, Mac OS X 10.6 Preview. COLLADA supports various graphic 3D elements. It is intended as an intermediate format: to redirect 3D information in parts to other software. However, the file specification of COLLADA is not very strict, which means that interoperability between COLLADA exports from different software can cause problems. Although DANS marked COLLADA as a preferred format, X3D is preferred if 3D data can be supplied in X3D. COLLADA is a preferred format for file type 3D." assertion.
- glTF comment "The Graphics Language Transmission Format was designed as an interoperable format for loading scenes and models created with 3D tools, in modern applications. The glTF format is a JSON-based text format and is accompanied by binary data (.bin) for geometry, animations, and other associated data; and raster images (.jpg; .png) for textures. There are two versions of glTF: glTF 1.0, first released in October 2015; and glTF 2.0, released in June 2017. Version 2.0 is a significant upgrade from version 1.0 and is aimed at better support from applications. With the increasing adoption of version 2.0, version 1.0 seems to have become obsolete. glb is the extension for binary glTF: a container format that collects the glTF and its associated data into a single file. Binary glTF was introduced as an extension for glTF 1.0 and was incorporated in the glTF 2.0 specification. glTF is being developed in an open project by the non-profit organization Khronos Group. glTF 2.0 is a preferred format for file type 3D. glb is a preferred format for file type 3D on the condition that it is based on glTF 2.0. glTF 1.0 and glb based on glTF 1.0 are non-preferred formats for file type 3D." assertion.
- X3D comment "The X3D format is managed by the non-profit organization Web3D Consortium. It is an XML-based ISO standard set up with the aim of becoming the 3D standard for the web. In addition to a few 3D models, the format is also suitable for storing complex 3D data such as virtual reality. X3D contains fairly extensive support for the storage of graphic elements. X3D is a preferred format for file type 3D." assertion.
- Polygon-file-format comment "PLY is also known as the Stanford Triangle Format. It is a WaveFront Object-inspired format with the option of extensions describing additional features of a 3D model, including color and transparency. The .ply file can be supplied as an ASCII text file or as a binary format. ASCII PLY is preferred for accessibility and archiving in the long term. Not every software supports all extensions of the Polyon file format: it is possible that the data is not completely read, depending on the software used. That is why it is especially important with PLY that it is well documented which information the data contains. Polygon file format is a preferred format for file type 3D." assertion.
- WaveFront comment "WaveFront Object is a very widely supported open file format for the representation of 3D geometry. The file contains a clear, simple structure in which the spatial positions of each point of the object and texture coordinates are written. The .obj file can be supplied as an ASCII text file or as a binary format. ASCII OBJ is preferred for accessibility and archiving in the long term. In addition to the .obj file, material / texture information can be stored in an .mtl file. The texture itself is saved separately as an image. Read more about raster images: https://dans.knaw.nl/en/file-formats/images-raster/. OBJ does not store information about animations. WaveFront Object is a preferred format for file type 3D." assertion.
- WaveFront comment "WaveFront Object is a very widely supported open file format for the representation of 3D geometry. The file contains a clear, simple structure in which the spatial positions of each point of the object and texture coordinates are written. The .obj file can be supplied as an ASCII text file or as a binary format. ASCII OBJ is preferred for accessibility and archiving in the long term. In addition to the .obj file, material / texture information can be stored in an .mtl file. The texture itself is saved separately as an image. Read more about raster images: https://dans.knaw.nl/en/file-formats/images-raster/. OBJ does not store information about animations. WaveFront Object is a preferred format for file type 3D." assertion.
- ASCII-GRID comment "It is advisable to convert Grid files to ASCII text as much as possible. It can be expected from GIS applications that they can correctly import ASCII grid files. ASCII GRID is a preferred format for file type raster grid. " assertion.
- GeoTIFF comment "GeoTIFF is a metadata standard for adding georeference to a TIFF image. This metadata is included in the TIFF file itself. GeoTIFF is an extension of TIFF, an open format in the public domain. The format is supported by various commonly used GIS applications. GeoTIFF is a preferred format for file type Images (georeference). " assertion.
- MIF-MID comment "The “MapInfo Interchange Format” .mif, usually associated with the .mid file, is the export format of the MapInfo software, designed for GIS interoperability. It is a clear, well documented, well supported and stable ASCII text file. It can be expected from GIS applications that they can correctly import MIF/MID data and that map layers in the application’s own format can be exported to MIF/MID. MIF/MID is a preferred format for file type GIS. " assertion.
- GML comment "GML is the XML standard for geographic data, designed by the Open Geospatial Consortium. Since August 2007, GML has been adopted as the ISO standard ISO 19136. Support for GML was initially limited but has since increased significantly, now GIS applications can be expected to be able to correctly import GML data. GML is a preferred format for file type GIS. Read more about GIS. " assertion.
- AutoCAD-DXF comment "The developer Autodesk, with the main software AutoCAD, is a market leader in the field of CAD. The AutoCAD formats DXF and DWG are used extensively. No open formats have been developed for the exchange of CAD formats. DXF is specifically designed to facilitate data interoperability between AutoCAD and other programs and is therefore well supported by other CAD applications. DXF version R12 (ASCII) appears to be best supported for successful and correct import into other applications. A major problem with the use of DXF is the development of the DWG format. DWG now offers possibilities for which not all properties can be stored in DXF. If export to DXF R12 (ASCII) does not lead to information loss, this version of the DXF format is considered the best option for preserving AutoCAD formats in a relatively open, widely supported format. A conversion from DWG to DXF must be carefully checked for completeness. If elements of the DWG cannot be stored in the DXF, an export to a more recent version of DXF is acceptable. If conversion is not possible without loss of information, the DWG must be retained. DXF version R12 (ASCII) is a preferred format for file type Computer Aided Design (CAD). " assertion.
- OPUS comment "OPUS is a codec for Audio formats." assertion.
- FLAC comment "Free Lossless Audio Codec (FLAC) is an open format highly suitable for transcoding without quality loss (lossless compression). The FLAC specification includes a wrapper for FLAC bitstreams plus the lossless compression codec itself. FLAC can be used as a dissemination/delivery format, but is also a suitable format for the archival storage of audio streams. Adoption is moderate. FLAC is a DANS preferred format for audio, because of its lossless compression of digital audio, but adoption is moderate. " assertion.
- Matroska comment "The Matroska Multimedia Container is an open source, open standard multimedia container, that can wrap an unlimited number of audio, video, metadata and/or subtitle-files. Matroska Audio supports a large number of codecs. It has a very large userbase and is the basis for WebM media in HTML5/browsers. Most platforms offer native support for this format. Adoption of the file format is expected to further increase thanks to several outcomes of the PREFORMA project, including the Media Conch toolset, an implementation checker for Matroska. Matroska file extensions are .mkv for video (with subtitles and audio), .mka for audio-only files, and .mks for subtitles only. Matroska is a DANS preferred container format for audio and video." assertion.
- MXF comment "Material Exchange Format (MXF) is an open wrapper format developed by the Society of Motion Picture and Television Engineers. It supports a number of different bitstreams encoded with any of a variety of codecs, together with a metadata wrapper that describes the material contained within the MXF file. The wrapper is relatively transparent although the structure of an MXF file can be complex. Since invention of this format, there is an ever-increasing interest and adoption of MXF. It is emerging as a standard for professional digital video and audio media. For example, the Library of Congress National Audio-Visual Conservation Center produces a form of MXF as archival masters as they reformat their older videotapes. MXF is a DANS preferred container format for audio and video." assertion.
- BWF comment "The Broadcast Wave Format (BWF) is an open standard that is widely recommended as the archival master for audio preservation. It consists of uncompressed audio. It is an extension of the WAV format, since it may contain metadata which describes the format of the audio data and the encoding method, and carries a sample-accurate time stamp that can be used to place related files in the proper sequence. BWF can wrap either MPEG or LPCM encoding, although for long-term preservation the use of LPCM is preferred. BWF is a DANS preferred container format for Audio. " assertion.
- SVG comment "SVG stands for “Scalable Vector Graphics”. It is a robust, XML-based format for statistical and dynamic vector images. SVG is an open standard and is well supported by various applications. SVG vector images can be opened in web browsers such as Firefox, Safari, Google Chrome, Microsoft Edge and Explorer. For further processing, vector image applications such as Adobe Illustrator or Inkscape can be used. Inkscape is free to download from the website and works on Windows, Mac OS X and Linux. All common Vector Image formats (EPS, AI, WMF, CDR) can be opened in Inkscape and Adobe Illustrator and converted to SVG. SVG is a preferred format for file type Images (vector) and for file type Computer Aided Design (CAD)." assertion.
- JASP comment "JASP is a free platform-independent statistical software package with a graphical user interface. It is an open source alternative for proprietary software like SPSS, and is being developed by the University of Amsterdam. In JASP, you have the option to export data in formats which can be easily read into other applications such as .csv. Results of analyses are being exported in .html format. These formats have been added to our preferred formats within the file type of Statistical data. " assertion.
- DICOM comment "DICOM stands for Digital Imaging and Communications in Medicine. It is a communication standard for dealing with medical information generated by medical equipment, such as scanners. DICOM has been developed by the Medical Imaging and Technology Alliance (MITA). Before the communication standard existed, information was stored in various file formats associated with the software of a specific medical device. DICOM has put an end to this. DICOM is widely used and is the standard within medical digital image processing. Both doctors and medical imaging research groups use the DICOM file format for their research. In the standard, various types of images are supported for different medical applications, both still images and moving images. DICOM supports commonly used compression standards, such as JPEG and JPEG2000, or MPEG-2 for video images. The copyright on the standard is held by the American National Electrical Manufacturers Association (NEMA). DICOM viewing software can be divided into two groups: (1) “proprietary” viewers that are part of the (medical) recording systems and (2) DICOM viewing software for PCs. Non-proprietary viewers that are available for free are DicomWorks, Osiris and IrfanView (a widely used all-format viewer). Adobe has developed a plug-in for Photoshop that makes it possible to view or export DICOM images and “header” (= metadata) information to other formats. The IrfanView program is also capable of extracting images and / or animations (sequence of images) from DICOM files. DICOM files are a preferred format, due to their open specification and adoption as standard within the medical world. In certain circumstances it may be useful to save a single DICOM file from the entire dataset as .jpeg / tiff (the image) and .txt file (the header). This is due to the fact that few people outside the medical world have experience with DICOM. This way you can easily view one file from the dataset, without having to use an open DICOM viewer. As datasets containing DICOM files tend to be relatively large, we advise you to contact a DANS data manager before depositing these files. We are happy to advise you on the best way to deliver data and store it in the dataset for user-friendly purposes DICOM is a preferred format for file type Images (raster)." assertion.
- SIARD comment "For relational databases, SIARD is seen as a suitable and sustainable format. SIARD (Software Independent Archiving of Relational Databases) is intended for archiving relational databases in a way that is as independent of the original DBMS as possible. This format takes into account all the significant characteristics of databases. SIARD is an open, freely available format, based on clear text formats: Unicode, XML, SQL (1999). This makes it accessible for various tools. SIARD is a relatively young format, see SIARD_Suite. SIARD is a preferred format for file type Databases." assertion.
- SQL comment "Many DBMSs support the ISO-standardized version of Structured Query Language (SQL): a language for querying and updating relational databases. Together with the Data Definition Language, used to define and modify schemas, the contents of a database can be stored as a collection of schema and data statements. The language rarely changes, but the various modifications and additions may change along with software updates. If extensions are used, the documentation must show which SQL version is used. It is possible to refer to non-existent and / or external data in SQL, without invalidating the file. If SQL is used for exchanging data, any external references must therefore be supplied, or each reference must be replaced by the data referred to. SQL is a preferred format for file type Databases. " assertion.
- ODS comment "ODF is an ISO standardized format for office documents. For spreadsheets, it uses the .ods extension; the other two extensions are .odt for textual files and .odp for presentations. ODF files are essentially XML files that comply with the XML schema defined in ISO / IEC 26300: 2006. The last revision dates from 2015. The advantage of ODS is that it is an open standard that can be processed by multiple applications. Metadata can be added in the form of RDF to later versions of ODF. You can add metedata up to the level of cells, in the case of spreadsheets. Examples of office suites that use ODF include OpenOffice and LibreOffice. MSExcel files can also be saved as ODS. There are various software libraries that can be used to program ODF programmatically. Media type The media type of ODS files is application/vnd.oasis.openocument.spreadsheet and application/vnd.oasis.openocument.spreadsheet-template for templates. Also see Planning for Library of Congress Collections: https://www.loc.gov/preservation/digital/formats/fdd/fdd000439.shtml ODS is a preferred format for the Spreadsheets file type. " assertion.
- Text-Fabric comment "Files in the text-fabric file format (.tf) store a column of feature values that correspond to nodes and edges in a graph, which together represent annotated text. So, one could say that .tf is a Markup format. Annotated Text In the humanities, primary research data often takes the form of texts. Many of these texts are historical artefacts and a lot of knowledge is needed to interpret them. Annotations are a preferred way to represent this knowledge. They may convey detailed linguistic information at the word level, but they can also link persons, places, materials, and concepts found in the text to external descriptions. Texts are always structured, and annotations need an addressing mechanism to target the specific portions in the text that they are about. The annotations tend to form bodies of knowledge in themselves, and need to be shared and distributed as separate entities. Data model Text-Fabric is a tool that facilitates this exchange of data. In order to do so, it defines a model [TF model] for annotated text. In this model, text is an annotated graph: a system of nodes and edges between nodes, where nodes and edges are linked to other information by means of features. The nodes stand for textual concepts, such as words, sentences, chapters, and the edges for relationships between these portions of text. Features are mappings from nodes or edges to values. Nodes themselves are just integer numbers, and edges are just pairs of numbers. This model is very close to Linguistic Annotation Framework [LAF ISO standard]. The main differences are that LAF prefers to be represented in XML and Text-Fabric is XML-free, and that a LAF dataset may reside in a single or in separate files at the choice of the corpus designer, while a Text-Fabric dataset always stores a single feature in a single file. Node features A node feature is a mapping from numbers to values: a column of values, where the position in the column corresponds to the number of the node. Edge features An edge feature can be seen as a mapping from nodes to other nodes, where a value may be supplied for each connection. Edge features are also columns of values, where the postion in the column corresponds to the number of the node where the edges start. File format Text-Fabric defines an efficient way to store features in files [TF file format]. Each feature occupies a single file. A Text-Fabric dataset is just a flat collection of feature files. Extension Feature files typically have extension .tf . Tools Text-Fabric is also a library [TF API] by which you can process text and annotations. It understands the .tf file format and offers an API to load and save feature files and to compute with the data contained in them. Text-Fabric compiles .tf files into binary .tfx files which are optimised to load very fast. These .tfx files are just a convenience but are not suitable for archiving and should not be considered a preferred or even acceptable format. They are dependent on the computer where they have been generated. Text-Fabric is by no means required to make sense of .tf files. The format is so transparent that several users bypass the tool Text-Fabric and have written their own programs (in languages other than Python) to ingest .tf files. Corpora A number of corpora [TF Corpora] have already been converted to Text-Fabric, such as the Hebrew Bible, various Cuneiform tablet collections, the Quran, and more. For all these corpora there are dedicated tutorials [TF tutorials] that show the practice that Text-Fabric supports. References TF model: Model – Text-Fabric (archived version) TF file format: Format – Text-Fabric (archived version) TF optimizations: Optimizations – Text-Fabric (archived version) TF example: Banks: convert.ipynb (archived version) TF API: TF – Text-Fabric (archived version) TF Corpora: Corpora – Text-Fabric (archived version) TF Tutorials: tutorials (archived version) LAF ISO Standard: ISO 24612:2012 – Linguistic annotation framework (LAF) Text-Fabric is a preferred format for file type Programming languages. " assertion.
- Matlab comment "MATLAB is a programming platform designed specifically for engineers and scientists. The heart of MATLAB is the MATLAB language, a matrix-based language allowing the most natural expression of computational mathematics.The name MATLAB stands for matrix laboratory. MATLAB was originally written to provide easy access to matrix software developed by the LINPACK and EISPACK projects, which together represent the state-of-the-art in software for matrix computation. MATLAB has evolved over a period of years with input from many users. In university environments, it is the standard instructional tool for introductory and advanced courses in mathematics, engineering, and science. In industry, MATLAB is the tool of choice for high-productivity research, development, and analysis. This format is accepted in DANS with no further conversions required in its executable code format (.m) and its workspace files (.mat). We strongly suggest to build up your code as neatly as possible and clearly state which version of MATLAB was used to develop the code, include comments where necessary, and make use of good programming practices. The code should be clear enough for users to understand the results and for possible reproduction of the study and methods. MATLAB is a preferred format for file type Programming languages. " assertion.
- Markdown comment "Markdown is a lightweight markup language that is known for its simplicity and ease of use for writing structured documents. It was created in 2004 by John Gruber, with significant contributions from Aaron Swartz. Gruber wanted to create a plain-text format that could be easily converted to structurally valid HTML. His goal was to develop a syntax that was both human-readable and easy to write, as opposed to the verbose nature of HTML. The resulting Markdown consists of two components: a plain text formatting syntax, and a software tool that converts the plain text to HTML. The first Markdown version was released as an open-source project and was quickly adopted by bloggers and people working with wikis. Today it is widely used by platforms like GitHub (as README files), Reddit, Stack Overflow, and Discord. Markdown’s readability and ease of use make it popular in a variety of domains, including web content creation, software documentation, note-taking and manuscript writing. It allows bloggers, journalists, developers and authors to create documents for the web without needing a deep understanding of HTML or CSS, as Markdown files can easily be converted into fully formatted documents. While Markdown is widely used, there isn’t a universally accepted standard specification for its syntax. The exact implementation can differ across platforms and versions. Markdown parsers and implementations may interpret the same input differently, leading to potential difficulties when moving content across platforms or converting them into different formats. For example, certain Markdown features, such as code blocks, images and footnotes, may be rendered differently on Github versus a personal website using a different Markdown processor. Note that these rendering differences don’t affect the content of the Markdown file, just the layout. Interpretations can also differ across different Markdown parsers. This has led to the development of slightly different Markdown syntax variations that you can refer to as flavors or dialects. The vast majority of the syntax will be the same across dialects, so don’t worry about it too much. The dialects may differ in which features are supported (such as tables or math blocks), and may handle ambiguities differently. DANS accepts any markdown dialect. A Markdown document can have a .md or .markdown extension. Markdown is a preferred file format for file type Markup Language." assertion.
- XML comment "XML (eXtensible Markup Language) is a generic markup language designed by the World Wide Web Consortium (W3C) and is used in XML documents, a class of data objects that describe the behavior of algorithms through which they are processed. XML is an application profile and a limited form of SGML (Standard Generalized Markup Language). The format of XML is in its most basic aspects the same as plain text, which makes XML readable for people. As the complexity increases, readability decreases and we depend on specialized applications for interpretation. This means that the possibilities for preserving XML over a longer period of time depend on the degree of complexity on the one hand and the availability of those specialized applications on the other. XML document An XML document contains one or more elements within one root element. Each element consists of a start tag and an end tag or is an empty element. Between the start and end tag is the content of the element, which can consist of text and / or child elements. An element can contain a set of attributes that consist of a name and (text) value. The structure of a valid XML document can be validated on syntax. An example of an online validator is the W3C Markup Validation Service: https://validator.w3.org/ Encoding of XML documents The encoding of XML documents is indicated in the XML prologue, a declaration before the root element. If no prologue is present, then the encoding is assumed to be the default for XML documents: UTF-8. Media type The general media type, or mime type, for XML documents is application/xml, also known as text/xml. The usual extension is .xml. XML subtypes XML has a wide variety of subtypes and specialized applications. Some of the most used: – XHTML – Extensible Hyper Text Markup Language, a stricter version of html – XSD – XML Schema Definition, a formal description of the elements in an XML document – XSLT – XSL (eXtensible Stylesheet Language) Transformations, for transforming XML documents into other formats (in particular XML and HTML). Specialized applications of XML include: – RDF XML – Resource Description Framework (RDF) described in XML, according to RFC3870 – RSS XML – RSS (Realy Simple Syndication) a type of web feed – WSDL XML – (Web Services Description Language) an XML-based language for describing web services – GML – Geography Markup Language, a markup language for describing geographical features – SVG – Scalable Vector Graphics, a markup language for images See also Planning for Library of Congress Collections: https://www.loc.gov/preservation/digital/formats/fdd/fdd000075.shtml XML is a preferred format for file type Markup Language. " assertion.
- ODT comment "The OpenDocument format for file formats has been developed by the Organization for the Advancement of Structured Information Standards (OASIS) as XML-based open standards for storing and exchanging various types of data. OpenDocument was incorporated as an official ISO standard on 11 November 2006. This standard is further elaborated in OpenDocument 1.1 (1 February 2007) and OpenDocument 1.2 (29 September 2011). The file formats are in fact ZIP compression files within which XML files and any additions such as images have been collected. The free Apache OpenOffice software uses OpenDocument formats as standard formats. Outside of this package, OpenDocuments enjoy good support. Microsoft Office has supported OpenOffice formats since version 2007, although problems may occur with specific properties of the OpenDocuments. In many countries, including the Netherlands, the use of open source software and OpenDocuments has been set as standard for making government documentation accessible. OpenDocument formats are suitable for archiving, accessibility and reusability in the long term. The OpenDocument format for text documents is OpenDocument Text (.odt). ODT is a preferred format under the Text documents file type." assertion.
- CROISSANT comment "" assertion.
- DANS-Datastation-SSH-MD-Template comment "" assertion.
- DANS-Datastation-SSH-MD-Template comment "" assertion.
- Geonames-ontology comment "The GeoNames geographical database contains over 25 million geographical names and consists of over 12 million unique features whereof 4.8 million populated places and 16 million alternate names." assertion.
- Dataverse-Subject-List comment "List of subject headings supported by Harvard Dataverse. " assertion.
- NARCIS-Classification comment "" assertion.
- DataverseAccessRequestService comment "A Dataverse installation account can be created with the Username/Email log in option, using the “Sign Up” page." assertion.
- GithubLogin comment "GitHub login is the process of authenticating using credentials used to access Github. " assertion.
- Getuigenverhalen comment "Collectie Interviews met ooggetuigen van gebeurtenissen in de Tweede Wereldoorlog (onderdeel van Oorlogsbronnen.nl,het startpunt voor iedereen die meer wil weten over de Tweede Wereldoorlog in Nederland)" assertion.
- CLARIAH-Media-Suite comment "The CLARIAH Media Suite is the central digital access point for AV collections and search and analysis tools in the Netherlands for students and researchers. It is part of the Dutch Infrastructure for Digital Humanities and Social Science developed in the CLARIAH project. The full version of the Media Suite is accessible to researchers and students from Dutch universities, higher education institutions and a select number of Dutch research institutions." assertion.
- CLARIAH-Media-Suite comment "The CLARIAH Media Suite is the central digital access point for AV collections and search and analysis tools in the Netherlands for students and researchers. It is part of the Dutch Infrastructure for Digital Humanities and Social Science developed in the CLARIAH project. The full version of the Media Suite is accessible to researchers and students from Dutch universities, higher education institutions and a select number of Dutch research institutions." assertion.
- CLARIAH-Media-Suite comment "The CLARIAH Media Suite is the central digital access point for AV collections and search and analysis tools in the Netherlands for students and researchers. It is part of the Dutch Infrastructure for Digital Humanities and Social Science developed in the CLARIAH project. The full version of the Media Suite is accessible to researchers and students from Dutch universities, higher education institutions and a select number of Dutch research institutions." assertion.
- assertion comment "Replace this with your assertion content." assertion.
- Matroska comment "The Matroska Multimedia Container is an open source, open standard multimedia container, that can wrap an unlimited number of audio, video, metadata and/or subtitle-files. Matroska Audio supports a large number of codecs. It has a very large userbase and is the basis for WebM media in HTML5/browsers. Most platforms offer native support for this format. Adoption of the file format is expected to further increase thanks to several outcomes of the PREFORMA project, including the Media Conch toolset, an implementation checker for Matroska. Matroska file extensions are .mkv for video (with subtitles and audio), .mka for audio-only files, and .mks for subtitles only. Matroska is a DANS preferred container format for audio and video." assertion.
- Hypermarker_proposal comment "Final proposal submitted for Hypermarker" assertion.
- DANSDataStationPolicy comment "The DANS Data Station Policy defines (among other things) the persistency policy applied by DANS." assertion.
- DANS_DataStationPolicy comment "The scope of the DANS Data Stations Policy (“the Policy”) is limited to datasets for which DANS performs ingestion, curation, archiving, long-term preservation, and dissemination. Such datasets are stored and managed in the repository services referred to as a ‘DANS Data Station’, which combines a public facing, domain specific repository based on Dataverse software where the datasets are managed and made accessible, and a service referred to as ‘the Vault’ where datasets are archived for long-term preservation. The Policy does not consider archiving and preservation of other materials, such as DANS’s web pages, internal and external documents, and digital objects in any other services that DANS provides. The Policy governs the obligations, responsibilities, and expectations of the role players." assertion.
- assertion comment "This Nanopublication is the duplicate of an existing Nanpublication." assertion.
- thanks comment "Thank you for this fantastic nanopub :)" assertion.
- thanks comment "I find this useful." assertion.
- thanks comment "This is a very nice project and well documented." assertion.
- assertion comment "Hello world!" assertion.
- 20964471.2024.2429847 comment "This is the core principle explaining why DGGS improves computational performance. By representing space as discrete zones rather than continuous coordinates, spatial operations become simple ID comparisons rather than complex geometric intersections. This fundamental insight underpins all the performance benefits demonstrated in the paper's benchmarks." assertion.
- 20964471.2024.2429847 comment "Technical detail for implementing horizontal scaling with H3. This partitioning strategy enables predictable memory allocation and balanced workload distribution across compute nodes. The fixed partition size is key to achieving efficient parallel processing for large-scale geospatial classification tasks." assertion.
- 20964471.2024.2429847 comment "Key finding from the raster benchmarking experiment. While conceptually similar, the DGGS approach combined with modern data engineering tools (Parquet, Polars) provides measurable performance improvements even compared to optimized raster workflows. This suggests DGGS adoption can be justified on efficiency grounds alone." assertion.
- 20964471.2024.2429847 comment "Critical finding demonstrating the scalability limitations of traditional vector workflows. As land-use mapping tasks require more input layers, vector-based approaches become increasingly impractical. DGGS offers a solution by replacing expensive geometric unions with simple attribute joins on zone IDs." assertion.