DBpedia is a crowd-sourced community effort to extract structured content from the information created in various Wikimedia projects. This structured information resembles an open knowledge graph (OKG) that is available to everyone on the Web. A knowledge graph is a special kind of database which stores knowledge in a machine-readable form and provides a means for information to be collected, organized, shared, searched and utilized. Google uses a similar approach to create the knowledge cards shown in its search results. We hope that this work will make it easier for the huge amount of information in Wikimedia projects to be used in new and interesting ways.
In this datastory, we conduct an exploratory analysis of the Iris dataset. The Iris flower dataset contains measurements of three related species of Iris flowers, collected to quantify their morphologic variation. Below, several tables built with SPARQL queries show the variation between and within the Iris flower species.
GoodRelations: The Web Vocabulary for E-Commerce
GoodRelations is a standardized vocabulary (also known as “schema”, “data dictionary”, or “ontology”) for product, price, store, and company data that (1) can be embedded into existing static and dynamic Web pages and (2) can be processed by other computers. This increases the visibility of your products and services in the latest generation of search engines, recommender systems, and other novel applications.
Gene Ontology (GO)
The Gene Ontology resource provides a computational representation of our current scientific knowledge about the functions of genes (or, more properly, the protein and non-coding RNA molecules produced by genes) from many different organisms, from humans to bacteria. It is widely used to support scientific research, and has been cited in tens of thousands of publications.
Understanding gene function—how individual genes contribute to the biology of an organism at the molecular, cellular and organism levels—is one of the primary aims of biomedical research. Moreover, experimental knowledge obtained in one organism is often applicable to other organisms, particularly if the organisms share the relevant genes because they inherited them from their common ancestor. The Gene Ontology (GO), as a consortium, began in 1998 when researchers studying the genome of three model organisms—Drosophila melanogaster (fruit fly), Mus musculus (mouse), and Saccharomyces cerevisiae (brewer’s or baker’s yeast)—agreed to work collaboratively on a common classification scheme for gene function, and today the number of different organisms represented in GO is in the thousands. GO makes it possible, in a flexible and dynamic way, to provide comparable descriptions of homologous gene and protein sequences across the phylogenetic spectrum.
GO is also at the hub of a major effort to represent the vast amount of biomedical knowledge in a computable form. GO is linked to many other biomedical ontologies, and is a foundation for research applying computer science in biology and medicine.
OWL-Time is an OWL-2 DL ontology of temporal concepts, for describing the temporal properties of resources in the world or described in Web pages. The ontology provides a vocabulary for expressing facts about topological (ordering) relations among instants and intervals, together with information about durations, and about temporal position including date-time information. Time positions and durations may be expressed using either the conventional (Gregorian) calendar and clock, or using another temporal reference system such as Unix-time, geologic time, or different calendars.
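As a rough illustration of the topological relations mentioned above, the sketch below classifies how two proper intervals relate and returns the name of the corresponding OWL-Time interval property. The function name `interval_relation` and the `time:` prefix strings are illustrative choices, not part of any library; the sketch assumes each interval is given as comparable start and end values with start strictly before end.

```python
from datetime import date


def interval_relation(a_start, a_end, b_start, b_end):
    """Return the OWL-Time property that relates interval A to interval B.

    Assumes proper intervals (start strictly before end). Any comparable
    values work: dates, datetimes, or plain numbers.
    """
    if not (a_start < a_end and b_start < b_end):
        raise ValueError("each interval must start before it ends")

    # Disjoint or touching intervals.
    if a_end < b_start:
        return "time:intervalBefore"
    if b_end < a_start:
        return "time:intervalAfter"
    if a_end == b_start:
        return "time:intervalMeets"
    if b_end == a_start:
        return "time:intervalMetBy"

    # Intervals that share a start and/or an end.
    if a_start == b_start and a_end == b_end:
        return "time:intervalEquals"
    if a_start == b_start:
        return "time:intervalStarts" if a_end < b_end else "time:intervalStartedBy"
    if a_end == b_end:
        return "time:intervalFinishes" if a_start > b_start else "time:intervalFinishedBy"

    # One interval strictly inside the other.
    if b_start < a_start and a_end < b_end:
        return "time:intervalDuring"
    if a_start < b_start and b_end < a_end:
        return "time:intervalContains"

    # Remaining case: partial overlap.
    return "time:intervalOverlaps" if a_start < b_start else "time:intervalOverlappedBy"


if __name__ == "__main__":
    q1 = (date(2020, 1, 1), date(2020, 4, 1))
    q2 = (date(2020, 4, 1), date(2020, 7, 1))
    print(interval_relation(*q1, *q2))
```

Because the comparison only relies on ordering, the same sketch applies to positions in other temporal reference systems (for example Unix time as integers), as long as both intervals use the same one.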
Smithsonian American Art Museum (SAAM)
The Smithsonian American Art Museum (SAAM), the United States' first collection of American art, is home to one of the largest and most inclusive collections of American art in the world. Its artworks reveal America’s rich artistic and cultural history from the colonial period to today, and more than 7,000 artists are represented in the collection. The museum has been a leader in identifying and collecting significant aspects of American visual culture, including photography, modern folk and self-taught art, African American art, Latino art, and video games. The museum has the largest collection of New Deal art and exceptional collections of contemporary craft, American impressionist paintings, and masterpieces from the Gilded Age, and maintains six online research databases with more than half a million records, including the Inventories of American Painting and Sculpture, which document more than 400,000 artworks in public and private collections worldwide.
COVID-19 statistics for the Netherlands
Pleiades is a community-built gazetteer and graph of ancient places. It publishes authoritative information about ancient places and spaces, providing unique services for finding, displaying, and reusing that information under open license. It publishes not just for individual human users, but also for search engines and for the widening array of computational research and visualization tools that support humanities teaching and research.
Wikidata is a free and open knowledge base that can be read and edited by both humans and machines.
Wikidata acts as central storage for the structured data of its Wikimedia sister projects including Wikipedia, Wikivoyage, Wiktionary, Wikisource, and others.
Wikidata also provides support to many other sites and services beyond just Wikimedia projects! The content of Wikidata is available under a free license, exported using standard formats, and can be interlinked to other open data sets on the linked data web.
The World Factbook
The World Factbook, also known as the CIA World Factbook, is a reference resource produced by the Central Intelligence Agency with almanac-style information about the countries of the world.
Linked Internet Movie Database (IMDb)
IMDb is an online database of information related to world films, television programs, home videos and video games, and internet streams, including cast, production crew, personnel and fictional character biographies, plot summaries, trivia, and fan reviews and ratings.
Triply has converted the famous Iris flower dataset to linked data! It is a multivariate dataset that quantifies the morphologic variation of Iris flowers of three different species, measured in four different properties. In this data cube, each species of Iris occurs 50 times and this linked data version uses the RDF Data Cube Vocabulary.
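To make the data cube structure more concrete, here is a minimal sketch of how a per-species average could be queried. It builds a SPARQL query string over `qb:Observation` resources from the RDF Data Cube Vocabulary. The two property IRIs passed in are hypothetical placeholders (the actual dimension and measure IRIs of the Iris dataset are not given here), and the sketch assumes each observation carries the species and the measured value directly.

```python
# Namespace of the RDF Data Cube Vocabulary.
QB = "http://purl.org/linked-data/cube#"


def mean_per_species_query(species_property: str, measure_property: str) -> str:
    """Build a SPARQL query that averages one measure per species.

    Both arguments are full IRIs. They are assumptions for illustration:
    look up the real property IRIs in the dataset before running the query.
    """
    return f"""PREFIX qb: <{QB}>
SELECT ?species (AVG(?value) AS ?mean) (COUNT(?obs) AS ?n)
WHERE {{
  ?obs a qb:Observation ;
       <{species_property}> ?species ;
       <{measure_property}> ?value .
}}
GROUP BY ?species
ORDER BY ?species"""


if __name__ == "__main__":
    # Hypothetical IRIs, used only to show the shape of the query.
    print(mean_per_species_query(
        "https://example.org/iris/def/species",
        "https://example.org/iris/def/sepalLength",
    ))
```

The generated text can be pasted into a SPARQL editor once the placeholder IRIs are replaced with the ones the dataset actually uses.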
How to start a SPARQL service
TriplyDB allows you to expose your dataset through SPARQL. Exposing your data via SPARQL gives you the opportunity to create SPARQL queries and datastories over your own dataset or datasets from others. On TriplyDB you can already find several examples of SPARQL queries. But creating your own SPARQL queries requires you to first start a SPARQL service over your dataset. The following step-by-step guide helps you to start a SPARQL service.
- Go to the Services page, where you'll see a form to create a SPARQL service.
- To create a SPARQL service, fill in a name for your service and select SPARQL from the three options.
- Click on Create service to confirm your choices and a SPARQL service will be started.
- Wait until the status of the service changed to
- A new option called SPARQL will appear in the sidebar. Clicking the button opens the SPARQL editor where you can write queries over your dataset.
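Besides the SPARQL editor, a running SPARQL service can typically also be queried over HTTP. The sketch below uses only the Python standard library and follows the SPARQL 1.1 Protocol convention of POSTing the query text with the `application/sparql-query` content type. The endpoint URL is a hypothetical placeholder, and the helper names `build_sparql_request` and `run_query` are illustrative; check the service page of your own dataset for the actual endpoint address and any authentication it requires.

```python
import json
import urllib.request


def build_sparql_request(endpoint_url: str, query: str) -> urllib.request.Request:
    """Prepare (but do not send) a SPARQL 1.1 Protocol POST request."""
    return urllib.request.Request(
        endpoint_url,
        data=query.encode("utf-8"),  # unencoded query text as the request body
        headers={
            "Content-Type": "application/sparql-query",
            "Accept": "application/sparql-results+json",
        },
        method="POST",
    )


def run_query(request: urllib.request.Request) -> dict:
    """Send the request and parse the JSON results document."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)


if __name__ == "__main__":
    # Placeholder endpoint: replace with the URL of your own SPARQL service.
    request = build_sparql_request(
        "https://example.org/sparql",
        "SELECT * WHERE { ?s ?p ?o } LIMIT 10",
    )
    print(request.get_method(), request.full_url)
    # results = run_query(request)  # uncomment once the endpoint is real
```

Separating request construction from sending keeps the sketch easy to inspect before any network call is made.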
How to import DBpedia
The Iris dataset reuses classes, properties and resources from DBpedia. This not only reduces the amount of maintenance; by reusing objects from DBpedia we can also make use of the links that DBpedia has already created. But before you can use the objects from DBpedia, you first need to import DBpedia into the Iris dataset. The following step-by-step guide helps you do exactly that.
- Go to the Graphs page and click on import a new graph.
- Click on Add data from an existing dataset.
- Type in DBpedia-association / dbpedia and select it from the dropdown menu.
- The page should now change, and one graph is now selected. This graph consists of 369,205,380 statements and is the full DBpedia dataset.
- To import this into your dataset, click import 1 graphs. This will add the DBpedia graph to your dataset.
- You've now imported the DBpedia graph into your dataset. You can now use the browser and see more information about DBpedia resources.
- To remove the DBpedia graph from your dataset, go to the Graphs page and remove it by clicking on the https://triplydb.com/wikimedia/dbpedia/graphs/default graph. This will remove your local connection to DBpedia.
PS: It is not allowed to start or sync a service when DBpedia is added as a graph. To start a service you will first need to remove the DBpedia graph by following step 7.
WordNet is a large lexical database of English. Nouns, verbs, adjectives and adverbs are grouped into sets of cognitive synonyms (synsets), each expressing a distinct concept. Synsets are interlinked by means of conceptual-semantic and lexical relations. The resulting network of meaningfully related words and concepts can be navigated with the browser. WordNet's structure makes it a useful tool for computational linguistics and natural language processing.