Data intensive study of accessibility of edible species across the globe


Variety of accessibility to edible species in different regions has climatic and historical roots. In the present study, we try to systematically analyze 28,064 records of relationships between 11,752 edible species and 228 geographic zones by hierarchical clustering. 228 geographic regions were classified into 10 super groups named as A to K, which were divided into 39 clusters (CLs). Of them, at least one member of each of 28 CLs is associated to 20 or more edible species according to present information of KNApSAcK DB (http://kanaya.naist.jp/KNApSAcK_World/top.jsp). We examined those 28 CLs and found that majority of the members of each of the 27 CLs (96%) have specific type of climate. Diversity of accessibility to edible species makes it possible to separate 8 geographic regions on continental landmasses namely Mediterraneum, Baltic Sea, Western Europe, Yucatan Peninsula, South America, Africa and Arabian Peninsula, Southeast Asia, Arctic Ocean; and three archipelagos namely, Caribbean Islands, Southeast Asian Islands, and Pacific Islands. In addition, we also examined clusters based on cultural exchanges by colonization and migration and mass movement of people and material by modern transportation and trades as well as biogeographic factors. The era of big data science or data intensive science make it possible to systematically understand the content in huge data and how to acquire suitable data for specific purposes. Human healthcare should be considered on the basis of culture, climate, accessibility of edible foods and preferences, and based on molecular level information of genome and digestive systems.