Matminer: An open source toolkit for materials data mining

Python(编程语言) 开源 实施 数据挖掘 可视化 范围(计算机科学) 计算机科学 数据可视化 数据结构 数据科学 软件工程 软件 程序设计语言
作者
Logan Ward,Alexander Dunn,Alireza Faghaninia,Nils Zimmermann,Saurabh Bajaj,Qi Wang,Joseph H. Montoya,Jiming Chen,Kyle Bystrom,Maxwell Dylla,Kyle Chard,Mark Asta,Kristin A. Persson,G. Jeffrey Snyder,Ian Foster,Anubhav Jain
出处
期刊:Computational Materials Science [Elsevier]
卷期号:152: 60-69 被引量:671
标识
DOI:10.1016/j.commatsci.2018.05.018
摘要

Abstract As materials data sets grow in size and scope, the role of data mining and statistical learning methods to analyze these materials data sets and build predictive models is becoming more important. This manuscript introduces matminer, an open-source, Python-based software platform to facilitate data-driven methods of analyzing and predicting materials properties. Matminer provides modules for retrieving large data sets from external databases such as the Materials Project, Citrination, Materials Data Facility, and Materials Platform for Data Science. It also provides implementations for an extensive library of feature extraction routines developed by the materials community, with 47 featurization classes that can generate thousands of individual descriptors and combine them into mathematical functions. Finally, matminer provides a visualization module for producing interactive, shareable plots. These functions are designed in a way that integrates closely with machine learning and data analysis packages already developed and in use by the Python data science community. We explain the structure and logic of matminer, provide a description of its various modules, and showcase several examples of how matminer can be used to collect data, reproduce data mining studies reported in the literature, and test new methodologies.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
3秒前
大炮运输发布了新的文献求助100
4秒前
tjolinchen发布了新的文献求助10
5秒前
6秒前
HY发布了新的文献求助10
7秒前
科研通AI2S应助科研通管家采纳,获得10
7秒前
所所应助科研通管家采纳,获得10
8秒前
cocolu应助科研通管家采纳,获得10
8秒前
小马甲应助科研通管家采纳,获得10
8秒前
Hello应助科研通管家采纳,获得10
8秒前
科研通AI2S应助科研通管家采纳,获得10
8秒前
慕青应助科研通管家采纳,获得10
8秒前
cocolu应助科研通管家采纳,获得10
8秒前
小蘑菇应助科研通管家采纳,获得10
8秒前
8秒前
fuu发布了新的文献求助10
8秒前
听风发布了新的文献求助10
10秒前
早睡早起健康长寿完成签到,获得积分10
10秒前
11秒前
12秒前
Lucas应助不将就采纳,获得10
13秒前
13秒前
大炮运输完成签到,获得积分10
14秒前
14秒前
试验顺利应助KhanhTran采纳,获得10
15秒前
JamesPei应助听风采纳,获得10
16秒前
16秒前
硅负极发布了新的文献求助10
16秒前
17秒前
实验菜菜君完成签到 ,获得积分10
17秒前
学术渣渣发布了新的文献求助10
21秒前
NexusExplorer应助壮观的晓瑶采纳,获得10
21秒前
22秒前
23秒前
tjolinchen完成签到,获得积分20
26秒前
DAWN关注了科研通微信公众号
26秒前
SN完成签到,获得积分0
26秒前
28秒前
一一应助硅负极采纳,获得10
28秒前
高分求助中
Solution Manual for Strategic Compensation A Human Resource Management Approach 1200
Natural History of Mantodea 螳螂的自然史 1000
Glucuronolactone Market Outlook Report: Industry Size, Competition, Trends and Growth Opportunities by Region, YoY Forecasts from 2024 to 2031 800
A Photographic Guide to Mantis of China 常见螳螂野外识别手册 800
Autoregulatory progressive resistance exercise: linear versus a velocity-based flexible model 500
The analysis and solution of partial differential equations 400
Spatial Political Economy: Uneven Development and the Production of Nature in Chile 400
热门求助领域 (近24小时)
化学 医学 生物 材料科学 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 物理化学 催化作用 细胞生物学 免疫学 冶金
热门帖子
关注 科研通微信公众号,转发送积分 3334447
求助须知:如何正确求助?哪些是违规求助? 2963653
关于积分的说明 8610845
捐赠科研通 2642632
什么是DOI,文献DOI怎么找? 1446831
科研通“疑难数据库(出版商)”最低求助积分说明 670421
邀请新用户注册赠送积分活动 658611