已入深夜,您辛苦了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!祝你早点完成任务,早点休息,好梦!

Visualizing matrices

计算机科学 可视化 基质(化学分析) 表(数据库) 任务(项目管理) 数据挖掘 化学计量学 降维 数据科学 情报检索 灰度 人工智能 机器学习 图像(数学) 材料科学 管理 经济 复合材料
作者
Richard G. Brereton
出处
期刊:Journal of Chemometrics [Wiley]
卷期号:31 (3) 被引量:6
标识
DOI:10.1002/cem.2834
摘要

One of the most important jobs of the applied scientist is to visualize data. Instead of tables of numbers, graphs that are easily comprehensible to fellow scientists often help understand trends. Chemometrics has two main roles, a qualitative one that allows the data scientist to view the data and a quantitative role that involves modeling and predictions. Being able to turn matrices into graphs is an important task. Often tabular data are too complicated to readily understand; for example, a spectroscopist usually visualizes their spectrum rather than examines a table of numbers. As another example, a grayscale picture could be represented by matrix consisting of a table of pixilated intensities, but that is hard for the human eye to understand. This visualization can be conceptual, for example, in more than 3 dimensions, but the "picture" is held in the computer, and these simplifications are part of important mathematical approaches for determining the main trends in data. In this article, we will primarily illustrate using matrices of low dimensionality, but the principles can easily be extended to however many dimensions are required, and valuable information can be extracted from virtual multidimensional representations. An expert in multivariate analysis needs a cool mind to ensure he or she distinguishes carefully related but similar ideas. There are two different ways of representing this matrix. In practice we deal with much more complex datasets; for example, a 100 × 20 dataset may be the result of analyzing 100 objects using 20 variables, perhaps the result of analyzing 100 plant extracts and measuring the intensity of 20 GCMS peaks on each extract. In such circumstance the variable space is an imaginary 20-dimensional space and the object space 100-dimensional. In previous articles we have discussed the effect of column centering.1 In our case the matrix is transformed to , and the relationships between the new vectors change. In fact the distance between the objects in variable space is unchanged although they are now centered around the origin, but the distance between the variables in object space changes. Hence, according to whether we center a dataset or not we may come to different conclusions as to how different variables (for example, marker compounds) relate. It is important to remember that linear independence has a specific algebraic and geometric definition. For example, the matrix has rank 2, or a column/row space that is 2-dimensional, so is rank deficient, as the third column equals the first plus twice the second. However, the matrix is not, even though, the third column can be exactly obtained from the first 2 by subtracting 7 from the third column in the first matrix and hence requires 3 dimensions to represent it in either variable or row space: it is easy to show using standard approaches in Excel or matlab or any other favorite language or environment that the first matrix has zero determinant and is singular but the latter has a determinant of −42 and is invertible. In the last few articles we have discussed a variety of related concepts. These include linear independence, rank, column and row space, singular matrices, correlation, and orthogonality. Many of these concepts are viewed as similar to each other, but there are differences, often skated over in chemometrics or indeed many statistical textbooks and papers. The effect of centering also influences the relationship between columns of a matrix and so their overall properties. Most courses and general texts are not aimed at chemometrics experts; the majority aimed towards physicists and engineers who have quite different needs. Even the definition of a vector in chemometrics differs from that in physics. But an expert in multivariate analysis, which is an important basis of chemometrics, needs a cool mind to ensure he or she distinguishes carefully these related but similar ideas. There are many documents available that discuss these concepts in substantially more detail, a few of which are referenced.3-6 Chemometrics experts should be aware that the ideas in this article are part of quite sophisticated treatments of matrix algebra and some may wish to delve further.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
研友_LBR9gL完成签到 ,获得积分10
刚刚
阿腾完成签到,获得积分10
刚刚
old幽露露完成签到 ,获得积分10
1秒前
WuFen完成签到 ,获得积分10
2秒前
ding应助zyf采纳,获得10
4秒前
8秒前
时代更迭完成签到,获得积分10
15秒前
周淼完成签到,获得积分10
17秒前
月冷完成签到 ,获得积分10
21秒前
22秒前
23秒前
23秒前
莫问题完成签到,获得积分10
25秒前
26秒前
英姑应助时代更迭采纳,获得30
26秒前
HJJHJH发布了新的文献求助50
26秒前
高山七石完成签到,获得积分10
27秒前
zyf发布了新的文献求助10
27秒前
小王子发布了新的文献求助10
32秒前
Lucas应助HJJHJH采纳,获得10
33秒前
zyf完成签到,获得积分10
34秒前
34秒前
万1发布了新的文献求助10
35秒前
35秒前
黄豆发布了新的文献求助10
40秒前
40秒前
dfwm完成签到,获得积分10
43秒前
43秒前
44秒前
上官若男应助畅快的涵蕾采纳,获得10
44秒前
49秒前
chenjingjing发布了新的文献求助10
49秒前
李健应助时代更迭采纳,获得30
51秒前
Pauline完成签到 ,获得积分10
57秒前
cling完成签到 ,获得积分10
57秒前
慎二完成签到 ,获得积分10
1分钟前
吴兰田完成签到,获得积分10
1分钟前
温婉的香水完成签到 ,获得积分10
1分钟前
1分钟前
sxmt123456789发布了新的文献求助10
1分钟前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
The Social Work Ethics Casebook: Cases and Commentary (revised 2nd ed.).. Frederic G. Reamer 1070
Introduction to Early Childhood Education 1000
2025-2031年中国兽用抗生素行业发展深度调研与未来趋势报告 1000
List of 1,091 Public Pension Profiles by Region 901
Item Response Theory 800
Identifying dimensions of interest to support learning in disengaged students: the MINE project 600
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 生物化学 物理 纳米技术 计算机科学 内科学 化学工程 复合材料 物理化学 基因 遗传学 催化作用 冶金 量子力学 光电子学
热门帖子
关注 科研通微信公众号,转发送积分 5426242
求助须知:如何正确求助?哪些是违规求助? 4540046
关于积分的说明 14171474
捐赠科研通 4457840
什么是DOI,文献DOI怎么找? 2444698
邀请新用户注册赠送积分活动 1435630
关于科研通互助平台的介绍 1413164