Visualizing matrices

计算机科学 可视化 基质(化学分析) 表(数据库) 任务(项目管理) 数据挖掘 化学计量学 降维 数据科学 情报检索 灰度 人工智能 机器学习 图像(数学) 复合材料 经济 管理 材料科学
作者
Richard G. Brereton
出处
期刊:Journal of Chemometrics [Wiley]
卷期号:31 (3) 被引量:6
标识
DOI:10.1002/cem.2834
摘要

One of the most important jobs of the applied scientist is to visualize data. Instead of tables of numbers, graphs that are easily comprehensible to fellow scientists often help understand trends. Chemometrics has two main roles, a qualitative one that allows the data scientist to view the data and a quantitative role that involves modeling and predictions. Being able to turn matrices into graphs is an important task. Often tabular data are too complicated to readily understand; for example, a spectroscopist usually visualizes their spectrum rather than examines a table of numbers. As another example, a grayscale picture could be represented by matrix consisting of a table of pixilated intensities, but that is hard for the human eye to understand. This visualization can be conceptual, for example, in more than 3 dimensions, but the "picture" is held in the computer, and these simplifications are part of important mathematical approaches for determining the main trends in data. In this article, we will primarily illustrate using matrices of low dimensionality, but the principles can easily be extended to however many dimensions are required, and valuable information can be extracted from virtual multidimensional representations. An expert in multivariate analysis needs a cool mind to ensure he or she distinguishes carefully related but similar ideas. There are two different ways of representing this matrix. In practice we deal with much more complex datasets; for example, a 100 × 20 dataset may be the result of analyzing 100 objects using 20 variables, perhaps the result of analyzing 100 plant extracts and measuring the intensity of 20 GCMS peaks on each extract. In such circumstance the variable space is an imaginary 20-dimensional space and the object space 100-dimensional. In previous articles we have discussed the effect of column centering.1 In our case the matrix is transformed to , and the relationships between the new vectors change. In fact the distance between the objects in variable space is unchanged although they are now centered around the origin, but the distance between the variables in object space changes. Hence, according to whether we center a dataset or not we may come to different conclusions as to how different variables (for example, marker compounds) relate. It is important to remember that linear independence has a specific algebraic and geometric definition. For example, the matrix has rank 2, or a column/row space that is 2-dimensional, so is rank deficient, as the third column equals the first plus twice the second. However, the matrix is not, even though, the third column can be exactly obtained from the first 2 by subtracting 7 from the third column in the first matrix and hence requires 3 dimensions to represent it in either variable or row space: it is easy to show using standard approaches in Excel or matlab or any other favorite language or environment that the first matrix has zero determinant and is singular but the latter has a determinant of −42 and is invertible. In the last few articles we have discussed a variety of related concepts. These include linear independence, rank, column and row space, singular matrices, correlation, and orthogonality. Many of these concepts are viewed as similar to each other, but there are differences, often skated over in chemometrics or indeed many statistical textbooks and papers. The effect of centering also influences the relationship between columns of a matrix and so their overall properties. Most courses and general texts are not aimed at chemometrics experts; the majority aimed towards physicists and engineers who have quite different needs. Even the definition of a vector in chemometrics differs from that in physics. But an expert in multivariate analysis, which is an important basis of chemometrics, needs a cool mind to ensure he or she distinguishes carefully these related but similar ideas. There are many documents available that discuss these concepts in substantially more detail, a few of which are referenced.3-6 Chemometrics experts should be aware that the ideas in this article are part of quite sophisticated treatments of matrix algebra and some may wish to delve further.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
风笛发布了新的文献求助10
1秒前
1秒前
陈陈陈晨完成签到,获得积分10
1秒前
1秒前
1秒前
whx99222发布了新的文献求助10
3秒前
lpd完成签到,获得积分10
3秒前
常常完成签到,获得积分10
4秒前
轻舟完成签到 ,获得积分10
4秒前
在水一方应助linxi采纳,获得10
5秒前
5秒前
15566发布了新的文献求助10
7秒前
Dawn完成签到,获得积分10
7秒前
125ljw发布了新的文献求助10
7秒前
时倾完成签到 ,获得积分10
8秒前
华仔应助还单身的人英采纳,获得10
8秒前
8秒前
8秒前
西津渡完成签到,获得积分10
9秒前
Chillichee应助liuzengzhang666采纳,获得100
9秒前
与闲完成签到,获得积分10
9秒前
11秒前
linxi完成签到,获得积分10
11秒前
Jasper应助熹熹采纳,获得10
12秒前
12秒前
呐呐呐发布了新的文献求助10
12秒前
深情安青应助Zlla1024采纳,获得10
12秒前
Jasper应助125ljw采纳,获得10
13秒前
打打应助cui采纳,获得10
13秒前
夕诙应助小木木壮采纳,获得10
14秒前
whx99222发布了新的文献求助10
14秒前
67ai完成签到,获得积分10
16秒前
西津渡发布了新的文献求助10
16秒前
novia完成签到,获得积分10
16秒前
16秒前
毛豆应助liuzengzhang666采纳,获得10
16秒前
劲秉应助幸运星采纳,获得20
17秒前
慕青应助吴子秋采纳,获得10
17秒前
Gilana发布了新的文献求助10
18秒前
酷波er应助槿川采纳,获得10
18秒前
高分求助中
Production Logging: Theoretical and Interpretive Elements 2500
Востребованный временем 2500
Agaricales of New Zealand 1: Pluteaceae - Entolomataceae 1040
지식생태학: 생태학, 죽은 지식을 깨우다 600
海南省蛇咬伤流行病学特征与预后影响因素分析 500
Neuromuscular and Electrodiagnostic Medicine Board Review 500
ランス多機能化技術による溶鋼脱ガス処理の高効率化の研究 500
热门求助领域 (近24小时)
化学 医学 材料科学 生物 工程类 有机化学 生物化学 纳米技术 内科学 物理 化学工程 计算机科学 复合材料 基因 遗传学 物理化学 催化作用 细胞生物学 免疫学 电极
热门帖子
关注 科研通微信公众号,转发送积分 3462155
求助须知:如何正确求助?哪些是违规求助? 3055764
关于积分的说明 9049223
捐赠科研通 2745354
什么是DOI,文献DOI怎么找? 1506226
科研通“疑难数据库(出版商)”最低求助积分说明 696019
邀请新用户注册赠送积分活动 695574