• Data Science Radar测试结果


    众所周知,数据科学(Data Science)是一门交叉十分严重的的学科,混杂了数理统计、数据挖掘、模式识别、人工智能、编程等众多领域,各个领域都有非常多的概念与定义重合。

    这样复杂的学科背景让人无所适从,导致虽然很多人认为自己是做数据的,但不同的人具体的工作内容可能千差万别。

    国外一个做数据服务的网站mango-solutions将从事数据科学工作的人员分成了6个不同的方向,分别为Communicator,Data Wrangler, Modeller, Programmer, Technologist, Visualiser.并做了一个radar test,帮从业人员认识自己的角色。

    具体每个角色的定义与描述并没有直接给出。做完测试后会根据所属的角色给出相应的解释。

    记录下自己的测试结果: Communicator.

    是个偏传统的数据分析,主要给领导解读数据结果的角色。

    All great data scientists are master communicators.

    After all, data doesn't sell itself; it needs a communicator to guide the way.

    You are able to lead key business decision makers into an ongoing conversation with data, rather than carrying out ad hoc analyses.

    You have a natural ability to communicate complex technical details to non-technical audiences

    You also have an understanding of the wider implications of the project, and so convey the key analysis insights to influence new business directions.

    As a communicator you also understand that communication is a two-fold process: explain well, listen well.

    You listen for business challenges, define requirements and clarify how data analysis can help.

    Even on projects using highly technical software and mathematical methods, you are able to speak a language that each of your stakeholders understands!

    尽管我认为自己应该算个Modeller或者Programmer。或许我对Programmer的定义有什么误解?

    下面是关于Modeller的描述:

    By creating quantitative descriptions of your data, you create insight that is a key deliverable for your team.

    You interpret the meaningful reasons for features in a dataset.

    You also pay attention to the detail of underlying assumptions, limits and exceptions when describing a system.

    You are familiar with a variety of mathematical methods for describing dynamic systems and are highly skilled in using software that implements these.

    You use a variety of graphical and numeric techniques to verify that you are delivering a high quality result that can be used to predict and optimise future performance.

    When you are on the team, if there is information that can be gleaned from a system, you will find it.

  • 相关阅读:
    基于redis实现的延迟消息队列
    Redis实现求交集操作结果缓存的设计方案
    限流算法之漏桶算法、令牌桶算法
    Apache设置防DDOS模块mod_evasive
    FastCGI技术
    详解强大的SQL注入工具——SQLMAP
    nginx根据域名做http,https分发
    Nginx配置SSL证书部署HTTPS网站
    JProfiler学习笔记
    Mysql压测工具mysqlslap 讲解
  • 原文地址:https://www.cnblogs.com/oDoraemon/p/10804077.html
Copyright © 2020-2023  润新知