Jacob J. Walker's Blog

Scholarly Thoughts, Research, and Journalism for Informal Peer Review

Archive for the ‘Data Science’ Category

Delaying 5 Ways Data Science can Help Education Series

without comments

I’m delaying my series about 5 ways Data Science can help education…  I actually didn’t mean to post Saturday’s intro to the series, but I often post things ahead of time…  And I just forgot to stop it from publishing….  Also, this week there has been some activity about educational content standards that I’m involved with, that I hope to post about…

Written by Jacob Walker

May 16th, 2017 at 6:46 am

5 Ways Data Science and the Intelligent Web can help Schools and Education

without comments

5 ways to...This past week, I shared a simplified introduction to what is done with data science/machine learning/data mining/predictive analytics work, and the major tasks / roles.  This coming week I’m going to share about how I think data science combined with the “intelligent web” (sometimes called Web 3.0 or above) can benefit human education and thus humanity.  Some of these ideas can be done at the school level, others are probably better done by vendors, and yet others are best done by governmental organizations or associations.  But each of them can make a big difference, if done well and ethically.  And to not keep you too much in suspense, here are the ones I’ll be posting about this week:

Read the rest of this entry »

Data Artistry: Using and Sharing the Knowledge in an Effective Manner

with 2 comments

Can you picture that?” – Dr. Teeth and The Electric Mayhem

The final stage of doing data science/machine learning/data mining/predictive analytics is to use the results, which generally involves some form of communication to one or more types of audiences.  This, I will term “data artistry”. (This is not necessarily a common term used, but it does have some precedence in specific contexts)

Read the rest of this entry »

Written by Jacob Walker

May 12th, 2017 at 11:59 am

Data Mining: Discovering Gold in your Data

with one comment

There’s gold in dem dere data!” – Adaptation of the original quote from M. F. Stephenson

After the data has been gathered and in a form that can be used, it can then have an appropriate algorithm used to accomplish the data mining/machine learning/predictive analytics. This is the stage that traditionally has been called “data mining” because it is the part that gets additional value from the data in the form of some type of knowledge (this is why early on, the process was sometimes called “knowledge discovery in data” (KDD).

Read the rest of this entry »

Written by Jacob Walker

May 11th, 2017 at 11:59 am

Data Wrangling: Gathering the Data You Need in a Form You Can Use

with one comment

Data! Data! Data!’ I can’t make bricks without clay.” – Sherlock Holmes

Before data science/machine learning/data mining/predictive analytics can be done, you need to have the data you are going to use.  This may see obvious, but in many cases there is more to this step than may first be assumed, and the whole process is what I will call “data wrangling”, although has other names like “data munging”.

Read the rest of this entry »

Written by Jacob Walker

May 10th, 2017 at 11:59 am

Data Surfing: The Oft Forgotten First Stage of Discovery

with one comment

You got to drift in the breeze before you set your sails. It’s an occupation where the wind prevails. Before you set your sails drift in the breeze.” – Paul Simon

Many texts about data science (including machine learning, data mining, and predictive analytics) don’t include much about the very first step of the process, which is the step where you come up with what your goal is for your other steps.  In traditional science, this might be called the step of making your hypothesis.

Read the rest of this entry »

Written by Jacob Walker

May 9th, 2017 at 11:59 am

The Four Major Activities of Data Science / Machine Learning

without comments

Recently there was a post on LinkedIn by Erle Hall, lead for the Information and Communication Technologies (ICT) for the California Department of Education (CDE) with a diagram about machine learning.  That diagram had 6 steps: Select Data, Model Data, Validate Model, Test Model, Use the Model, and Tune Model.    Those 6 steps mostly encapsulate what traditionally has been called the “data mining” phase.  But there are 3 other important phases, which I will call “data surfing”, “data wrangling” and “data artistry”.  (These names were chosen to be easier to understand and more interesting for students, but also go by different names)  I also personally prefer to use the term “algorithm” instead of “model”, because while traditionally in data science, statistical models were used, there are now often times methods like neural networks and other such algorithms that are less like a traditional statistical model.  In the next few posts, I’ll dive into each of these 4 steps, and give a basic explanation of what each step does, and why the step is important.

Written by Jacob Walker

May 8th, 2017 at 11:59 am

Why Johnny Can’t Compute: The Failure of the Old Math

without comments

Why Johnny Can't ComputeNearly every leader in our nation is saying that we need to have students get more STEM education (Science, Technology, Engineering, and Math), so that our country will not fall behind technologically and economically from the rest of the world. But, what they don’t say (possibly, because they don’t know), is that the type of math that is needed for Information and Communication Technologies (ICT) and Computer Science (CS) is not the math that is normally taught in high school.

Read the rest of this entry »

Thought of the Day: “Probability theory is telling us something about the way our own minds operate”

without comments

I have started to read the book Probability Theory: The Logic of Science, by the late E. T. Jaynes.  From what I understand so far, I think there is a high plausibility that it will help me have a more unified and deeper understanding of probability (and hence statistics).   In reading the preface, he makes some interesting observations about probability and human thinking, and it seems quite apropos, and relevant to the recent advances in the fields of artificial intelligence, such as the recent match of Go.

A quote from the book that particularly struck me was the following:

… it is clear that probability theory is telling us something about the way our own minds operate when we form intuitive judgments, of which we may not have been consciously aware. Some may feel uncomfortable at these revelations; others may see in them useful tools for psychological, sociological, or legal research.

Written by Jacob Walker

March 10th, 2016 at 11:59 am

The Introduction/Background to my Revised Doctoral Research Proposal

with one comment

Compendium_of_Countries_LogoToday, I finished revising the first part of my doctoral research proposal, as there have been several underlying methodological and technological changes from the original proposal.  While I know doctoral research is usually not of general interest, I am still going to be posting the sections of my revised proposal as I finish them, for those who are interested.  Please feel free to ask questions if you have them, and I will do my best to explain statistical techniques or the technology, etc. that I’m talking about.

Read the rest of this entry »

Written by Jacob Walker

February 20th, 2016 at 6:12 pm