was successfully added to your cart.

The First Data Scientist

Scientists discover and become famous. Engineers invent and are recognized for their inventions.

Entrepreneurs innovate, creating businesses that simply follow in the path of discoveries and inventions. Electricity is the scientific discovery that is the origin of modern-day data science.  About 200 years ago several engineers began working on machines that created electrical sparks. They connected their simple spark-generating machines with wire.  In 1844, Samuel Morse achieved recognition for his telegraph machine and Morse Code by demonstrating its long-distance communications capability.  He and others electrified data. “Electric data” rivals the discovery of “fire” in terms of advancing human civilization. The message, that Morse sent in his 1844 demonstration was “What hath God Wrought?” suggesting he understood the transformational nature of his work.

The telegraph solved a communications problem. A group of entrepreneurs commercialized this technology: Western Union implemented the first transcontinental electronic communication network.  The electrification of data sparked both electronic communications and computing. So, was the first Data Scientist an ancient Sumerian, Egyptian or Greek mathematician or an electrical engineer? In my book, Samuel Morse is the father of modern data science. He was not a mathematician; he was an artist.  Samuel Morse lived at the height of the Industrial Revolution.

Human curiosity fuels creativity the source of our ideas. Humans refine their ideas by pursuing their curiosity.

Today, many of us associate Data Science with Machine Learning and AI (Artificial Intelligence) a.k.a. computer programing. We are programing computers to respond to data inputs.  Computers can learn fast respond immediately to data inputs.  AI is simply this programmed response to what is learned.  Humans respond to ideas and are not easily programmed.  So too, we are not programming computers to act on their own ideas.   I submit the value of Machine Learning is not in producing Artificial Intelligence. Its value is in advancing Human intelligence in the hopes of sparking an idea, or a shocking new discovery.

Early childhood learning starts with teaching a child to read and count. Computers learned to count employing a binary numbering system. Humans rely on computers for computational accuracy in processing numerical data.   We are rapidly improving computer reading-comprehension. The technology that improves reading comprehension is Natural Language Processing (NLP).   Imagine the possibilities that will come from accurate reading-comprehension. I imagine having dozens of computers reading thousands of pages of textual matter and consolidating what is learned in a subject-oriented Aggregate Knowledge Graph, a continuous process of “precision learning” that produces up-to-date reliable knowledge regarding an important topic.  This might be the answer to the question that Samuel Morse posed so long ago.

Metaphorically speaking, from the Cloud we are experiencing a torrential downpour of information. We must now turn this information into reliable knowledge that can be used to be confident in pursuing our ideas and making decisions. Human intelligence is not artificial, it’s 100% natural, and organic.  Data Scientists are extending our Knowledge Advantage over intelligent machines.

I do not consider myself to be a data “scientist”; I am more of a “mechanic”.  If you would like to learn more about the NLP tools, being used to improve computer reading-comprehension of medical data,  go to: https://www.johnsnowlabs.com/  or contact me to share your ideas: Richard@johnsnowlabs.com.

Precision Learning and Human Intelligence

How useful was this post?

Try Healthcare NLP

See in action
Our additional expert:
A serial software entrepreneur, Tanler authored a book (The Intranet Data Warehouse) as a way to establish his Company - Information Advantage, Inc. as a leader in what is now commonly referred to as "Business Intelligence" and “Data Warehousing". Tanler navigated his company from startup through an Initial Public Offering. He was recognized by Ernst & Young and named entrepreneur of the Year for the Midwest Region. “I love data, I mean I love those tiny little ‘bits’ of electricity, the sparks we turn into information. “ “Today, we are in the midst of a Digital Industrial Revolution. Some say Data in the ‘new oil’ powering digital industries. It’s not! The ‘new oil’ is electricity! Data is more like oxygen! It’s vital to human survival. From data we produce a torrential downpour of information. We translate information into the knowledge we share. This is a process of learning and when we commit what we learn to memory, we add to our intelligence. This how we have advanced human intelligence. Machines can be taught to learn, that’s step #1 Preserving what is learned as knowledge and committing it to memory is Step #2. In other words, Machine Learning is creating databases, Knowledge Graphs. It all starts with a supply of “oxygen”. That is how important data is today and has been throughout history. NLP Natural Language Processing is a branch of Machine Learning, used to extract the knowledge embedded in communications. The computerized knowledege Memory System for storing knowledge is a brain-like Knowledge Graph. Machine Learning produces Machine Intelligence for autonomous decision-making (AI) and to advance human Intelligence.

Automating Clinical Trial Master File Migration & Information Extraction

Pharmaceutical Companies who conduct clinical trials, looking to get new treatments to market as quickly as possible, possess a high volume of...