Category Archive
Predict

Datawatch Angoss 10.8 Released: Integration with Monarch Swarm

Datawatch Angoss 10.8 released! Key Features: Integration with Monarch Swarm. Datawatch Angoss analytics software applications now integrate with Swarm: Angoss datasets can be exported to Swarm; Swarm files can be imported into Angoss datasets; and Angoss models can be published to Swarm (in PMML format). Supported model types for model export: Decision Trees, Logistic Regression, Linear Regression, Regularization, and Constrained Regression. Note: Swarm integration features require Python version 3.7 or…

Hindsight Isn’t Always 20/20 In Data Analytics

There’s a popular saying that hindsight is 20/20, but it isn’t always true when it comes to analytics. To understand what really happened during the last month, quarter or year, you need to start with clean and complete datasets, utilizing the domain expertise from across the entire organization. And advancing your data strategy beyond descriptive “Hindsight” analytics to predictive or prescriptive analytics requires even more time, labor and analytic expertise. For most organizations, this is often easier said than done. …

Optimization: Moving from Insight to Actionable Foresight

Authored by Chris Long, Pre-Sales Leader, Datawatch Angoss. You’ve probably seen this chart or one like it recently. Most organizations are finding their analytics efforts are somewhere between descriptive and predictive. Few have been able to move effectively from predictive to prescriptive; the rest rely on rules of thumb or gut feel to apply analytic learnings. Many of those who’ve effectively moved into prescriptive analytics have started using segmentation tools like strategy trees. Strategy…

Why I Love the New Filtering Join Node

Ok, so full disclosure: at Angoss we have an idea wall for new product features, and I’m the one who wrote “Filtering Join” on a pink sticky note and placed it on the wall. So maybe I was destined to love the Filtering Join node even if it wasn’t great. But I do think it’s great. It’s an incredibly valuable node, and I use it all the time, and today I want to tell…

How to Include or Exclude Variables in a Strategy or Decision Tree

March 6, 2018, by Dr. Eric Hart. Including or excluding variables in a strategy or decision tree depends on whether you’re talking about an automatically grown tree or an interactively grown tree. If you’re growing a decision tree automatically, you can’t force a variable into the tree. You can set which variables are eligible to be used in the tree, but the algorithm that performs the…

Data Preparation 101 – The Objective of Data Preparation

May 2, 2018, by Dr. Mamdouh Refaat. Data preparation is a fundamental aspect of the modeling process. In fact, it is the most important part of the process since it occupies up to 80% of the total time of the project. The objective of data preparation is to prepare what is known as the modeling view or mining view. The modeling view is the table that contains all…

Angoss 10.7.3 Released

Version 10.7.3 of Angoss Analytics Software Suite has been released for Windows and Red Hat Linux 7. This includes KnowledgeSEEKER, KnowledgeSTUDIO, KnowledgeREADER, and KnowledgeSTUDIO for Apache Spark. What’s New in Version 10.7.3: Decision Trees and Ensemble Tree Models: two new measures added in Decision Trees, unadjusted log p-value and Bonferroni-adjusted log p-value. R and Python Language Integration: R integration is now supported for R versions 3.5.x. Platform Support: Angoss software is…

How do I change an Ordinal Variable to a Continuous Variable?

March 13, 2018, by Dr. Eric Hart. Not all ordinal variables can be treated as continuous. If the variable you’re considering is the result of a survey where people listed their satisfaction on a scale of 1 to 5, it doesn’t make sense to try to change this variable to a continuous variable; it is an ordinal variable, and should always be treated…

Differentiating Machine Learning, Artificial Intelligence, Data Science, Data Mining and Predictive Analytics

You can enter this question in an internet search engine and check what answers you get. Rather than doing that, I will answer this question simply by “rewriting” history, so I can say it is “based on a true story”. During the Second World War, Alan Turing and his colleagues at Bletchley Park succeeded in building a machine that used electromechanical components to decipher the communications to and from German submarines operating in the Atlantic.

How to Save $22k Per Data Analyst

A recent blog post from James Haight at Blue Hill Research took a look at the average amount of time – and money – spent on inefficient data prep. The results were pretty sobering: two hours per day spent on data prep, which equates to roughly $22,000 per year, per analyst. That’s a pretty hefty price tag for a low value-add activity. Imagine if you had a team of…