Data Lake Practical Techniques: Part 1 – Introduction

“Data Lake” is one of the current hot topics in Big Data with lots of press and little practical content. A big part of the confusion and controversy arises from the tension between hype and common sense.  Proponents promise phenomenal insights and actionable predictions about customer behavior, employee behavior, doctors’ handwritten diagnostic notes and so on. Basically, they claim you dump documents and data into the lake and let the algorithms make sense of it all. Of course, that’s patent nonsense. Unfortunately, the hype obscures some real value in the data lake concept. In this and following posts we’ll see … Read More

Dimension Filters in Tableau

Toggling between dimensions or measures is a useful feature in Tableau, whether you are building a story for users or just a dashboard. Giving users this ability is easy and takes only a couple of steps. The first step is to choose “Create Parameter…” by right-clicking on the left side of the program. To create the parameter, change the data type to String and the allowable values to List. Once you’re done with that you can type in … Read More

Hosting a Shiny App in Tableau

Graphics in both Tableau and Shiny have their pluses and minuses. Tableau is good for drag-and-drop graphics: it is easy to change the type of graph to suit the situation with little to no coding. R/Shiny graphics are completely customizable, from 3D graphics to violin plots, which are boxplots that incorporate a probability density function. Also, with Shiny you can get summaries of the data and apply many other advanced statistical methods; beware of running analyses in Shiny without checking the proper diagnostic tests related to the data at hand (this is my statistics background speaking). Where Tableau … Read More

Tips on Developing Shiny Apps in RStudio Server

Developing on RStudio Server feels just like developing on its desktop counterpart; the only difference is that you access it through your web browser. In fact, RStudio Server has some advantages: all you need is a computer that can connect to the internet, and you use the server’s resources for computations rather than your local machine’s. This means that you can develop R code on a Chromebook or tablet. In fact, I connected to my RStudio Server on my iPhone and could code if I wanted, but quickly found that it would be more of a hassle than it’s … Read More

Deploying a Shiny App on Shiny Server

RStudio provides a free version of Shiny Server for all to use. One drawback of the free version is that your apps will be public facing and you will not have a login. If you want to use Shiny Server to host sensitive data, you will likely want to purchase the license to help protect your data. Getting a Shiny app hosted is easy. All you need to do is move your Shiny files to the /srv/shiny-server/ directory. Once they are there, you will be able to share your apps with anyone using the following URL: http://<hostname>:3838/APP_NAME/.
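
The deployment step above can be sketched in a few lines of Python. This is only an illustration, not part of Shiny Server itself: the helper names `deploy_app` and `app_url` are hypothetical, and the sketch assumes the app lives in a single folder whose name becomes APP_NAME in the URL.

```python
import shutil
from pathlib import Path

# Shiny Server app directory, as given in the post.
SHINY_ROOT = Path("/srv/shiny-server")


def deploy_app(app_dir, shiny_root=SHINY_ROOT):
    """Copy a Shiny app folder into the Shiny Server directory (hypothetical helper)."""
    app_dir = Path(app_dir)
    target = Path(shiny_root) / app_dir.name
    # dirs_exist_ok=True lets a redeploy overwrite files already in place (Python 3.8+).
    shutil.copytree(app_dir, target, dirs_exist_ok=True)
    return target


def app_url(hostname, app_name, port=3838):
    """Build the URL pattern from the post: http://<hostname>:3838/APP_NAME/."""
    return f"http://{hostname}:{port}/{app_name}/"
```

For example, `deploy_app("my_app")` followed by `app_url("myserver", "my_app")` would give the address to share. Writing to /srv/shiny-server/ typically requires elevated permissions on the server, so the copy may need to be run by a user who has them.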

Industrial IoT and Machine Learning Webinar

Industrial Internet of Things and Machine Learning

Making Wind Energy Cost Competitive, sponsored by the Independent Oracle Users Group. This is a case study of the Fluitec Wind system, a project that M&S Director of Data Science Bob Liekar led and will be presenting on Wednesday, May 13, 12:45pm–2:00pm EDT. About the Webinar: The Fluitec Wind Tribo-Analytics system is a production application that utilizes multiple Big Data disciplines. This case study presents a real-world application of Big Data and IoT (Internet of Things). The techniques and algorithms can be applied to many other industries. From Fluitec: “Wind turbines are expensive to operate. They are remote and distributed with highly … Read More

What I Learned From Oracle’s BIWA Summit 2015 (Perspective from a Consulting Intern)

Our Big Data Team here at M&S packed their bags and headed to San Francisco last week for the 2015 Oracle Business Intelligence Warehousing and Analytics Summit. Some of the major topics discussed included the Internet of Things (IoT), Big Data, and Advanced Analytics. The Internet of Things (IoT) is at the peak of Gartner’s Hype Cycle. It is increasingly talked about, and there are predictions of billions and billions of sensors by 2020. But what exactly is IoT? IoT involves embedded computing devices interconnected with existing internet infrastructure, such as wearables and connected cars. Why the hype right … Read More

Presidential State of the Union Sentiment Analysis

State of the Union Speech Analysis 2015 Barack Obama

Sentiment analysis is a hot technology in marketing. This analysis parses comments and feedback left on corporate and social media sites and attempts to discern the tone or attitude expressed by the commenter. For example, reviews and comments about a company’s product such as “I love these shoes” represent a strong positive statement about how the customer feels about that product. The challenge is that the comments are just free-form text in English which computers traditionally had a hard time deciphering. M&S Big Data thought it would be interesting to apply this sentiment analysis to our Presidents. As a start … Read More
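
To make the idea of scoring a comment such as “I love these shoes” concrete, here is a minimal sketch of a word-list (lexicon) approach. It is not the method M&S used for the State of the Union analysis; the word lists and the function names `sentiment_score` and `sentiment_label` are made up for illustration, and real tools handle negation, sarcasm and context far better.

```python
import re

# Tiny hand-made word lists, for illustration only.
POSITIVE = {"love", "great", "excellent", "good", "happy"}
NEGATIVE = {"hate", "terrible", "awful", "bad", "poor"}


def sentiment_score(text):
    """Count positive words minus negative words in a free-form comment."""
    words = re.findall(r"[a-z']+", text.lower())
    return sum((w in POSITIVE) - (w in NEGATIVE) for w in words)


def sentiment_label(text):
    """Map the raw score to a coarse tone: positive, negative or neutral."""
    score = sentiment_score(text)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"
```

With these word lists, `sentiment_label("I love these shoes")` returns `"positive"` because “love” is the only scored word in the comment.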