R in Data Profiling and Cleansing


Data profiling and data cleansing are essential steps in data processing. Poor data quality, and analysis performed on dirty data, is a primary reason business insights fail. Most of a project's timeline is spent cleaning the data, checking its quality, standardizing it, and putting it into the right format for use. This process is also collectively called … Continue reading

Using R in Extract, Transform and Load


Business Intelligence is an umbrella term that includes ETL, data manipulation, business analytics, data mining, and visualization. It may also relate to other trending statistical techniques. Let's study the most commonly used techniques in BI and apply them to achieve our goal by building a sample BI application. We will use the R language (open-source software for statistical computing and graphics) to … Continue reading

SlideShare Web Traffic Data Visualization using R


I extracted my SlideShare web data and did some data wrangling and visualization. You can do the same by downloading the data from your SlideShare account. The R code (it also contains some unneeded snippets, which can be omitted) and the file definition are available at https://github.com/kannandreams/slideshare Basic data cleansing process: Missing values Group … Continue reading

My version of Walk to Remember


I started this blog some years back, when I moved away from my old blog, which was purely about Oracle. My idea was to post my own perspective on life, technology, and entrepreneurship, and to share my knowledge with the world. But, my bad, I still have not posted anything other than technology. OK, now I feel I should do that and … Continue reading

Hypothesis Formulation


About hypotheses: Part 1: https://www.youtube.com/watch?v=cpL38ZeIecE Part 2: https://www.youtube.com/watch?v=7l6K0V_x_hw Hypothesis testing steps: Hypotheses (null and alternative); Significance (alpha value, e.g. 0.05, which sets the level of significance: the accepted probability of wrongly rejecting the null hypothesis); Sample (take the sample values); p-value (calculate the p-value); Decide (use the p-value … Continue reading
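The five steps listed in this excerpt can be sketched in a few lines. This is only an illustration, not the method from the full post: it assumes a two-sided one-sample z-test with a known population standard deviation, and the function name `z_test`, the sample values, `mu0` and `sigma` are all hypothetical. The decision rule used (reject the null hypothesis when the p-value is below alpha) is the conventional one, since the excerpt is cut off before stating it.

```python
import math
from statistics import NormalDist, mean


def z_test(sample, mu0, sigma, alpha=0.05):
    """Two-sided one-sample z-test (assumes sigma is known).

    Returns (p_value, reject_null).
    """
    n = len(sample)
    # Test statistic: distance of the sample mean from mu0 in standard errors.
    z = (mean(sample) - mu0) / (sigma / math.sqrt(n))
    # Two-sided p-value from the standard normal distribution.
    p_value = 2.0 * (1.0 - NormalDist().cdf(abs(z)))
    # Conventional decision rule: reject H0 when p < alpha.
    return p_value, p_value < alpha


# 1. Hypotheses: H0: mean == 10, H1: mean != 10 (hypothetical values)
# 2. Significance: alpha = 0.05
# 3. Sample: made-up measurements
sample = [15.0, 16.0, 14.0, 15.0]
# 4. p-value and 5. decision
p_value, reject = z_test(sample, mu0=10.0, sigma=2.0, alpha=0.05)
print(f"p-value = {p_value:.3g}, reject H0: {reject}")
```

With real data the population standard deviation is rarely known, in which case a t-test would usually be the more appropriate choice.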

Chi Squared Distribution


The Chi Squared distribution, or Chi Square, is represented by the symbol χ². The Chi Square distribution is the distribution of the sum of squared standard normal deviates. As the degrees of freedom increase, the Chi Square distribution approaches a normal distribution. Chi Square distributions are positively skewed. Skew decreases as the degrees of freedom increase. It helps to understand … Continue reading
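The definition in this excerpt can be checked with a small simulation. The sketch below is illustrative only and uses hypothetical helper names (`chi_square_sample`, `sample_mean`, `theoretical_skewness`). It draws Chi Square values by summing squared standard normal deviates, and uses two standard facts about the distribution: its mean equals the degrees of freedom, and its skewness is sqrt(8 / df), which shrinks as the degrees of freedom grow.

```python
import math
import random


def chi_square_sample(df, rng):
    """One Chi Square draw: the sum of df squared standard normal deviates."""
    return sum(rng.gauss(0.0, 1.0) ** 2 for _ in range(df))


def sample_mean(df, n, seed=0):
    """Average of n simulated Chi Square draws with df degrees of freedom."""
    rng = random.Random(seed)
    return sum(chi_square_sample(df, rng) for _ in range(n)) / n


def theoretical_skewness(df):
    """Skewness of the Chi Square distribution: sqrt(8 / df)."""
    return math.sqrt(8.0 / df)


for df in (2, 8, 32):
    print(
        f"df={df:>2}  simulated mean={sample_mean(df, 20000):.2f}  "
        f"skewness={theoretical_skewness(df):.2f}"
    )
```

The simulated means should land close to the degrees of freedom, while the skewness column falls from 2.00 at df=2 to 0.50 at df=32, consistent with the distribution becoming more symmetric and closer to normal.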