BIG DATA PROCESSING FOR DECISION MAKING

UDC

K.K. Nurlybayeva 1, G.T. Balakayeva 2
(1 al-Farabi Kazakh National University, Almaty, Kazakhstan, Kalamkas.nurlybayeva@gmail.com
2 al-Farabi Kazakh National University, Almaty, Kazakhstan)

Abstract. Mining large amounts of data is a growing problem. This article describes methods and techniques aimed at solving it, including several Data Mining algorithms. It examines the latest developments in data analysis and the benefits that analyzing large volumes of data brings to businesses. It also describes proposals for optimizing data processing systems and integrating them into a single infrastructure for faster and "smarter" business decision making.

Key words. Big data, Data mining, regression, classification, association, OLAP.

The growth of data volume is now a major problem. The concept of big data means that the volume of data exceeds what existing information systems can handle. Additional processing methods and technologies are needed once volumes reach terabytes or petabytes. Algorithms that are suitable for small amounts of data are not appropriate for big data; they are not fast or efficient enough. Large amounts of information are collected in the data warehouses of the world's enterprises and companies [1]. The volume of information grows every year, and a number of problems remain open:

- Storing data incurs financial costs for equipment, maintenance, backups, etc.
- Data processing is becoming more complex and consumes more and more resources.

Nevertheless, big data analysis can be very beneficial for most interested parties. Big data is of interest to analysts who make decisions relying on historical data [2].
They build the reports needed to obtain the information required for analysis and decision making. In addition, there are companies interested in benefiting from the information stored in systems that process big data. This article examines the latest developments in data analysis, as well as the benefits of analyzing large volumes of data for businesses. It also describes proposals for optimizing data processing systems and integrating them into a single infrastructure for faster and "smarter" business decision making.

The growth in the amount of information is a consequence of improved data recording and service technologies in a variety of fields. The activity of almost every enterprise, whether a medical, commercial, industrial, or scientific organization, is accompanied by the registration of clients' information. The question is what this information is needed for. A raw flow of information is of no interest without appropriate processing and analysis. Many analytical tools are available today; several years ago, however, handling such amounts of data was either impossible or very expensive [3]. New and evolving analytical processing technologies now make possible what was not possible before. Examples include:

- New systems that are able to process a wide variety of unstructured data.
- Improved analytical capabilities, including predictive and text analytics.
- Operational business intelligence that improves business flexibility by enabling automated real-time actions and intraday decision making.
- Cloud computing services.

A system for processing big data should combine these technologies to enable new solutions that can bring significant benefits to the business [4]. In addition, to handle big data the system should support a wide range of new analytical technologies and business possibilities.
Examples include technologies such as:

- Design of predictive models
- Fraud detection
- Risk analysis
- Construction of situational rooms
- On-line analytical processing, etc.

As world experience shows, storage systems and business process management should be reorganized according to the needs of the company. For example, in some cases there is no need to keep raw data. The data saved in the database should therefore be preprocessed and transformed, which optimizes storage space and cost. Data mining technologies should be applied in this case.

Mathematical statistics was formerly the primary tool for data analysis. However, as the problems associated with data processing grew, statistical methods alone proved insufficient for analysis. Statistical methods are useful mainly for checking hypotheses (verification-driven data mining) and for "rough" exploratory analysis, which is the foundation of online analytical processing (OLAP).

Data mining technology has a wide range of uses; it can be applied wherever data is present. The main areas where it is especially important are marketing, credit scoring, fraud detection, forecasting of any type, etc. [5]. There are six methods of Data mining that should be mentioned:

- Association
- Sequence
- Classification
- Clustering
- Forecasting
- Regression

Association takes place when several events are related to each other. Data mining technologies make it possible to determine patterns in the form of association rules, which can then be used to build knowledge bases in decision-making systems. Sequence occurs when there is a chain of events related in time. Classification aims to assign an individual event to one of the existing classes by determining the number of that class. Clustering is used to find a finite number of clusters, or classes, that divide the set of events into non-intersecting subsets. Forecasting is used in every field, for example to estimate the future benefits of a new product.
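As an illustration of the association method described above, the following minimal Python sketch computes two measures commonly used to evaluate an association rule: support and confidence. The choice of these measures, the function names, and the toy shopping baskets are assumptions introduced here for illustration; they do not come from the paper.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item of `itemset`."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= set(t))
    return hits / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimate of P(consequent | antecedent): support(A and C) / support(A)."""
    base = support(transactions, antecedent)
    if base == 0:
        return 0.0  # the rule never fires, so no confidence can be estimated
    joint = support(transactions, set(antecedent) | set(consequent))
    return joint / base


# Hypothetical toy data: each set is one customer's basket.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

# Rule under test: "customers who buy bread also buy milk".
rule_support = support(baskets, {"bread", "milk"})
rule_confidence = confidence(baskets, {"bread"}, {"milk"})
print(rule_support, rule_confidence)
```

Rules whose support and confidence exceed thresholds chosen by the analyst are the kind of candidates that could be stored in a knowledge base for a decision-making system.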
Regression analysis is a statistical process for estimating the relationships among variables [6]. In probability theory and mathematical statistics, regression is the dependence of the mean value of a random variable on one or several other variables. In contrast to a purely functional dependence $y = f(x)$, where each value of the independent variable $x$ corresponds to a unique value of the dependent variable $y$, a regression dependence implies that each value of $x$ may correspond to different values of $y$, owing to the random nature of the dependence. If some value of $x$ corresponds to a set of values $\{y_1, y_2, \ldots, y_n\}$, then the dependence of their arithmetic mean on $x$ is a statistical regression:

$$\bar{y} = \frac{y_1 + y_2 + \cdots + y_n}{n}$$

The study of regression in probability theory is based on the fact that random variables $X$ and $Y$ with a joint probability distribution are linked by a probabilistic dependence: for every fixed value $X = x$, the value of $Y$ is a random variable with a certain conditional probability distribution that depends on $x$. The regression of $Y$ on $X$ is determined by the conditional expectation of $Y$, calculated under the condition that $X = x$:

$$E(Y \mid x) = u(x)$$

The equation $y = u(x)$ is the regression equation. Regression lines have the following remarkable property: among all real functions $f(x)$, the expectation $E[(Y - f(X))^2]$ is minimal for $f(x) = u(x)$. This means that the regression of $Y$ on $X$ provides the best representation, in this sense, of $Y$ by $X$. This property allows regression to be used to predict $Y$ from $X$. In other words, if $Y$ is not directly observed and the experiment records only $X$, then $\hat{Y} = u(X)$ can be used as the predicted value of $Y$.

The simplest case is when the regression of $Y$ on $X$ is linear, for example $E(Y \mid x) = b_0 + b_1 x$, where $b_0$ and $b_1$ are the regression coefficients. In practice, the regression coefficients in the equation $y = u(x)$ are unknown and are estimated from the observed data.
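The estimation of regression coefficients from observed data can be sketched as follows. This is a minimal illustration that assumes ordinary least squares as the estimation method (the paper does not name one); the function names and sample values are hypothetical. The sample deliberately contains several values of y for each x, so that the per-x averages play the role of an empirical u(x).

```python
from collections import defaultdict


def conditional_means(xs, ys):
    """Average the observed y values for each distinct x (an empirical u(x))."""
    groups = defaultdict(list)
    for x, y in zip(xs, ys):
        groups[x].append(y)
    return {x: sum(vals) / len(vals) for x, vals in groups.items()}


def fit_line(xs, ys):
    """Ordinary least squares estimates (b0, b1) for the line y = b0 + b1 * x.

    Assumes the xs are not all equal; otherwise the slope is undefined.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b1 = sxy / sxx
    b0 = mean_y - b1 * mean_x
    return b0, b1


# Hypothetical observations: two different y values for every x.
xs = [1, 1, 2, 2, 3, 3]
ys = [2, 4, 4, 6, 6, 8]

u = conditional_means(xs, ys)   # per-x averages of y
b0, b1 = fit_line(xs, ys)       # estimated regression coefficients
predicted = b0 + b1 * 4         # prediction of Y for an unobserved x = 4
print(u, b0, b1, predicted)
```

In this sample the fitted line passes through the conditional means because those means happen to lie exactly on a straight line; with real data the line only approximates them.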

Figure 1. Regression line

Regression is widely used in analytical techniques to solve various business problems, such as forecasting (sales, exchange rates, and equities), evaluating business indicators from the observed values of other indicators (scoring), identifying relationships between indicators, etc.

Differences of Data Mining from other methods of data analysis

Traditional methods of data analysis (statistical methods) and OLAP focus mainly on verifying pre-formulated hypotheses (verification-driven data mining) and on the "rough" exploratory analysis that underpins online analytical processing, whereas one of the main purposes of Data Mining is to find non-obvious relationships. Data Mining tools can find such patterns on their own and also build their own hypotheses about relationships. Since formulating hypotheses about relationships is the most difficult task, the advantage of Data Mining over other methods of analysis is obvious. Most statistical methods for identifying relationships in data rely on averaging over the sample, which leads to operations on non-existent values, whereas Data Mining operates on real values. OLAP is better suited to the retrospective understanding of historical data, while Data Mining relies on historical data to answer questions about the future.
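The contrast drawn above can be made concrete with a small sketch. Assuming a hypothetical fact table of sales records, the Python code below performs an OLAP-style roll-up, which answers a question the analyst has already formulated, and shows that a sample mean need not coincide with any observed value. All names and figures are illustrative assumptions rather than data from the paper.

```python
from collections import defaultdict

# Hypothetical fact table: (region, quarter, sales amount).
facts = [
    ("north", "Q1", 120.0),
    ("north", "Q2", 80.0),
    ("south", "Q1", 200.0),
    ("south", "Q2", 100.0),
]


def roll_up(rows, key_index):
    """Sum the sales amount (column 2) over one dimension of the table."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[key_index]] += row[2]
    return dict(totals)


# Verification-driven questions: the analyst decides in advance what to ask.
sales_by_region = roll_up(facts, 0)
sales_by_quarter = roll_up(facts, 1)

# Averaging over the sample yields a value that no record actually has.
observed = {row[2] for row in facts}
mean_sale = sum(observed) / len(observed)
mean_is_observed = mean_sale in observed
print(sales_by_region, sales_by_quarter, mean_sale, mean_is_observed)
```

A roll-up of this kind confirms or refutes a hypothesis that was stated beforehand; it does not by itself propose new relationships, which is the role the text assigns to Data Mining.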

Figure 2. Main stages of data processing

Perspectives of Data Mining technology

The potential of Data Mining provides a tremendous opportunity for expanding the frontiers of the technology. The development of Data Mining concerns the following areas:

- selection of types of subject areas, which will facilitate the formalization of solutions to the Data Mining tasks relating to these areas;
- establishment of formal languages and logical means by which reasoning can be formalized, and of automated tools that will solve Data Mining problems in specific subject areas;
- development of Data Mining methods able not only to extract patterns from data but also to form theories based on empirical data;
- closing the significant gap between the capabilities of Data Mining tools and the theoretical achievements in this field.

The development of Data Mining technology is evidently directed most of all toward business-related areas. In the short term, Data Mining products may become as ordinary and necessary as e-mail and, for example, be used to find the lowest prices on certain goods or the cheapest tickets. The long-term future of Data Mining is truly exciting: intelligent agents may search for new treatments for various diseases and for a new understanding of the nature of the universe. However, Data Mining also carries a potential danger: more and more information, including information of a private nature, becomes available through the worldwide network, and more and more knowledge can be extracted from it.

Areas where the use of Data Mining technology is likely to be successful have these features:

- they require decisions based on knowledge;
- they have a changing environment;
- they have accessible, adequate, and meaningful data;
- they provide high returns from the right decisions.

There are several points of view on Data Mining today. Supporters of one of them consider it a mirage that distracts from classical analysis.
Supporters of another accept Data Mining as an alternative to the traditional approach to analysis. There is also a middle ground, which considers the combined use of the latest achievements in Data Mining and classical statistical data analysis.

Data Mining technology is constantly evolving and is attracting increasing interest both from the scientific world and from business applications of the technology. Integrating new technologies such as Data Mining into a single infrastructure will help to achieve faster and smarter business decision making.

REFERENCES

1. Randal E. Bryant, Randy H. Katz, Edward D. Lazowska. Big-Data Computing: Creating revolutionary breakthroughs in commerce, science, and society. V.8
2. Bollier D. The Promise and Peril of Big Data. Washington: The Aspen Institute, p.
3. Heemink A. Mathematical Theory of Data Processing in Models (Data Assimilation Problems). Vol p.
4. Moran W., La Scala B. Measurements in Mathematical Modeling and Data Processing. Vol p.
5. K.K. Nurlybayeva, G.T. Balakayeva. Simulation of Large Data Processing for Smarter Decision Making. AWERProcedia Information Technology and Computer Science. 3rd World Conference on Information Technology WCIT. Vol 03, p.
6. K.K. Nurlybayeva, G.T. Balakayeva. Processing of large amounts of data on a credit scoring example using neural network technology. Safety and Security Engineering V. p.

K.K. Nurlybayeva, G.T. Balakayeva
Big data processing for decision making
Summary (translated from Kazakh). At present, the growth of data volumes is an increasingly pressing issue for data mining. This article describes methods and techniques aimed at solving these problems. Some data mining algorithms are described in the article. The article considers the latest developments in the field of analysis and the advantages of analyzing large volumes of data for business. It also describes proposals for optimizing processing systems and integrating them into a single infrastructure for faster adoption of "smart" business decisions.
Key words: Big data, data processing, regression, classification, association, OLAP.

K.K. Nurlybayeva, G.T. Balakayeva
Large volumes of data for decision making
Summary (translated from Russian). There is currently a growing problem of processing large volumes of data. This article describes methods and techniques aimed at solving these problems. Some data mining algorithms are described in the article. The article reviews the latest developments in data analysis, as well as the advantages of analyzing large volumes of data for business. It also describes proposals for optimizing data processing systems and integrating them into a single infrastructure for faster adoption of "smart" business decisions.
Key words. Big data, data processing, regression, classification, association, OLAP.


More information

IBM's Fraud and Abuse, Analytics and Management Solution

IBM's Fraud and Abuse, Analytics and Management Solution Government Efficiency through Innovative Reform IBM's Fraud and Abuse, Analytics and Management Solution Service Definition Copyright IBM Corporation 2014 Table of Contents Overview... 1 Major differentiators...

More information

Howe School of Technology Management. Applied Analytics in a World of Big Data. Business Intelligence and Analytics (BI&A) Proposed Course #: BIA 686

Howe School of Technology Management. Applied Analytics in a World of Big Data. Business Intelligence and Analytics (BI&A) Proposed Course #: BIA 686 Revised: February 20, 2012 School: Course Title: Program(s): Howe School of Technology Management Applied Analytics in a World of Big Data Business Intelligence and Analytics (BI&A) Proposed Course #:

More information

Get to Know the IBM SPSS Product Portfolio

Get to Know the IBM SPSS Product Portfolio IBM Software Business Analytics Product portfolio Get to Know the IBM SPSS Product Portfolio Offering integrated analytical capabilities that help organizations use data to drive improved outcomes 123

More information

Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.

Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance. Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Analytics

More information

PRACTICAL DATA MINING IN A LARGE UTILITY COMPANY

PRACTICAL DATA MINING IN A LARGE UTILITY COMPANY QÜESTIIÓ, vol. 25, 3, p. 509-520, 2001 PRACTICAL DATA MINING IN A LARGE UTILITY COMPANY GEORGES HÉBRAIL We present in this paper the main applications of data mining techniques at Electricité de France,

More information

ISSN: 2321-7782 (Online) Volume 3, Issue 4, April 2015 International Journal of Advance Research in Computer Science and Management Studies

ISSN: 2321-7782 (Online) Volume 3, Issue 4, April 2015 International Journal of Advance Research in Computer Science and Management Studies ISSN: 2321-7782 (Online) Volume 3, Issue 4, April 2015 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online

More information

Lluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining

Lluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining Lluis Belanche + Alfredo Vellido Intelligent Data Analysis and Data Mining a.k.a. Data Mining II Office 319, Omega, BCN EET, office 107, TR 2, Terrassa avellido@lsi.upc.edu skype, gtalk: avellido Tels.:

More information

CHAPTER SIX DATA. Business Intelligence. 2011 The McGraw-Hill Companies, All Rights Reserved

CHAPTER SIX DATA. Business Intelligence. 2011 The McGraw-Hill Companies, All Rights Reserved CHAPTER SIX DATA Business Intelligence 2011 The McGraw-Hill Companies, All Rights Reserved 2 CHAPTER OVERVIEW SECTION 6.1 Data, Information, Databases The Business Benefits of High-Quality Information

More information

Data Mining: Motivations and Concepts

Data Mining: Motivations and Concepts POLYTECHNIC UNIVERSITY Department of Computer Science / Finance and Risk Engineering Data Mining: Motivations and Concepts K. Ming Leung Abstract: We discuss here the need, the goals, and the primary tasks

More information

Making confident decisions with the full spectrum of analysis capabilities

Making confident decisions with the full spectrum of analysis capabilities IBM Software Business Analytics Analysis Making confident decisions with the full spectrum of analysis capabilities Making confident decisions with the full spectrum of analysis capabilities Contents 2

More information

DATA WAREHOUSE AND DATA MINING NECCESSITY OR USELESS INVESTMENT

DATA WAREHOUSE AND DATA MINING NECCESSITY OR USELESS INVESTMENT Scientific Bulletin Economic Sciences, Vol. 9 (15) - Information technology - DATA WAREHOUSE AND DATA MINING NECCESSITY OR USELESS INVESTMENT Associate Professor, Ph.D. Emil BURTESCU University of Pitesti,

More information

Research of Postal Data mining system based on big data

Research of Postal Data mining system based on big data 3rd International Conference on Mechatronics, Robotics and Automation (ICMRA 2015) Research of Postal Data mining system based on big data Xia Hu 1, Yanfeng Jin 1, Fan Wang 1 1 Shi Jiazhuang Post & Telecommunication

More information

Outline. What is Big data and where they come from? How we deal with Big data?

Outline. What is Big data and where they come from? How we deal with Big data? What is Big Data Outline What is Big data and where they come from? How we deal with Big data? Big Data Everywhere! As a human, we generate a lot of data during our everyday activity. When you buy something,

More information

Database Marketing simplified through Data Mining

Database Marketing simplified through Data Mining Database Marketing simplified through Data Mining Author*: Dr. Ing. Arnfried Ossen, Head of the Data Mining/Marketing Analysis Competence Center, Private Banking Division, Deutsche Bank, Frankfurt, Germany

More information

Life Insurance & Big Data Analytics: Enterprise Architecture

Life Insurance & Big Data Analytics: Enterprise Architecture Life Insurance & Big Data Analytics: Enterprise Architecture Author: Sudhir Patavardhan Vice President Engineering Feb 2013 Saxon Global Inc. 1320 Greenway Drive, Irving, TX 75038 Contents Contents...1

More information

MS1b Statistical Data Mining

MS1b Statistical Data Mining MS1b Statistical Data Mining Yee Whye Teh Department of Statistics Oxford http://www.stats.ox.ac.uk/~teh/datamining.html Outline Administrivia and Introduction Course Structure Syllabus Introduction to

More information

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP Data Warehousing and End-User Access Tools OLAP and Data Mining Accompanying growth in data warehouses is increasing demands for more powerful access tools providing advanced analytical capabilities. Key

More information

A Basic Guide to Modeling Techniques for All Direct Marketing Challenges

A Basic Guide to Modeling Techniques for All Direct Marketing Challenges A Basic Guide to Modeling Techniques for All Direct Marketing Challenges Allison Cornia Database Marketing Manager Microsoft Corporation C. Olivia Rud Executive Vice President Data Square, LLC Overview

More information

RESEARCH PAPERS FACULTY OF MATERIALS SCIENCE AND TECHNOLOGY IN TRNAVA SLOVAK UNIVERSITY OF TECHNOLOGY IN BRATISLAVA

RESEARCH PAPERS FACULTY OF MATERIALS SCIENCE AND TECHNOLOGY IN TRNAVA SLOVAK UNIVERSITY OF TECHNOLOGY IN BRATISLAVA RESEARCH PAPERS FACULTY OF MATERIALS SCIENCE AND TECHNOLOGY IN TRNAVA SLOVAK UNIVERSITY OF TECHNOLOGY IN BRATISLAVA 2013 Number 33 BUSINESS INTELLIGENCE IN PROCESS CONTROL Alena KOPČEKOVÁ, Michal KOPČEK,

More information

Executive Briefing White Paper Plant Performance Predictive Analytics

Executive Briefing White Paper Plant Performance Predictive Analytics Executive Briefing White Paper Plant Performance Predictive Analytics A Data Mining Based Approach Abstract The data mining buzzword has been floating around the process industries offices and control

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining 1 Why Data Mining? Explosive Growth of Data Data collection and data availability Automated data collection tools, Internet, smartphones, Major sources of abundant data Business:

More information

Master of Mathematical Finance: Course Descriptions

Master of Mathematical Finance: Course Descriptions Master of Mathematical Finance: Course Descriptions CS 522 Data Mining Computer Science This course provides continued exploration of data mining algorithms. More sophisticated algorithms such as support

More information

THE THREE "Rs" OF PREDICTIVE ANALYTICS

THE THREE Rs OF PREDICTIVE ANALYTICS THE THREE "Rs" OF PREDICTIVE As companies commit to big data and data-driven decision making, the demand for predictive analytics has never been greater. While each day seems to bring another story of

More information

Chapter ML:XI. XI. Cluster Analysis

Chapter ML:XI. XI. Cluster Analysis Chapter ML:XI XI. Cluster Analysis Data Mining Overview Cluster Analysis Basics Hierarchical Cluster Analysis Iterative Cluster Analysis Density-Based Cluster Analysis Cluster Evaluation Constrained Cluster

More information

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK A SURVEY ON BIG DATA ISSUES AMRINDER KAUR Assistant Professor, Department of Computer

More information

Learning outcomes. Knowledge and understanding. Competence and skills

Learning outcomes. Knowledge and understanding. Competence and skills Syllabus Master s Programme in Statistics and Data Mining 120 ECTS Credits Aim The rapid growth of databases provides scientists and business people with vast new resources. This programme meets the challenges

More information

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics Dr. Liangxiu Han Future Networks and Distributed Systems Group (FUNDS) School of Computing, Mathematics and Digital Technology,

More information

Promises and Pitfalls of Big-Data-Predictive Analytics: Best Practices and Trends

Promises and Pitfalls of Big-Data-Predictive Analytics: Best Practices and Trends Promises and Pitfalls of Big-Data-Predictive Analytics: Best Practices and Trends Spring 2015 Thomas Hill, Ph.D. VP Analytic Solutions Dell Statistica Overview and Agenda Dell Software overview Dell in

More information

What s Trending in Analytics for the Consumer Packaged Goods Industry?

What s Trending in Analytics for the Consumer Packaged Goods Industry? What s Trending in Analytics for the Consumer Packaged Goods Industry? The 2014 Accenture CPG Analytics European Survey Shows How Executives Are Using Analytics, and Where They Expect to Get the Most Value

More information

Course Syllabus For Operations Management. Management Information Systems

Course Syllabus For Operations Management. Management Information Systems For Operations Management and Management Information Systems Department School Year First Year First Year First Year Second year Second year Second year Third year Third year Third year Third year Third

More information

How To Use Data Mining For Loyalty Based Management

How To Use Data Mining For Loyalty Based Management Data Mining for Loyalty Based Management Petra Hunziker, Andreas Maier, Alex Nippe, Markus Tresch, Douglas Weers, Peter Zemp Credit Suisse P.O. Box 100, CH - 8070 Zurich, Switzerland markus.tresch@credit-suisse.ch,

More information

Importance or the Role of Data Warehousing and Data Mining in Business Applications

Importance or the Role of Data Warehousing and Data Mining in Business Applications Journal of The International Association of Advanced Technology and Science Importance or the Role of Data Warehousing and Data Mining in Business Applications ATUL ARORA ANKIT MALIK Abstract Information

More information

This Symposium brought to you by www.ttcus.com

This Symposium brought to you by www.ttcus.com This Symposium brought to you by www.ttcus.com Linkedin/Group: Technology Training Corporation @Techtrain Technology Training Corporation www.ttcus.com Big Data Analytics as a Service (BDAaaS) Big Data

More information

Web Data Mining: A Case Study. Abstract. Introduction

Web Data Mining: A Case Study. Abstract. Introduction Web Data Mining: A Case Study Samia Jones Galveston College, Galveston, TX 77550 Omprakash K. Gupta Prairie View A&M, Prairie View, TX 77446 okgupta@pvamu.edu Abstract With an enormous amount of data stored

More information

DATA MINING AND WAREHOUSING CONCEPTS

DATA MINING AND WAREHOUSING CONCEPTS CHAPTER 1 DATA MINING AND WAREHOUSING CONCEPTS 1.1 INTRODUCTION The past couple of decades have seen a dramatic increase in the amount of information or data being stored in electronic format. This accumulation

More information

DATA MINING IN FINANCE

DATA MINING IN FINANCE DATA MINING IN FINANCE Advances in Relational and Hybrid Methods by BORIS KOVALERCHUK Central Washington University, USA and EVGENII VITYAEV Institute of Mathematics Russian Academy of Sciences, Russia

More information

NEURAL NETWORKS IN DATA MINING

NEURAL NETWORKS IN DATA MINING NEURAL NETWORKS IN DATA MINING 1 DR. YASHPAL SINGH, 2 ALOK SINGH CHAUHAN 1 Reader, Bundelkhand Institute of Engineering & Technology, Jhansi, India 2 Lecturer, United Institute of Management, Allahabad,

More information

The Scientific Data Mining Process

The Scientific Data Mining Process Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In

More information

Study Plan for the Master Degree In Industrial Engineering / Management. (Thesis Track)

Study Plan for the Master Degree In Industrial Engineering / Management. (Thesis Track) Study Plan for the Master Degree In Industrial Engineering / Management (Thesis Track) Plan no. 2005 T A. GENERAL RULES AND CONDITIONS: 1. This plan conforms to the valid regulations of programs of graduate

More information

Data Mining Analytics for Business Intelligence and Decision Support

Data Mining Analytics for Business Intelligence and Decision Support Data Mining Analytics for Business Intelligence and Decision Support Chid Apte, T.J. Watson Research Center, IBM Research Division Knowledge Discovery and Data Mining (KDD) techniques are used for analyzing

More information

Business Intelligence and Decision Support Systems

Business Intelligence and Decision Support Systems Chapter 12 Business Intelligence and Decision Support Systems Information Technology For Management 7 th Edition Turban & Volonino Based on lecture slides by L. Beaubien, Providence College John Wiley

More information

Demystifying Big Data Government Agencies & The Big Data Phenomenon

Demystifying Big Data Government Agencies & The Big Data Phenomenon Demystifying Big Data Government Agencies & The Big Data Phenomenon Today s Discussion If you only remember four things 1 Intensifying business challenges coupled with an explosion in data have pushed

More information

Using Data Mining for Mobile Communication Clustering and Characterization

Using Data Mining for Mobile Communication Clustering and Characterization Using Data Mining for Mobile Communication Clustering and Characterization A. Bascacov *, C. Cernazanu ** and M. Marcu ** * Lasting Software, Timisoara, Romania ** Politehnica University of Timisoara/Computer

More information

Cleaned Data. Recommendations

Cleaned Data. Recommendations Call Center Data Analysis Megaputer Case Study in Text Mining Merete Hvalshagen www.megaputer.com Megaputer Intelligence, Inc. 120 West Seventh Street, Suite 10 Bloomington, IN 47404, USA +1 812-0-0110

More information

SURVEY REPORT DATA SCIENCE SOCIETY 2014

SURVEY REPORT DATA SCIENCE SOCIETY 2014 SURVEY REPORT DATA SCIENCE SOCIETY 2014 TABLE OF CONTENTS Contents About the Initiative 1 Report Summary 2 Participants Info 3 Participants Expertise 6 Suggested Discussion Topics 7 Selected Responses

More information

An Introduction to Advanced Analytics and Data Mining

An Introduction to Advanced Analytics and Data Mining An Introduction to Advanced Analytics and Data Mining Dr Barry Leventhal Henry Stewart Briefing on Marketing Analytics 19 th November 2010 Agenda What are Advanced Analytics and Data Mining? The toolkit

More information