Cloud Mining - CRM Data Mining in the Cloud

Notebook Cloud Mining-300x254 in Cloud Mining - CRM Data Mining in the Cloud

Cloud Mining für CRM

Cloud Mining is a new approach to apply Data Mining to customer data. This article introduces Cloud Mining in a quick overview.

Data Mining is a determined technique to analyse Data in CRM, Marketing and Distribution. For example it helps optimizing customer interaction, shows buying potentials of customers and the churn probability by the use of statistical-mathematical methods on big amounts of data. Thereby companies can make marketing efforts more precise – they spendings less and achieve better effects.

Continue reading Cloud Mining – CRM Data Mining in the Cloud

Genetic Algorithms with T-SQL

Bot Square-300x183 in Genetic Algorithms with T-SQL

Diligent Data Mining Bot!

Applying Data Mining with only SQL is considered very rare, but it is actually possible to solve some problems. As you can read in the article A Genetic Algorithm Sample in T-SQL by William Talada, it can solve a quite interessting problem.

A field with 10×10 squares is covered to the half with empty cans, it is bordered by a impassable wall. Now a poor tiny robot is send to clear the whole field. He can walk in any direction or pick up a can. His view is reduced to 5 squares, the one he stands on am the four fields that adjoin.
Continue reading Genetic Algorithms with T-SQL

Data Mining in the KDD Environment

In 1996 Osama Fayyad proposed a very popular process how to make a companies data useful for business needs. Data Mining is described to be a part of the KDD Process, actually quite small in this definition, but very important. After reading this article, you will understand why Data Mining needs a pre- and post-processing and just can’t stand alone against all the Data.

The approach to gain knowledge out of a set of data was separated by Fayyad into individual steps. The individuality results out of different tools you use, and different outcomes that are needed.

Fayyad1996-300x141 in Data Mining in the KDD Environment

KDD process by Fayyad 1996

The KDD Process stands for the Knowledge Discovery in Databases. According to Fayyad there are five steps: Selection, Pre-processing, Transformation, Data Mining and Interpretation. These five steps are passed through iteratively. Every step can be seen as a work-through phase. Such a phase requires the supervision of a user and can lead to multiple results. The best of these results is used for the next iteration, the others should be documented. In the following, the steps will be briefly described.

Continue reading Data Mining in the KDD Environment

Massive Prospects

MASSIVE Battle in Massive Prospects

Screenshot MASSIVE Software

MASSIVE (short for Multiple Agent Simulation System in Virtual Environment) should be quite well known to all the cinema-enthusiastics and Lord of the Rings-Fans – at least the stunning result when it comes to huge crowded war mass scenes. But not necessarily to the Data Mining friend. New Zealands’ special effects company WETA developed this nice piece of software and left an impressing first fingerprint in the world of cinema.

The software is exciting, because every single simulated agent is driven by his own Fuzzy Logic individually, as if he would make real decisions. The agents decides out of a given set of information, if he runs away or fights. Running away can be an option even for a mighty uruk hai-orc, especially if everyone else does it …
Continue reading Massive Prospects

Overrepresentation - "SAS"-Oversampling

Nadel in Overrepresentation - SAS-Oversampling

Typical Data Mining task: Looking for a needle in a haystack!

To improve your Data Mining result when only having a small amount of target variables, it is useful to oversample the target variable. It is shown here how this works – and how to undo it when dealing with the result.

In cases where the target variable appears in a fraction of less then 10%, it is common to stratify the occurrence of the target variable. That should improve the result of your Data Mining challange. The term “oversampling” is used by SAS in their Enterprise Miner Software, to higher the relative occurence of the target variable without using copies – but by reducing the occurence of the non-target variable. Be advised that “oversampling” is also called to duplicate the content – you should check that out at zyxos Blog. We will stick to the quite simple view of SAS.
Continue reading Overrepresentation – “SAS”-Oversampling

27th National Database Conference of China

Event in 27th Nationale Database Conference of ChinaLocation: Beijing, P.R. China

Date: October 14-17, 2010

to the Website

Topics: (but not limited to)

  • Emerging Technology for Database
  • Data Management on Cloud
  • Web Data Management
  • Query Processing and Optimization
  • Stream Data Management
  • XML and Semi-Structured Data Mining
  • Data Warehouse and On-Line
    Analytic Processing
  • Approximate and Uncertain Data Management
  • Content and Knowledge Management
  • Data Mining and Knowledge
  • Meta-data Management
  • Data Integration and Migration
  • Embedded Database Systems and Mobile Data Management
  • Parallel and Distributed Database
  • Domain-Specific Database Systems
  • Self-Managing Technology in DBMS
  • Intelligent User Interface Technology
  • Spatial and Temporal Database
  • Multimedia Data Management
  • Data Privacy and Security
  • Information Retrieval and

Cloud Mining at Layered Technologies?

Layered-tech in Cloud Mining at Layered Technologies?As reported by Katie Brooks from, there will be a new Virtual Private Data Center (VPDC) platform from Layered Technologies. The provider of on-demand IT infrastructure tries to overcome users concerns if their data is safe by having a enterprise level security standart. It will be possible to “design, order and deploy a secure virtualized environment within an hour”.  Customers can choose whatever plattform they need, including 3Tera’s AppLogic, VMware and Microsoft Hyper-V. This might be a good SaaS Provider for cloud mining, the Data Mining of the future.