Abstract:
Water is one of the most basic element supporting life and environment for every leaving as
well as non-leaving thing. For predicting a water quality now a day’s lots of techniques are
implemented like Data Mining, Remote Sensing, etc. Data Mining is becoming the most
popular technique for handling huge amount of water and its related data. At present The
Central Pollution Control Board (CPCB) provides data about water and its quality which is
very difficult to understand so it is necessary to build a data model to monitor and analyze
the water quality based on the defined parameters. This paper represents how to handle the
large amount of data with the help of Data Mining basics and clustering technique with the
K-means to analyze the water quality based on the predefined water parameters.