Data Sanitization and Restoring Using Python and ML | Source Code
LIMITED TIME
Get Source Code ₹99
Open Live Demo
Real project UI Full source included Opens in new tab

Tap to open live demo

Interactive live demo — verify the project before you buy

Data Sanitization and Restoring Using Python and ML

Live Demo

Complete final-year project source code with frontend, backend, database, and setup guide. Instant download after secure payment.

  • MACHINE-LEARNING Stack
  • Instant Download
Secure CCAvenue payment Instant download link WhatsApp support

Choose your plan

Source Code Only

Full ZIP with frontend, backend, database & documentation.

₹99 one-time
  • Complete project source files
  • Database script included
  • How-to-run guide

What's in your download

Review features, setup steps, and credentials before you pay.

Project Overview

Description, tech stack, and what is included

Full source Frontend + backend
Database .sql file
Setup guide README included

DataSecure Pro is a final year project built with Python Flask and machine learning for students who want a practical data privacy and sanitization web application. This final year project allows users to upload CSV or XLSX datasets, preview data, run sanitization modes such as masking, anonymization, cleaning, encoding, and sensitive-column detection, then download sanitized outputs. Users can also request controlled restoration of sanitized files, while admins approve, reject, or run restoration workflows using stored mappings where possible. The admin side of this final year project includes users, categories, sensitive data types, sanitize/restore rules, datasets, sanitized files, restoration tickets, ML models, value mappings, reports, and audit logs. With ML-assisted column detection, KPI exports, and governance workflows, DataSecure Pro is suitable for a final year major project in data privacy, Flask, ML, and cybersecurity.

Technical snapshot

Project
Data Sanitization and Restoring Using Python and ML
Stack
MACHINE-LEARNING
Includes
Code, DB, README
License
Academic submission
Secure CCAvenue payment · Instant download · Need help? WhatsApp us

Ready to download?Pay once · Use for submission & viva

Admin Features

Modules and controls available to administrators

  • Admin dashboard for this final year project with users, datasets, sanitized/restored artefacts, mappings, pending restoration tickets, and recent audit events
  • User search, dossier view, edit, active toggle, and purge account workflow
  • Per-user file list
  • Category CRUD with delete blocking when datasets reference the category
  • Sensitive data type CRUD with delete blocking when rules reference the type
  • Sanitize and restore rule management with JSON config and active flag
  • Global dataset list/search and dataset dossier with lineage
  • Dataset status, category, and workspace field editing
  • Deep dataset delete with cleanup
  • Original dataset preview
  • Operator-triggered sanitization
  • Sanitized file browsing, search, detail, preview, compare, and deep delete
  • Restoration ticket queue with approve, reject, and run restoration actions
  • ML model registry and Train/refresh workflow
  • Value mapping list with dataset filters, pagination, delete, and bulk old mapping delete
  • Sanitization and restoration KPI reports
  • CSV and PDF report downloads
  • Audit log filtering by action and user ID
  • Preview and compare tools for original, sanitized, and restored rows

User Features

What end users can do in this application

  • User login and portal access in this final year project
  • Dashboard with datasets and recent activity overview
  • Upload CSV/XLSX tabular files within configured size limits
  • Optional category assignment and workspace notes
  • My datasets list with metadata, preview, edit, delete, and original download
  • Sanitization mode selection for a dataset
  • ML column detection preview
  • Sanitization engine using ML detection and rules
  • Sanitized output listing and detail
  • Sanitized file compare/preview flows
  • Sanitized file delete or download
  • Restoration request creation against sanitized files
  • Column-scope restoration request handling
  • Restoration status tracking
  • Restored artefact downloads when completed and allowed
  • User-visible KPI-style reports
  • Account history and profile update

Other Features

Additional capabilities included in the project

  • Public landing page included in this final year project
  • SQLite default database with DATABASE_URL override
  • CSV and XLSX upload support
  • ReportLab-based PDF reporting
  • SensitiveColumnDetector trained on curated column labels
  • RandomForest model using TF-IDF word/character n-grams and lexical features
  • Heuristic support for email and phone-like cell patterns
  • APP_DATA_ROOT support for writable shared-hosting storage
  • Seed script with demo users, categories, rules, and sample CSV
  • Upload folders for original, sanitized, and restored files
  • ML model storage under ml_models/
  • Optional scripts for building sensitive column labels and importing CSV headers
  • Suitable for final year project demo, viva, source code review, and report preparation

How to Run

Step-by-step setup on your laptop or PC

  • Open the final year project folder:
    cd "Dual Data Sanitization and Restoration Using Machine Learning"
  • Create a virtual environment:
    python -m venv .venv
  • Activate the environment:
    • Windows: .venv\Scripts\activate
    • macOS/Linux: source .venv/bin/activate
  • Install dependencies:
    pip install -r requirements.txt
  • Optional: set environment variables such as:
    • SECRET_KEY
    • DATABASE_URL
    • APP_DATA_ROOT
    • SEED_ADMIN_PASSWORD
  • Seed the database and demo files:
    python seed.py
  • Run the final year Flask project:
    python run.py
  • Open:
    http://127.0.0.1:5000
  • First-time flow: login as admin or seeded user, upload a dataset, preview ML detection, run sanitization, and test restoration workflow

Login Credentials

Default demo accounts for testing after setup

Administrator

  • Username: admin
  • Password: ChangeMeAdmin!2026 or the value of SEED_ADMIN_PASSWORD
  • Email: [email protected]

Demo Users
All demo users use password: user123 unless changed in seed logic.

  • priyanair — owns bundled seed_sample.csv dataset row
  • johnc

License

Usage terms for academic and personal projects

Related Tags

Search terms and categories for this source code

Data Sanitization and Restoring Using Python and ML Source Code Final Year MACHINE-LEARNING Project Ready-to-Run Code With Database File Plagiarism-Free Faculty Approved data sanitization final year project data restoration project Flask machine learning project sensitive column detection system data anonymization source code Python data privacy project CSV XLSX sanitization restoration approval workflow audit log project DataSecure Pro