Module-1
DBMS - Notes
Data
Raw, unprocessed facts or details without context or meaning. Data is unorganized and often collected from observations, measurements, or recordings.
Examples: Numbers, dates, text, images, or symbols that need interpretation (e.g., "25," "2023-09-04," or "John").
Information
Processed, organized, or structured data that is meaningful and useful for decision-making or understanding.
Examples: A report showing the average temperature, a summarized sales report, or an analyzed dataset showing trends.
Record & Data File
Record: A collection of various types of data of similar field.
Data file: A collection of records, sometimes called a table.
Persistent Data
Information stored in a way that it remains available after system restart. Saved on devices like hard drives, SSDs, or databases.
Database
An organized collection of data that allows easy retrieval, insertion, updating, and deletion.
File
A file is a collection of data stored on a computer or device. Files can contain text, images, videos, or programs.
File Management System
Software to organize, store, retrieve, and manage files using folders, search, backup, etc.
File Structure
The way data is arranged within a file, e.g., headers, metadata, data content.
File Organization
How files are arranged on a storage device. Can be hierarchical or flat.
Types of File Organization
- Sequential: Simple, efficient for sequential access. Poor for random access.
- Indexed Sequential: Uses index for fast random access. Requires extra storage.
- Direct: Uses hashing. Efficient but prone to collisions.
- Clustered: Based on common attribute. Fast for specific queries.
- Hash: Advanced hashing to reduce collisions.
- Multi-level Indexing: Handles large datasets efficiently.
Database Management System (DBMS)
A DBMS helps organize, store, and retrieve data efficiently using tables. It supports data manipulation, retrieval, security, and sharing.
Evolution of Databases and DBMS
Manual: Paper records and ledgers.
Relational: Tables and SQL introduced by Edgar F. Codd.
NoSQL: For large, flexible, scalable systems (MongoDB, Redis, etc.).
Cloud: Cloud-hosted databases with low maintenance and high accessibility.
Characteristics of Databases
- Organization: Structured tables, defined relationships.
- Consistency: Integrity and validation rules.
- Efficiency: Fast data access, indexing.
- Security: Access control and encryption.
- Reliability: Backup and recovery options.
- Scalability: Can handle growth in data and users.
- Data Independence: Logical/physical separation.
- Concurrency: Supports multiple users at once.
Approaches of DBMS
- Data Model: Relational, Object-Relational, NoSQL.
- Data Consistency: Strong, Eventual, Weak.
- Scalability: Horizontal and Vertical.
- Performance: Query speed, concurrency handling.
- Flexibility: Schema and data type adaptation.
- Reliability: Fault-tolerance, durability.
- Security: Access roles, encryption.
- Cost: Licensing, hardware, maintenance.
Components of DBMS
- Data
- Schema
- Software
- Hardware
- Users
- Procedures
Advantages of DBMS
- Controls redundancy
- Data sharing
- Easy maintenance
- Reduced development time
- Backup and recovery support
- Multiple user interfaces
Disadvantages of DBMS
- High hardware/software cost
- Large memory/storage usage
- Complexity
- High impact on failure