A collection of practical Python scripts for interacting with AWS services, designed to help data engineers automate common tasks like connecting to S3, uploading files, and managing cloud resources.
This repository serves as a reference guide containing scripts that demonstrate:
- Connecting to AWS services (S3, etc.)
- Uploading files to S3
- Basic automation for data engineering workflows
- Common AWS operations using boto3
Ideal for beginners learning AWS and scripting for data engineering tasks.
- Python 3.7+
- AWS account with appropriate permissions
- AWS CLI configured or credentials in
~/.aws/credentials - boto3 Python library
Install boto3:
pip install boto3| Script | Description |
|---|---|
connect_s3.py |
Connects to an S3 bucket using boto3 |
upload_file.py |
Uploads a file from your local machine to a specified S3 bucket |
list_buckets.py |
Lists all S3 buckets in your account |
aws_scripts_for_data_enginners/
βββ connect_s3.py
βββ upload_file.py
βββ list_buckets.py
βββ README.md
βββ requirements.txt
- Never commit AWS secret keys to the repository
- Use IAM roles or environment variables to manage credentials securely
- Consider using AWS Secrets Manager or Parameter Store for sensitive data
- Add credential files to
.gitignore
Contributions are welcome! Feel free to:
- Submit bug reports or feature requests via Issues
- Fork the repository and submit Pull Requests
- Improve documentation or add new scripts
This project is open source and available under the MIT License.
Vrishab Sharma
- GitHub: @SharmaVrishab
β If you find this repository helpful, please consider giving it a star!