Recommend an automated operation and maintenance management script baseline
Reprinted from: Open Source Linux
Script baseline for automated operation and maintenance management, including service automation processing and operation. This is where scripts are mainly stored.
Here we cooperate with Qiniu Cloud Storage to manage large file storage
Automation of operation and maintenance
Basic inspection:
-
1. Application survival status inspection, once every minute, email notification
-
2. Server survival status inspection
-
3. Automatic packaging of basic images, including Dockerfile
-
4. Automatic database upload and backup
-
5. Daily status inspection of K8S operation status
-
6. Automatic cleanup of Docker images
-
7. Automatically clean up redundant log files
-
8. Third-party interface status inspection
-
9. Middleware abnormal inspection and automatic startup
-
10. Application dependency package security vulnerability inspection
-
11. Java dependency package security vulnerability inspection
-
12. Server security port inspection
Business inspection:
-
1. System abnormal login inspection
Operation and maintenance work covers many aspects
Here are some of the main aspects:
-
1. System operation and maintenance :
-
• Operating system management : including installation, configuration and maintenance of operating systems such as Linux, Windows, etc.
-
• Security patches and updates : Ensure system security and perform timely security patch and update management.
-
• System monitoring and performance optimization : Monitor system resource utilization and performance indicators, and perform optimization and adjustments to ensure efficient system operation.
-
2. Network operation and maintenance :
-
• Network equipment management : manage the configuration and maintenance of network equipment such as routers and switches.
-
• Network security : Monitor and maintain network security, including firewall configuration, intrusion detection systems, etc.
-
• Network performance optimization : Monitor network traffic, bandwidth usage, etc., and optimize to ensure smooth network operation.
-
3. Database operation and maintenance :
-
• Database installation and configuration : Install, configure and optimize databases such as MySQL, PostgreSQL , MongoDB, etc.
-
• Backup and recovery : Develop and implement database backup strategies to ensure data security and recoverability.
-
• Performance Tuning : Monitor database performance and optimize it to ensure efficient database operation.
-
4. Application operation and maintenance :
-
• Application Deployment : Manage and execute deployment and updates of applications.
-
• Application monitoring : Monitor the running status of the application, including log analysis, error tracking, etc.
-
• Application performance optimization : Optimize and adjust application performance issues.
-
5. Security and Compliance :
-
• Security strategy and implementation : Develop security strategies to ensure the security of systems and data.
-
• Compliance management : Follow industry standards and regulatory requirements to ensure compliance of systems and data.
-
6. Automation and script management :
-
• Automation Tools and Scripting : Use scripting and automation tools to simplify repetitive tasks and improve efficiency.
-
• Script management and version control : Manage and maintain scripts to ensure their security and maintainability.
-
7. Container and Cloud Infrastructure Management :
-
• Containerized technology management : Manage containerized environments such as Docker, Kubernetes, etc.
-
• Cloud infrastructure management : Managing resources on the cloud platform, including configuring, monitoring, and optimizing cloud services.
Directory Planning
ops/
├── roles/(ansible脚本)
├── system/(系统管理)
├── network/(网络监控)
├── database/(数据库管理)
├── application/(应用管理)
├── security/(安全合规)
├── automation/(自动化)
└── infrastructure/(基础设施)
└── templates/ (模板脚本)
Related specifications
-
1. The roles naming includes two parts: platform (aip) and business. The platform name starts with , while the business name is customized.
aip
-
2. Currently defined related operation identifiers:
-
• backup
-
• restore
-
• upgrade
-
• install
-
• uninstall
-
• check
-
• bus business
-
• security
-
• server
-
• clean
Download
https://pan.quark.cn/s/ab6c8ac8a6fe
autumn
The recruitment has already begun. If you are not well prepared,
autumn
It's hard to find a good job.
Here is a big employment gift package for everyone. You can prepare for the spring recruitment and find a good job!