AWS Elasticsearch Web Performance Data Shipper

Programatically ship endpoint web browser preformance data to an AWS Elasticsearch Service endpoint. This project is currently under Development. Please check back soon.

Overview

Selenium boots, and tries to fully load different websites, collects Preformance Timings listed below and shipps them to an AWS Elasticsearch Service.

Flow of Performance Timings on Web Browsers:

navigationStart -> redirectStart -> redirectEnd -> fetchStart -> domainLookupStart -> domainLookupEnd -> connectStart -> connectEnd -> requestStart -> responseStart -> responseEnd -> domLoading -> domInteractive -> domContentLoaded -> domComplete -> loadEventStart -> loadEventEnd

Project Structure

See below for an explanation of the files in the tree.

├── Pipfile → Pipenv requirements.
├── Pipfile.lock → Pipenv lock file.
├── README.md
├── Dockerfile
├── requirements.txt → Python requirements.
├── run.py → Run script.
└── src
    ├── __init__.py
    ├── config.py → Developer created config file.
    ├── config_template.py → Template for config.py file.
    ├── cronTab.py →
    └── seleniumDataShipper.py → Main shipper script.
└── test
    ├── __init__.py
    └── test.py → Unittest.

Run This Yourself

Local Development

This data shipper is running on Python 3.7. We strongly advise the use of either Anaconda or pipenv to manage a virtual environment in which you can install the dependencies for local development.

Without Docker

Clone this repository.
Run pip install -r requirements.txt to install the dependencies.
Generate some (AWS Elasticsearch Service) keys, and put them in a src/config.py file like so:

Please use src/config_template.py file as a template.

AWS_ES_ENDPOINT = {
    'aws_access_key_id': '',
    'aws_secret_access_key': '',
    'host': '',
    'region': '',
    'service': ''
}

To add the Elasticsearch index and source specific details look into the src/config.py you created, and put them in like so:

Please use src/config_template.py as a reference.

ES_INDEX = {
    'index': '',
    'doc_type': '',
    'source': ''
}

To add websites (i.e. https://www.google.com) look into the src/config.py you created, and put them in like so:

Please use src/config_template.py as a reference.

webPage = ["https://www.google.com"]

Install the ChromeDriver by using the command specific to your OS:

On macOS: brew cask install chromedriver
- Use Homebrew to install the chrome driver by running the command: brew cask install chromedriver
  - If you have already installed the ChromeDriver once and need to update it:
    - First try brew cask install chromedriver
    - If that errors try brew cask reinstall chromedriver
    - If that also errors run rm /usr/local/bin/chromedriver and then rerun brew cask install chromedriver
- Confirm it installed by running: chromedriver -version
On Windows: choco install chromedriver
- Start by installing the Chocolatey dependency manager using the Install with CMD method.
- Then install the ChromeDriver by running: choco install chromedriver
- Confirm it installed by running: chromedriver -version

With Docker

Clone this repository.
Generate some (AWS Elasticsearch Service) keys, and put them in a src/config.py file like so:

Please use src/config_template.py file as a template.

AWS_ES_ENDPOINT = {
    'aws_access_key_id': '',
    'aws_secret_access_key': '',
    'host': '',
    'region': '',
    'service': ''
}

To add the Elasticsearch index and source specific details look into the src/config.py you created, and put them in like so:

Please use src/config_template.py as a reference.

ES_INDEX = {
    'index': '',
    'doc_type': '',
    'source': ''
}

To add websites (i.e. https://www.google.com) look into the src/config.py you created, and put them in like so:

Please use src/config_template.py as a reference.

webPage = ["https://www.google.com"]

Build the docker container with docker build -t aws-es-web-performance-shipper .
Run the docker container with docker run aws-es-web-performance-shipper python run.py --verbose=no and your off!

Production

This data shipper is running on Python 3.7. We strongly advise the use of Anaconda to manage a virtual environment in which you can install the dependencies.

Prod Without Docker

We strongly recommend that this data shipper be run on a docker instance (due to the ease of installation) on the host machine, however if you choose not to, the steps below outline the installation procedure.

Clone this repository.
Run pip install -r requirements.txt to install the dependencies.
Generate some (AWS Elasticsearch Service) keys, and put them in a src/config.py file like so:

Please use src/config_template.py file as a template.

AWS_ES_ENDPOINT = {
    'aws_access_key_id': '',
    'aws_secret_access_key': '',
    'host': '',
    'region': '',
    'service': ''
}

To add the Elasticsearch index and source specific details look into the src/config.py you created, and put them in like so:

Please use src/config_template.py as a reference.

ES_INDEX = {
    'index': '',
    'doc_type': '',
    'source': ''
}

To add websites (i.e. https://www.google.com) look into the src/config.py you created, and put them in like so:

Please use src/config_template.py as a reference.

webPage = ["https://www.google.com"]

Install the ChromeDriver by using the command for your specific OS:

On macOS: brew cask install chromedriver
- Use Homebrew to install the chrome driver by running the command: brew cask install chromedriver
  - If you have already installed the ChromeDriver once and need to update it:
    - First try brew cask install chromedriver
    - If that errors try brew cask reinstall chromedriver
    - If that also errors run rm /usr/local/bin/chromedriver and then rerun brew cask install chromedriver
- Confirm it installed by running: chromedriver -version
On Windows: choco install chromedriver
- Start by installing the Chocolatey dependency manager using the Install with CMD method.
- Then install the ChromeDriver by running: choco install chromedriver
- Confirm it installed by running: chromedriver -version

Run ./run.py and you're off!

Dependencies

Meet the Team

Corey Sabia

Lead Developer

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
.github/workflows		.github/workflows
.vscode		.vscode
src		src
test		test
.gitattributes		.gitattributes
.gitignore		.gitignore
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
dockerfile		dockerfile
requirements.txt		requirements.txt
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AWS Elasticsearch Web Performance Data Shipper

Table of Contents

Overview

Project Structure

Run This Yourself

Local Development

Without Docker

With Docker

Production

Prod Without Docker

Dependencies

Meet the Team

About

Uh oh!

Releases

Packages

Uh oh!

Languages

coreysabia/AWS-Elasticsearch-Web-Performance-Shipper

Folders and files

Latest commit

History

Repository files navigation

AWS Elasticsearch Web Performance Data Shipper

Table of Contents

Overview

Project Structure

Run This Yourself

Local Development

Without Docker

With Docker

Production

Prod Without Docker

Dependencies

Meet the Team

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages