-
Notifications
You must be signed in to change notification settings - Fork 675
[Metrics] Install and configure prometheus server on skypilot cluster (cloud as infra source) #5928
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Draft
rohansonecha
wants to merge
29
commits into
metrics-exporters
Choose a base branch
from
cloud-head-prometheus-srv
base: metrics-exporters
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
* no postgres * enlightenment
initial implementation
* UI: add copy buttons to entrypoint/YAML. * lint * Revert "lint" This reverts commit 9272e31. * lint
* avoid showing cluster yaml for controllers * format
…5905) * refactor * revert change * pipeline update * fix * update name * bug fix
use selected branch
* fix schedule input * add if
fix CLI args
* Nebius VM network tier --------- Co-authored-by: Maknee <henry@assemblesys.com> Co-authored-by: Zhanghao Wu <zhanghao.wu@outlook.com>
Co-authored-by: Maknee <henry@assemblesys.com>
* Volume mounting for SSH Node Pools * make cloud optional arg * Comments * minor docs
* avoid showing cluster yaml for controllers * format * use user yaml * format * format * Minor UX changes * Duration fixes. * revert j2 template removal --------- Co-authored-by: Zongheng Yang <zongheng.y@gmail.com>
return empty list when no config is loaded
…owed_clouds` (#5729) * implement * format * wip overhaul will break * prelim * done
* support setting private workspace and add users to it * fixes for permission * Fix private workspace * format * update code and fix some issues * address comments * Add private badge and avoid checks for no allowed users * Add user hash to user table * fix logging * fix ut and remove useless code * format * Add users view in private workspace * remove useless code * format * ignore dashboard modules in dockerignore * Use user email * format * fix manifest * fix user info overriding * remove duplicate code * Fix server user * only get workspace a user has access to * Add icon for user role * Fix the workspace checking user setting * adjust avatar size * format * filter jobs for private workspaces * fix user fetching * fix the user role in sidebar * ui fixes * Add todos * minor fix * only check specific cloud * fix active job check * Fix user name resolution and add docs * fix message * Add unit test for workspace user resolution * add unit test * type * update code based on latest update * fix interface * use config override and avoid skipping workspace test * revert smoke test * Add unit test for user resources visibility * format * minor * Add note for auth proxy * fixes * format --------- Co-authored-by: Zhanghao Wu <zhanghao.wu@outlook.com>
* add serverside check to update config * Revert "add serverside check to update config" This reverts commit 0da7f03. * serverside check
* Yield worker process while waiting for retry Signed-off-by: Aylei <rayingecho@gmail.com> * Fix UT Signed-off-by: Aylei <rayingecho@gmail.com> * Fix UT Signed-off-by: Aylei <rayingecho@gmail.com> * Address review comments Signed-off-by: Aylei <rayingecho@gmail.com> * Update sky/server/requests/process.py Co-authored-by: Christopher Cooper <cooperc@assemblesys.com> --------- Signed-off-by: Aylei <rayingecho@gmail.com> Co-authored-by: Christopher Cooper <cooperc@assemblesys.com>
extra init containers
Signed-off-by: Aylei <rayingecho@gmail.com>
Signed-off-by: Aylei <rayingecho@gmail.com>
Signed-off-by: Aylei <rayingecho@gmail.com>
Signed-off-by: Aylei <rayingecho@gmail.com>
Signed-off-by: Aylei <rayingecho@gmail.com>
…on aws, gcp, and azure
…ot cloud cluster head node
b9b98b1
to
7d9f63c
Compare
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Tested (run the relevant ones):
bash format.sh
/smoke-test
(CI) orpytest tests/test_smoke.py
(local)/smoke-test -k test_name
(CI) orpytest tests/test_smoke.py::test_name
(local)/quicktest-core
(CI) orpytest tests/smoke_tests/test_backward_compat.py
(local)