avadesian

avadesian synced commits to main at avadesian/Vicuna from mirror

f22f2194c9 Register SmaugChatAdapter. (#3243)

1 day ago

avadesian synced commits to master at avadesian/skypilot from mirror

20493fb616 Quick error handling fix (#3447) syntax fix

1 day ago

avadesian synced commits to fix-docker-gpu at avadesian/skypilot from mirror

c24208d68f fix and add smoke test
44d7f81934 Apply suggestions from code review
Compare 2 commits »

1 day ago

avadesian synced commits to fix-controller-cloud at avadesian/skypilot from mirror

c8319b634f Merge remote-tracking branch 'origin/master' into fix-controller-cloud
c9f575cf3d better nc error message on mac (#3374) * initial commit * newline * comments * run linter * reminder for down * tentatively done with example * formatting * yapf * [Storage] Storage mounting tool permissions fix (#3215) * fix permissions * fix permissions * [LLM] Example for Serving Gemma (#3207) * Add serve for gemma and fix mixtral dependency * Add hf token * fix model len * Add comment * Serve your private gemma * fix serve yaml * readme * Remove chat completion due to the wrong template * add readme * Update llm/gemma/README.md Co-authored-by: Zongheng Yang <zongheng.y@gmail.com> * address comments * Update README.md Co-authored-by: Zongheng Yang <zongheng.y@gmail.com> * Update llm/gemma/README.md Co-authored-by: Zongheng Yang <zongheng.y@gmail.com> * Update llm/gemma/README.md Co-authored-by: Zongheng Yang <zongheng.y@gmail.com> * Update llm/gemma/README.md Co-authored-by: Zongheng Yang <zongheng.y@gmail.com> * Change to it * Add chat API * use HF_TOKEN env * typo --------- Co-authored-by: Zongheng Yang <zongheng.y@gmail.com> * [LLM] Add logo for Gemma (#3220) * Minor fixes for release 0.5.0 (#3212) * when removing cudo credential, sky check fails * remove tips * minor hint fix * fix cluster version for k8s * fix typo * [Docker] Add retry for docker pull due to daemon not ready (#3218) * Add retry for docker pull due to daemon not ready * longer wait time * longer wait time * retry earlier * add retry for retries as well * longer wait time * change wait time * format * Add comment * Fix * Fix indent for azure docker config * Fix docker login config * Fix comments * More robust docker login config * Add retry for docker check * minor fix * Add additional test for stop and start with docker * Fix cancelled * added comments * quick fix * finished pip issues * fix * fix storage error message, add example link to docs * changed error message if default nc installed on mac * refactored check_port_forward_mode_dependencies function * update comment --------- Co-authored-by: Sheth <shethhriday29@berkeley.edu> Co-authored-by: Romil Bhardwaj <romil.bhardwaj@berkeley.edu> Co-authored-by: Zhanghao Wu <zhanghao.wu@outlook.com> Co-authored-by: Zongheng Yang <zongheng.y@gmail.com> Co-authored-by: Romil Bhardwaj <romil.bhardwaj@gmail.com>
91d6d1bb54 [Core] Add ~/.local/bin to make `which ray` work if ray is installed in `~/.local` (#3368) * save ray path * only use path in file when non empty * try grepping but fail * hardcode ~/.local/bin * format * add comments * Add comments * format * avoid backward compat test conflict * fix sleep * Fix the task ID for spot pipeline
e60eb73642 [Spot] Refactor spot APIs into `spot.xxx` (#3417) * Refactor spot core APIs to `sky.spot.core` * Add comment * fix * format * change to spot_lib instead * Update sky/spot/core.py Co-authored-by: Tian Xia <cblmemo@gmail.com> * address comments * Add deprecation message * fix * format * minor fix for backward compat test * longer time * longer wait for spot backward test * fix --------- Co-authored-by: Tian Xia <cblmemo@gmail.com>
226c1eb094 [AI Gallery] Add quantized LLMs with Ollama (#3422) * WIP * arm64 support * wip * wip * add ollama to ai gallery * minor edits * minor edits * Updates * comments * Add 'new' tag
Compare 13 commits »

1 day ago

avadesian synced commits to master at avadesian/skypilot from mirror

c9f575cf3d better nc error message on mac (#3374) * initial commit * newline * comments * run linter * reminder for down * tentatively done with example * formatting * yapf * [Storage] Storage mounting tool permissions fix (#3215) * fix permissions * fix permissions * [LLM] Example for Serving Gemma (#3207) * Add serve for gemma and fix mixtral dependency * Add hf token * fix model len * Add comment * Serve your private gemma * fix serve yaml * readme * Remove chat completion due to the wrong template * add readme * Update llm/gemma/README.md Co-authored-by: Zongheng Yang <zongheng.y@gmail.com> * address comments * Update README.md Co-authored-by: Zongheng Yang <zongheng.y@gmail.com> * Update llm/gemma/README.md Co-authored-by: Zongheng Yang <zongheng.y@gmail.com> * Update llm/gemma/README.md Co-authored-by: Zongheng Yang <zongheng.y@gmail.com> * Update llm/gemma/README.md Co-authored-by: Zongheng Yang <zongheng.y@gmail.com> * Change to it * Add chat API * use HF_TOKEN env * typo --------- Co-authored-by: Zongheng Yang <zongheng.y@gmail.com> * [LLM] Add logo for Gemma (#3220) * Minor fixes for release 0.5.0 (#3212) * when removing cudo credential, sky check fails * remove tips * minor hint fix * fix cluster version for k8s * fix typo * [Docker] Add retry for docker pull due to daemon not ready (#3218) * Add retry for docker pull due to daemon not ready * longer wait time * longer wait time * retry earlier * add retry for retries as well * longer wait time * change wait time * format * Add comment * Fix * Fix indent for azure docker config * Fix docker login config * Fix comments * More robust docker login config * Add retry for docker check * minor fix * Add additional test for stop and start with docker * Fix cancelled * added comments * quick fix * finished pip issues * fix * fix storage error message, add example link to docs * changed error message if default nc installed on mac * refactored check_port_forward_mode_dependencies function * update comment --------- Co-authored-by: Sheth <shethhriday29@berkeley.edu> Co-authored-by: Romil Bhardwaj <romil.bhardwaj@berkeley.edu> Co-authored-by: Zhanghao Wu <zhanghao.wu@outlook.com> Co-authored-by: Zongheng Yang <zongheng.y@gmail.com> Co-authored-by: Romil Bhardwaj <romil.bhardwaj@gmail.com>
91d6d1bb54 [Core] Add ~/.local/bin to make `which ray` work if ray is installed in `~/.local` (#3368) * save ray path * only use path in file when non empty * try grepping but fail * hardcode ~/.local/bin * format * add comments * Add comments * format * avoid backward compat test conflict * fix sleep * Fix the task ID for spot pipeline
Compare 2 commits »

3 days ago

avadesian synced commits to main at avadesian/guidance from mirror

e18be92f53 Update README.md
5c9776e0ae First stab at minimal mypy implementation (#738) Issue: https://github.com/guidance-ai/guidance/issues/729 https://github.com/guidance-ai/guidance/issues/257 - **Set up the structure of mypy installs, now to start typing things** - **fixed guidance/library/_regex.py:13: SyntaxWarning: invalid escape sequence '\['** - **First stab at minimal mypy implementation** This is a non-strict implementation of mypy, with cicd integrated. After this is approved, we can either slowly move towards a strict implementation, or each PR can simply choose how much they want/need to contribute to the typing system to help themselves understand the codebase.
c366498702 Generation with `pydantic` schemas (#724) This PR adapts some previous work by @riedgar-ms to enable generating valid JSON given a pydantic model.
03d2b978df [Test] Expand testing of substring() (#751) The `substring()` function wasn't being tested in the builds. Make the following changes: - Add a `name` argument to `substring()` so we can extract the parts we need - Tweak the existing test, so that it is not skipped - Add several tests using the `Mock` which should capture the various automaton states in `substring()`
8a34f7a054 Merge pull request #750 from riedgar-ms/riedgar-ms/extra-docs-01 [Docs] Expand documentation
Compare 13 commits »

3 days ago

avadesian synced commits to operation at avadesian/Vicuna from mirror

4 days ago

avadesian synced commits to main at avadesian/Vicuna from mirror

5095615810 Code update (#3194) Co-authored-by: simon-mo <simon.mo@hey.com>
2d9e2a6111 Store Images Remotely on GCS (#3172)
Compare 2 commits »

4 days ago

avadesian synced commits to spot-hedge at avadesian/skypilot from mirror

4ddebf344f bugfix
0eda8313bc e2e info dump & bugfix
d42e679f88 added overprovision
10425edfa4 remove test.yaml
ed685d64ee disable retry_until_up
Compare 6 commits »

4 days ago

avadesian synced commits to serve_k8s_playground at avadesian/skypilot from mirror

b95c664918 Global logging and stdout to container logs
8494c574b5 make ssh services optional
Compare 2 commits »

4 days ago

avadesian synced commits to ray_path at avadesian/skypilot from mirror

19985e92ea Fix the task ID for spot pipeline
e03e7a990f fix sleep
4b40d30365 avoid backward compat test conflict
48e57e214e format
a2a2d1ecc1 Add comments
Compare 40 commits »

4 days ago

avadesian synced commits to main at avadesian/Vicuna from mirror

7524a58eba Add support for Smaug-2. (#3211)
6a98121b87 Reka AI model integration (#3235) Co-authored-by: Che Zheng <chezheng@reka.ai>
a0866e8a11 Update README.md (#3239)
12f1873153 Add YandexGPT API support (#3116)
Compare 4 commits »

1 week ago

avadesian synced commits to serve_k8s_final at avadesian/skypilot from mirror

3d24b41c49 fix docs build
99547dba78 refactor
87c77df40d refactor
33dfb01354 add lb comment
937feb6701 expand elif
Compare 40 commits »

1 week ago

avadesian synced commits to master at avadesian/skypilot from mirror

e60eb73642 [Spot] Refactor spot APIs into `spot.xxx` (#3417) * Refactor spot core APIs to `sky.spot.core` * Add comment * fix * format * change to spot_lib instead * Update sky/spot/core.py Co-authored-by: Tian Xia <cblmemo@gmail.com> * address comments * Add deprecation message * fix * format * minor fix for backward compat test * longer time * longer wait for spot backward test * fix --------- Co-authored-by: Tian Xia <cblmemo@gmail.com>
226c1eb094 [AI Gallery] Add quantized LLMs with Ollama (#3422) * WIP * arm64 support * wip * wip * add ollama to ai gallery * minor edits * minor edits * Updates * comments * Add 'new' tag
Compare 2 commits »

1 week ago

avadesian synced commits to spot-refactor at avadesian/skypilot from mirror

272761d656 longer time
0ddf5e6305 minor fix for backward compat test
77cd0fe31b format
cedf420cf3 fix
63452e7b3f Add deprecation message
Compare 7 commits »

1 week ago

avadesian synced commits to serve_k8s_playground at avadesian/skypilot from mirror

bfa54c0301 add PODIP mode support

1 week ago

avadesian synced commits to master at avadesian/skypilot from mirror

d0f20abaa5 [kubernetes] Add curl and other dependencies in k8s image (#3392) Add curl and other dependencies in k8s image

1 week ago

avadesian synced commits to examples_ollama at avadesian/skypilot from mirror

b29b5b9f77 minor edits
2a89d19a9b minor edits
Compare 2 commits »

1 week ago

avadesian synced commits to serve_k8s_playground_ha at avadesian/skypilot from mirror

91722ebbe4 add sorting for out-of-order timestamps in HA

1 week ago

avadesian synced commits to master at avadesian/skypilot from mirror

c65b258acf [UX] Add cluster info in task envs (#3426) * Add task name for the spot job * Add dag name and task name for spot job * fix dag * starting 1 * format * Address comments * add env vars * new line * Add cluster info in the env vars * add spot in the cluster info * fix env var docs * cloud change to str * Fix quoting * Add example for parsing json string * format * address comments * Add smoke tests * format * update doc
48a5c63c42 [Serve] Fail early for user app failure and expose failure reasons (#3411) * expose detailed replica failure and rename service failure to crash loop * fix path in test * add target qps * shorter wait time * fix smoke * fix smoke test * add `;` back * Revert crash loop * update failed_status * typo * format * Add initial delay failure * Add initial delay failure * format * do not scale when user app fails * format * Add tests for failure statuses * syntax error * make service termination more robust * fix smoke test * Fix permission issue if not tpu is not needed * fix test * fail early for initial delay timeout * format * format * remove unecessary logger * fix * explicit annotation of scale down * fix logs * format * fix test with non spot * Address comments * add comments * longer time * Update sky/serve/autoscalers.py Co-authored-by: Tian Xia <cblmemo@gmail.com> * Update sky/serve/autoscalers.py Co-authored-by: Tian Xia <cblmemo@gmail.com> * Update sky/serve/replica_managers.py Co-authored-by: Tian Xia <cblmemo@gmail.com> --------- Co-authored-by: Tian Xia <cblmemo@gmail.com>
48b8ca978e [Spot] Add spot job name in the `SKYPILOT_TASK_ID` env var (#3424) * Add task name for the spot job * Add dag name and task name for spot job * fix dag * starting 1 * format * Address comments * add env vars * new line * Update docs/source/running-jobs/environment-variables.rst Co-authored-by: Zongheng Yang <zongheng.y@gmail.com> * rename TASK_ID and add the env var in setup * format * remove _spot_ * fix docs * fix * fix * format * fix * Update docs/source/running-jobs/environment-variables.rst Co-authored-by: Zongheng Yang <zongheng.y@gmail.com> * update docs * starting 0 * slight update * use quote --------- Co-authored-by: Zongheng Yang <zongheng.y@gmail.com>
Compare 3 commits »

1 week ago