- LVLM: ARC Prize, Chatbot, LLM Benchmark, OmniSpatial
- Detection: Zero-Shot COCO, Open-Voc COCO, Open-Voc LVIS
- Text-encoder: MTEB, SuperGLUE
- 6D Object Pose: BOP
- other: KITTI, Mip-NeRF 360
Install the basic requirements:
pip install -r requirements.txtSelect preferences and run the command to install PyTorch >= 2.5.1 locally.
pip install open_clip_torchpip install mss git+https://github.com/facebookresearch/detectron2.git git+https://github.com/openai/CLIP.gitpip install provider pyrender torch-clusterpip install xformers==0.0.29 # for torch 2.5.1
pip install --no-deps git+https://github.com/ByteDance-Seed/Depth-Anything-3.git
pip install e3nn evo "moviepy<2" pillow-heif plyfile pycolmappip install git+https://github.com/IDEA-Research/GroundingDINO.gitpip install git+https://github.com/facebookresearch/sam2.gitpip install fairscale git+https://github.com/xinyu1205/recognize-anything.git
wget https://huggingface.co/spaces/xinyu1205/recognize-anything/resolve/main/tag2text_swin_14m.pth?download=true
mv tag2text_swin_14m.pth?download=true checkpoints/tag2text_swin_14m.pthpip install diffusersInstall the following packages to run the web server:
pip install fastapi uvicornYou need to create a Python file that stores the API in a dictionary format within a variable named FUNCTIONS, and set the file path in server.py.
Then, run the server using the command below:
uvicorn server:app --port 8000