Instructions to use abinayam/gpt-2-tamil with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use abinayam/gpt-2-tamil with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="abinayam/gpt-2-tamil")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("abinayam/gpt-2-tamil") model = AutoModelForCausalLM.from_pretrained("abinayam/gpt-2-tamil") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use abinayam/gpt-2-tamil with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "abinayam/gpt-2-tamil" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "abinayam/gpt-2-tamil", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/abinayam/gpt-2-tamil
- SGLang
How to use abinayam/gpt-2-tamil with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "abinayam/gpt-2-tamil" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "abinayam/gpt-2-tamil", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "abinayam/gpt-2-tamil" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "abinayam/gpt-2-tamil", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use abinayam/gpt-2-tamil with Docker Model Runner:
docker model run hf.co/abinayam/gpt-2-tamil
limited pre-training and guard rails
hi Abinaya and team?:
This is great effort; I think you should write a paper and post to Arxiv on this topic and significant contribution for Tamil.
Can you release the methodology for training, tokenization and encoding representations ?
However since the model seems to be having some limited self correction and guard rails, and the model has limited cleanup of personally-identifiable information it should be mentioned in the announcement and user guide. There should be more guard rails added to this model and harmful content generation should be listed.
Thank you
-Muthu Annamalai
Hi @mannamalai : Sure, will try to write up a paper outlining the points you mentioned. This model was built as part of a hackathon and the amount of data used to pretrain the model is super less. We have plans to improve this model further as part of the AI Tamil Nadu's initiative.