Soramai · Docs
Getting started.
Sign up, top up, prepare a dataset, fine-tune a LoRA adapter, test it in the playground, and deploy it as a live API. Total time: about 15 minutes the first time, 5 minutes after that.
Before you start
One account, one supported browser, and a few dollars of credits is all you need. No CUDA installs, no GPU drivers, no Kubernetes.
- A Soramai account. Created via Google or Apple sign-in. New accounts receive 30 starter coins so you can run a small fine-tuning job before topping up.
- A modern desktop browser. Chrome, Safari, Firefox, or Edge from the last two years. Mobile works for monitoring existing runs but the fine-tuning launch flow is desktop-first.
- A dataset (or a prompt). Either bring your own JSONL file, or describe what you want and let Soramai generate it for you.
- $5–$25 in credits for your first end-to-end run. See the pricing page for exact pack sizes.
Step 1 · Create your account
Soramai uses passwordless sign-in. Apple or Google handles the credential and recovery.
- 1Go to soramai.com/signup.
- 2Click Continue with Google or Continue with Apple.
- 3Approve the OAuth consent screen. Your account is created instantly with 30 starter coins.
- 4You’ll land on the dashboard with your wallet balance visible top-right.
Step 2 · Top up credits
Skip this step if 30 starter coins is enough for your first run. A 100-step Beginner-tier fine-tuning costs about 8 coins.
- 1On the dashboard, scroll to Add credits.
- 2Pick a pack. The 20,000 coins / $24 pack is the most common starting point — it covers 30–60 small fine-tuning runs.
- 3Click Checkout. You’re redirected to Stripe.
- 4After payment your balance updates within 30 seconds. A receipt is emailed automatically.
Credits never expire while the account is active. Refunds for failed jobs caused by platform faults (worker crash, GPU eviction, adapter validation failure) are issued automatically — you only pay for fine-tuning time that produced a usable result.
Step 3 · Prepare a dataset
You have three options. Pick whichever matches what you already have on hand.
Generate with AI →
Describe the data you want and let Soramai produce it. Great when you want a fast prototype without curating examples by hand. 500-row datasets generate in 60–120 seconds.
Write it yourself →
Open Dataset Studio in the dashboard and type rows directly. Best when you have specific examples in mind and want full control.
Upload a JSONL →
Drop a .jsonl file into Dataset Studio. Each line is a JSON object with prompt and response fields. Up to 20 MB / 50,000 rows.
Minimum viable dataset shape (JSONL):
{"prompt":"Summarise the changelog: ...","response":"Released 1.4 ..."}
{"prompt":"Classify this ticket as bug/feature: ...","response":"feature"}
{"prompt":"...","response":"..."}Both prompt and response are required. Optional system_prompt is honoured if present.
Step 4 · Launch a fine-tuning run
Two clicks once your dataset is ready.
- 1Open soramai.com/train.
- 2Pick a base model from the dropdown. Beginner (Qwen 2.5 7B) is the default and the cheapest. Switch to Pro or Elite tiers for larger models.
- 3Click Pick from My Datasets and select the one you prepared. Or paste a signed URL.
- 4Pick a step count. 100 steps is a good first run — ~3 minutes, ~8 coins. Bump to 500 once you’re happy with the recipe.
- 5Confirm the cost estimate, click Launch fine-tuning. You’ll be sent to the live fine-tuning detail page.
The dashboard live-bills the run per second. You can safely close the tab — the worker keeps running and your balance updates in the background. A push notification arrives when the run completes (if you have the iOS app installed).
Step 5 · Test in the playground
Verify the model learned what you wanted before deploying. The playground is free except for GPU runtime.
- 1From the completed fine-tuning page, click Open in playground.
- 2A warm inference pod starts (~30 seconds first time). After that, messages return in under a second.
- 3Try 5–10 prompts that represent your real workload. If the model nails them, you’re ready to deploy. If not, add more fine-tuning examples and re-train.
- 4Playground sessions auto-terminate after 10 minutes of idle to save coins. You’re billed per minute of warm GPU time.
Step 6 · Deploy as an API
One click to turn a fine-tuned adapter into an autoscaling HTTPS endpoint your code can call.
- 1From the completed fine-tuning page, click Deploy. Or open soramai.com/deployments and pick from your model list.
- 2Set a deployment name (just for your reference) and confirm.
- 3Soramai provisions a serverless inference endpoint. The dashboard shows the URL and an API key (shown once — copy it now).
- 4Test the endpoint with the curl snippet shown on the deployment page, or jump to the Inference & Deploy guide for the request schema.
First request:
curl https://soramai.com/api/v1/inference \
-H "Authorization: Bearer sk-ai-xxxxxxxxxx" \
-H "Content-Type: application/json" \
-d '{"prompt": "Hello"}'Next steps
You’ve done the full loop. From here, pick whichever path matters most for your project.
Datasets reference →
JSONL schema, image-dataset format, validation rules, multi-dataset merging, AI generation parameters.
Fine-tuning reference →
Base model catalogue, LoRA hyperparameters, monitoring metrics, refund policy for platform faults.
Inference & Deploy →
Playground vs Deploy API, request and response shapes, streaming, rate limits, billing.
Pricing →
Coin pricing, per-tier rates, worked examples for typical fine-tuning and inference workloads.
Security →
Data handling, isolation, encryption, responsible disclosure.
Status →
Live service health for the dashboard, fine-tuning pool, inference endpoints, and billing.