A2E FAQ
- What are credits?
Credits are used to generate assets (videos and audio). We offer free credits to new users. After the trial period, credits can be purchased via Stripe or PayPal.
As of April 26th, 2025, the costs of A2E functions are:
- AI avatar video (lip-sync video): 1 credit per second. For example, a 1-minute AI avatar video costs 60 credits.
- Image to video: 30 credits per video (5 seconds long)
- Talking photo: 30 credits per photo
- Voice clone: free of charge. You only pay for the length of the content generated.
- Video Avatar (create custom ai avatar from a video): free of charge. You only pay for the length of the content generated.
- Studio Avatar (refine the lip-sync quality of a video avatar): 100 credits per avatar
- Eye Contact (refine the eye gaze direction of a video avatar): 100 credits per avatar
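The price list above can be turned into a quick budget estimate. The following is a minimal sketch: the function and constant names are illustrative and not part of any A2E SDK, and because the list does not say how fractional seconds are billed, the example works in whole seconds.

```python
# Per-use prices copied from the list above (as of April 26th, 2025).
LIPSYNC_CREDITS_PER_SECOND = 1
IMAGE_TO_VIDEO_CREDITS = 30   # per 5-second video
TALKING_PHOTO_CREDITS = 30    # per photo
STUDIO_AVATAR_CREDITS = 100   # per avatar
EYE_CONTACT_CREDITS = 100     # per avatar


def estimate_credits(lipsync_seconds=0, image_to_videos=0, talking_photos=0,
                     studio_avatars=0, eye_contact_avatars=0):
    """Return the total credits needed for a planned batch of jobs."""
    return (
        lipsync_seconds * LIPSYNC_CREDITS_PER_SECOND
        + image_to_videos * IMAGE_TO_VIDEO_CREDITS
        + talking_photos * TALKING_PHOTO_CREDITS
        + studio_avatars * STUDIO_AVATAR_CREDITS
        + eye_contact_avatars * EYE_CONTACT_CREDITS
    )


# A 1-minute AI avatar video, as in the example above.
print(estimate_credits(lipsync_seconds=60))  # 60
```

Voice clone and Video Avatar creation are left out because they are free; only the generated content is charged.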
- What payment methods are accepted?
We accept payments via Stripe and PayPal.
- Is it safe to make payments on a2e.ai?
Yes, your payment security is our top priority. We use the industry-standard processors stripe.com and paypal.com to handle payments, so we never see or store your financial information. SSL encryption protects your credit card information in transit.
- Can I get a refund on purchased credits?
Because we offer a free trial with free avatar cloning and free video synthesis, we do not refund purchases. However, if you accidentally place duplicate orders, please contact our support team within 72 hours of your purchase.
- Do I need to enter my credit card for the free trial?
No, you can sign up and enjoy the free trial without entering any credit card information.
- Are there any hidden fees when making payments?
There are no hidden fees. You pay for the credits, and you use those credits according to our fee schedule. You can review the pricing details on our website before making a purchase.
- Who can I contact if I have problems with my payment?
If you encounter any difficulties with your payment, please contact our customer support team on Discord or at contact@a2e.ai immediately. We’re here to help!
- Would you be able to give us a limited API access key so we can evaluate the platform?
We offer free credits to every new developer. Simply sign up for a new account at https://video.a2e.ai/. Your account will automatically gain access to the API.
Please read our API documentation at https://api.a2e.ai to learn how to obtain an API token using your account ID.
A new user automatically gets 100 credits for free, which should be enough for an initial API test. If you need more for further evaluation (or for your final application), please consider purchasing additional credits, starting from $9.9.
P.S. If you just want to quickly evaluate the visual results of AI avatars, you can skip code development and use our web UI at https://video.a2e.ai. The web UI is built on the same API.
- What kind of support structure do you have in place for API developers?
Our recent service availability is ~99.6%. Our tech team monitors outages 24/7.
If you experience an unusual failure or timeout, you can always reach us by email (we usually reply within 1 business day). If your average monthly spend exceeds $500 (for example, you have a Dedicated Line), you can also reach us on WhatsApp (we usually reply within 2 hours during business hours).
We have customer support in San Francisco and Korea. Business hours are those of our teams in these two locations.
- What is “Credit Top Up”?
Users in the Creator Tier can purchase additional credits. The top-up prices are:
- $9.9 = 600 credits
- $19.9 = 1800 credits
- $80 = 10,000 credits
Additionally purchased credits never expire.
Credit top-up is a great way for API developers to bootstrap their projects.
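To compare the three top-up packs, it can help to work out the price per credit. This is a small illustrative calculation using only the prices listed above; the names are made up for the example.

```python
# (price in USD, credits) for each top-up pack listed above.
TOP_UP_PACKS = [(9.9, 600), (19.9, 1800), (80.0, 10_000)]


def price_per_credit(price_usd, credits):
    """Return the cost of a single credit in USD for one pack."""
    return price_usd / credits


for price_usd, credits in TOP_UP_PACKS:
    unit = price_per_credit(price_usd, credits)
    print(f"${price_usd} for {credits} credits: ${unit:.4f} per credit")
```

This works out to roughly 1.65, 1.11, and 0.8 cents per credit. At 1 credit per second of AI avatar video, a 1-minute video therefore costs between about $0.48 and $0.99 in purchased credits, depending on the pack.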
- How do free daily credits work?
Users in the Free and Creator Tiers receive free credits every day. The free credits are issued daily at 0:00 UTC. Currently:
- Free Tier users have their credits reset to 30 if their balance is below 30.
- Creator Tier users have their credits reset to 60 if their balance is below 60.
Credits do not increase if the user’s balance is already at or above the threshold. For example, if a Free Tier user holds 80 credits (obtained through the referral program) at 0:00 UTC, their balance stays the same.
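The daily reset rule is a floor on the balance rather than a fixed grant. A minimal sketch of that rule, with illustrative names and the tier thresholds quoted above:

```python
# Daily reset thresholds per tier, as described above.
DAILY_FLOOR = {"free": 30, "creator": 60}


def balance_after_daily_reset(balance, tier):
    """Apply the 0:00 UTC rule: top the balance up to the tier's threshold, never reduce it."""
    return max(balance, DAILY_FLOOR[tier])


print(balance_after_daily_reset(10, "free"))     # 30
print(balance_after_daily_reset(80, "free"))     # 80
print(balance_after_daily_reset(45, "creator"))  # 60
```

A user with 10 credits therefore gains 20 that day, while a user who already holds 80 gains nothing.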
- What is A2E’s Image-to-Video tool?
A2E’s Image-to-Video tool uses AI to turn any image into a short, dynamic video with facial movement and expression. No video editing required.
- What types of images work best?
High-quality, front-facing portraits or clear subject images with minimal background noise yield the best results. We support PNG, JPG, JPEG, and WEBP formats with a maximum file size of 10MB, and the tool can generate videos in up to 4K resolution.
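If you upload images programmatically, a quick local check against these limits can save a failed request. The sketch below is illustrative only: the helper name is made up, and it assumes "10MB" means 10 × 1024 × 1024 bytes, which is not stated above.

```python
from pathlib import Path

ALLOWED_EXTENSIONS = {".png", ".jpg", ".jpeg", ".webp"}
MAX_BYTES = 10 * 1024 * 1024  # assumption: "10MB" is interpreted as 10 MiB


def check_image(filename, size_bytes):
    """Return a list of problems; an empty list means the file passes these two checks."""
    problems = []
    if Path(filename).suffix.lower() not in ALLOWED_EXTENSIONS:
        problems.append("unsupported format")
    if size_bytes > MAX_BYTES:
        problems.append("file larger than 10MB")
    return problems


print(check_image("portrait.PNG", 2_000_000))       # []
print(check_image("clip.gif", 11 * 1024 * 1024))    # ['unsupported format', 'file larger than 10MB']
```

The check only looks at the file name and size; it does not verify that the image is a clear, front-facing portrait.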
- How long does it take to generate a video?
Most videos are ready within 10–30 seconds, depending on complexity and server load.
- Is the generated video reusable or editable?
Yes. You can reuse, download, or regenerate videos with new inputs or effects.
- Can I use the videos for commercial projects?
Videos created with your own content are typically fine for commercial use, but please check A2E’s terms of use for full licensing details.
- Do I need any editing skills to use this tool?
Not at all. The tool is designed for both beginners and prosumers—just upload, describe, and go.
- Is my content private and secure?
Yes. Your uploaded content is securely processed.
- Is this free to use?
When you register as a new user, you will receive 100 credits. Afterwards, you can earn 30 credits daily, and you also have the option to subscribe to a plan.
- What is the “Link to video” API?
The “Link to video” API allows you to convert any web page URL into a short video ad. Simply make an API call containing the URL, and the API will gather all the necessary information from the page.
- What sets A2E AI API apart from other video generation APIs?
The A2E API offers lip synchronization for digital AI characters. It is designed to generate ready-to-use marketing videos that require no further post-production.
- What are the prerequisites for using the A2E Video API?
To use the A2E Video API, you need an active A2E account and must complete the authentication process described in the API documentation.
- How do I create a new link using the “Link to video” API?
To create a new link, send a POST request to the /api/links/ endpoint with your API ID and API Key in the headers, and the URL you want to convert in the request body. The API will return a link ID that you can use to create a video.
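The request described above might be built as follows. This is a sketch under stated assumptions, not the documented interface: the host in `BASE_URL` is a placeholder, the header names `X-API-ID` and `X-API-KEY` are hypothetical, and the body field `url` is a guess. Check the API documentation for the real values.

```python
import json
import urllib.request

BASE_URL = "https://api.example.com"  # placeholder host; use the one from the API documentation


def build_create_link_request(page_url, api_id, api_key):
    """Build (but do not send) a POST request to /api/links/ for the given page URL."""
    body = json.dumps({"url": page_url}).encode("utf-8")  # "url" field name is an assumption
    return urllib.request.Request(
        BASE_URL + "/api/links/",
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "X-API-ID": api_id,    # hypothetical header name
            "X-API-KEY": api_key,  # hypothetical header name
        },
    )


req = build_create_link_request("https://example.com/product", "my-id", "my-key")
print(req.get_method(), req.full_url)
```

Passing `req` to `urllib.request.urlopen` would send it; the link ID would then be read from the JSON response.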
- Can I update the link information before creating a video?
Yes, it’s recommended to update the link information before creating a video. You can provide a logo URL, remove low-quality image/video URLs, or update the description to highlight specific features. Send a PUT request to the /api/links/{id}/ endpoint to update the link.
- How do I create a video from a link?
To create a video from a link, send a POST request to the /api/link_to_videos/ endpoint with your API ID, API Key, and the link ID you obtained earlier. You can customize the video by specifying the visual style, script style, aspect ratio, video length, language, target audience, and target platform.
- How can I check the status of a video?
After creating a video, you will receive a video ID. Use this ID to send a GET request to the /api/link_to_videos/{id}/ endpoint to check the status of the video. When the status is done, you will find the video output in the video_output field.
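Checking the status usually means polling until the job finishes. The sketch below keeps the HTTP call out of the loop: `fetch_status` is any function you supply that performs the GET request to /api/link_to_videos/{id}/ and returns the parsed JSON as a dict. The `status` key name is an assumption, `video_output` is the field named above, and the interval and timeout defaults are arbitrary.

```python
import time


def wait_for_video(fetch_status, video_id, interval_s=10.0, timeout_s=600.0,
                   sleep=time.sleep):
    """Poll fetch_status(video_id) until its status is 'done', then return video_output.

    Raises TimeoutError if the job is still not done after roughly timeout_s seconds.
    """
    waited = 0.0
    while True:
        item = fetch_status(video_id)
        if item.get("status") == "done":  # "status" key name is an assumption
            return item["video_output"]
        if waited >= timeout_s:
            raise TimeoutError(f"video {video_id} not done after {timeout_s} seconds")
        sleep(interval_s)
        waited += interval_s


# Usage with a stand-in fetcher that reports the job as already finished.
def fake_fetch(video_id):
    return {"status": "done", "video_output": "https://example.com/video.mp4"}


print(wait_for_video(fake_fetch, "abc123"))  # https://example.com/video.mp4
```

Failure statuses are not described here, so this loop treats anything other than done as "still in progress"; a real client should also stop on whatever error status the API reports.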
- What is the “Lip synchronization” API?
The “Lip synchronization” API is a text-to-video AI API that converts text or audio files into realistic videos of people speaking. Our AI model generates high-quality videos with accurate lip movements based on the provided text or audio.
- How do I generate a video using the “Lip synchronization” API?
To generate a video, send a POST request to the /api/lipsyncs/ endpoint with your API ID, API Key, the text you want the person to speak, the creator ID, and the desired aspect ratio. The API will return a lipsync item ID.
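The request body for this call carries three values: the text, the creator ID, and the aspect ratio. A minimal sketch of assembling it follows; the JSON field names (`text`, `creator`, `aspect_ratio`) and the example aspect ratio value are assumptions for illustration, so confirm them against the API documentation.

```python
import json
from dataclasses import asdict, dataclass


@dataclass
class LipsyncJob:
    text: str          # what the person should say
    creator: str       # creator ID; field name is hypothetical
    aspect_ratio: str  # e.g. "9:16"; accepted values are an assumption

    def to_body(self) -> bytes:
        """Serialize the job as a JSON request body, rejecting empty text."""
        if not self.text.strip():
            raise ValueError("text must not be empty")
        return json.dumps(asdict(self)).encode("utf-8")


job = LipsyncJob(text="Hello and welcome!", creator="creator-123", aspect_ratio="9:16")
print(job.to_body().decode("utf-8"))
```

The resulting bytes would be sent in a POST request to /api/lipsyncs/ together with the API ID and API Key.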
- How can I check the status of a lipsync video?
After queuing a generation task, you will receive a lipsync item ID. Send a GET request to the /api/lipsyncs/{id}/ endpoint with this ID to check the status of your video generation. When the status is done, you can download the video using the link provided in the output field.
- How many personas are available for the “Lip synchronization” API?
We offer over 50 different personas for the “Lip synchronization” API. You can find the full list of available personas in our documentation.
- What is the latency of the A2E Video APIs?
The latency of the A2E Video APIs depends on the specific API you are using. For the “Link to video” API, there are two separate requests:
- Link creation: This process is fast and typically takes less than 5 seconds.
- Link to video: Generating a video from a link takes around 5 minutes. This is because the API needs to perform several tasks, such as generating a script, generating audio, matching an avatar, finding the best assets to display, and synchronizing the lip movements.
For the “Lip synchronization” API, generation time is roughly 10 times the video length (a 1:10 ratio), so a 15-second video takes about 150 seconds. Latency also depends on the avatar resolution you choose: higher-resolution avatars take longer to generate than lower-resolution ones.
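These figures can be used to set client-side timeouts. A small sketch of the arithmetic, using only the numbers quoted above; the names are illustrative, and real times vary with avatar resolution and server load.

```python
LIPSYNC_TIME_FACTOR = 10          # generation takes about 10x the video length (1:10)
LINK_CREATION_MAX_SECONDS = 5     # "typically takes less than 5 seconds"
LINK_TO_VIDEO_SECONDS = 5 * 60    # "around 5 minutes"


def estimate_lipsync_seconds(video_seconds):
    """Rough generation time for a lip-sync video of the given length."""
    return video_seconds * LIPSYNC_TIME_FACTOR


def estimate_link_to_video_seconds():
    """Rough upper estimate for creating a link and then generating its video."""
    return LINK_CREATION_MAX_SECONDS + LINK_TO_VIDEO_SECONDS


print(estimate_lipsync_seconds(15))      # 150
print(estimate_link_to_video_seconds())  # 305
```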
- How do I get started with A2E’s AI Video API?
To get started with A2E’s AI Video API, follow these simple steps:
- Create an A2E account and obtain your API ID and API Key.
- Review the API documentation to understand the available endpoints and request parameters.
- Make API requests to create links, update link information, and generate videos.
- Monitor the status of your video generation tasks using the provided endpoints.
- Retrieve and use the generated video content in your applications or campaigns.
- Is support available during the API integration process?
Yes, A2E offers dedicated support during the API integration process. Our support team can answer your questions, provide guidance, and troubleshoot any issues you encounter. To get support, use the chat options available on the website and app, or join our Discord server. Alternatively, you can contact us by email at contact@a2e.ai.
- What are some common use cases for A2E’s AI Video API?
A2E’s generative AI Video API is versatile and can be applied to various use cases, such as:
– E-commerce product promotions and advertisements
– Social media marketing campaigns
– Explainer videos and tutorials
– Personalized customer onboarding and support
– News and entertainment content creation
– Educational content and online course materials
- What’s Virtual Try On?
Virtual Try-On is a technology that lets users see how clothes, accessories, or makeup would look on them — without physically wearing or applying them.
Using AI and computer vision, it overlays virtual items onto a photo.
A2E hopes to make changing outfits and fashion styles less burdensome for users.
- What file formats does the A2E app support?
We accept common image formats including JPG, PNG, and WEBP files. For best results, use high-resolution images that represent garments and the model clearly.
- What clothing items can be swapped with the AI?
Our AI clothes changer app can handle most common garment types including tops, dresses, pants, skirts, and outerwear.
- Can I change clothes in group photos?
Currently, A2E Virtual Try On works best with individual photos. For best results, use photos featuring a single person with a clear, unobstructed view of their body.
- Is it free to swap clothes with AI?
When registering an account, you receive several free credits to try on clothes.
- How many outfit changes can I generate?
The number of outfit changes depends on available credits on your subscription plan. The more credits you have, the more outfits you can try on with our AI.
- Can A2E Virtual Try On handle various clothing styles?
Yes, A2E is designed to adapt to a wide range of clothing styles, from casual to formal, ensuring a versatile try-on experience.
- Is it suitable for all body types?
Absolutely! Our platform is built to accommodate different body types, offering a personalized and inclusive virtual try-on experience.
- How realistic are the try-ons with Virtual Try On?
A2E’s Virtual Try On provides highly realistic try-ons using advanced AI that meticulously adapts clothing to the user’s body and the chosen outfit’s style and texture.
- Do you offer an API?
We do not offer an API at this time. Our team is focused on enhancing the platform’s core features and may consider API availability in the future.
- How does the AI Product in Hand feature benefit my business?
It reduces marketing costs by eliminating traditional photoshoots and creates engaging product visuals that drive sales.
- Can customers see different types of products held by models with your technology?
Yes, our AI tool supports a wide range of products, ensuring lifelike imagery for diverse items.
- Is the AI product visualization experience comparable to traditional photoshoots?
Absolutely, the AI-generated images offer photorealistic quality comparable to professional photoshoots.
- How easy is it to implement AI Product in Hand for my product collection?
Our platform offers seamless integration with minimal technical complexity, allowing effortless implementation across your inventory.
- What is an AI Product Avatar?
An AI Product Avatar is a digital model that can hold and showcase your product in realistic video formats. It eliminates the need for traditional photoshoots, allowing you to create high-quality, customizable videos in minutes.
- What is voice cloning?
Voice cloning is the process of replicating or synthesizing a person’s voice, typically from audio samples, to create its digital replica which can be used to generate speech for various purposes, such as presentations, podcasts, audiobooks, and video voiceovers.
- How many voice samples do I need to upload?
You can upload one voice recording to create your voice clone. The audio file should have a total duration of at least 8 seconds and at most 60 seconds. Voice quality matters more than audio length. We recommend uploading high-quality audio in WAV format. The more varied and clear the recording, the better the quality of the final voice model.
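The duration limits can be checked locally before uploading. The sketch below reads a WAV file's length with Python's standard `wave` module and tests it against the 8 to 60 second range; the function names are illustrative, and the check says nothing about audio quality.

```python
import wave

MIN_SECONDS = 8
MAX_SECONDS = 60


def wav_duration_seconds(path):
    """Return the duration of an uncompressed WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def is_valid_sample_length(duration_seconds):
    """True if the recording length is within the accepted 8 to 60 second range."""
    return MIN_SECONDS <= duration_seconds <= MAX_SECONDS


print(is_valid_sample_length(12.5))  # True
print(is_valid_sample_length(5))     # False
```

For a file on disk, `is_valid_sample_length(wav_duration_seconds("my_voice.wav"))` combines the two steps; the file name here is just an example.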
- How long does it take to create a voice clone?
The processing time depends on the length and quality of your recordings, but typically, you can expect your voice clone to be ready within 2 minutes after uploading.
- What are the requirements for the voice recordings?
The higher the quality of your voice samples, the better the resulting voice model will be. Your recordings should feature a single speaker, be clear, and be free from background noise, music, and effects such as echo or reverb. Avoid long silences, multiple speakers, and ambient noise like air conditioners or street sounds.
- Is my data secure?
Yes, your data is safe with us; we take your privacy and security seriously. All uploaded recordings are handled with care, and we only retain your data for as long as needed to generate your voice clone. Your voice samples are not used for further training of our voice cloning technology or to enhance our other AI products.
- What is AI face swap?
AI face swap refers to the use of artificial intelligence (AI) technology to digitally replace one person’s face in an image or video with another person’s face. This technology employs sophisticated algorithms to analyze facial features, expressions, and movements, allowing for seamless and realistic swapping of faces.
- Is my facial data secure when using this tool?
Absolutely. We prioritize your privacy and security, and we never store your facial data.
- Are the face-swapped results realistic?
Yes, our advanced AI technology ensures natural and realistic results by accurately integrating the target face into the base photo.
- Can I replace my face with any image?
Yes, of course. Face swap allows you to replace your face with any other selected face. For example, you can put a male face on a female portrait, or an aged face on a baby photo. You can be anyone you want without any limits!
- How to achieve the best result?
Here are some tips to help you get the best face swap. First, choose a facial photo with clear facial features, and avoid uploading blurry photos. Second, make sure the photo is front-facing, without any obstructions like hands or other objects covering the face.
- What is video face swap?
Video face swapping digitally replaces one person’s face with another’s in MP4 files, often for a humorous or entertaining effect. Face swapping in MP4 videos is not only possible, it’s also fun. With A2E FaceSwap, you can effortlessly create personalized, fun moments.
- Can I swap faces between animals and humans?
Unfortunately, the face swap only works for human images. It is designed to detect and align human facial features to ensure accurate and realistic results.
- What is a talking photo?
A talking photo is a static image enhanced with AI to simulate human-like speech and expressions, transforming it into a dynamic and engaging character. It’s perfect for e-learning, greeting videos, product explainers, customer service, and more, by generating realistic voiceovers and animations based on a portrait with audio. This simple, efficient, and budget-friendly way to create content adds a personal touch and helps build stronger connections at scale with ease.
- Can I use any image to generate a talking photo?
Yes, A2E supports all types and styles of photos for talking avatars. From real humans and AI-generated avatars to half or full-body shots and expressive poses, A2E brings them all to life with stunning realism.
- What is the maximum duration supported for talking photo generation?
A2E currently supports generating videos up to 1 minute long from photos.
- What languages can I add to make a photo talk?
The language support varies based on the input method you choose:
Text-to-Speech Input: A2E currently supports up to 29 languages, including English, Chinese, Spanish, Arabic, Russian, Portuguese, French, German, Korean, Japanese, Hindi, Thai, Vietnamese, Turkish, Filipino, Finnish, Czech, Danish, Dutch, Polish, Romanian, Slovak, Swedish, Croatian, Indonesian, Italian, Bulgarian, Greek, Malay, Tamil, Ukrainian.
Audio Uploads: A2E supports any language and dialect, allowing for unlimited flexibility.
- How many faces can I animate to make photos talk?
At the moment, A2E supports animating one face per photo.
- Is It Safe to Create a Talking Avatar?
Yes! Creating talking AI avatars with A2E is safe: no viruses, no malware, and no data leaks.
- Why Can’t I Generate a Talking Photo?
There are two possibilities:
1. Your photo doesn’t meet our requirements.
You must upload a clear, front-facing image under 10MB. Photos in which the face is partly covered will not pass our moderation process. Don’t worry, we will inform you when your photo can’t be recognized. Upload one that meets the requirements and try again.
2. Your script includes inappropriate or sensitive information.
We suggest using a script with formal, moderate, and positive wording. If the script contains illegal or restricted content, such as sexual abuse, fraud schemes, terrorism and violence, or private information, the talking avatar video cannot be created.
- What is head swap?
Head swap traditionally refers to an editing technique that replaces someone’s head with another one in an image. Thanks to AI technology, people can easily and quickly swap heads online for free in a few clicks, with no editing skills needed.
- How does AI work for head swap?
The AI-driven head swap tool analyzes the details of the uploaded photos and quickly detects the heads in them. It then performs the head-to-head transformation seamlessly and generates a natural-looking result.
- What is the best AI head swap tool?
A2E AI Head Swap is the best free online AI head swap tool. It can not only swap a whole head, including hair, in a photo, but also lets you replace a head with any character.
- Is it free to swap heads online?
New users receive bonus credits, which are spent when using features.
- Can you swap heads in any photo?
A2E AI Head Swap supports multiple image formats, including JPG, PNG and WebP. You can upload any photo with supported formats for head swap.
- Can you swap heads in videos?
A2E AI Head Swap cannot replace heads in videos in the current version, but a video head swap feature will be added soon. Stay tuned.
- Can you swap multiple heads at once?
Unfortunately, you can’t swap multiple heads with A2E AI Head Swap right now. The tool is continuously updated to support new functions. Stay tuned.
- How to achieve the best head swap result?
Here are a few tips.
1. Please use clear front-facing images.
2. Please confirm that nothing covers or blurs the head and face.
3. Upload high-quality photos with good lighting.
- What is AI Dance?
AI Dance is a photo dance program that animates still images into dancing videos.
- How Does AI Dance Work?
A2E dancing photo AI works by first learning from a vast dataset of real human dancers and their moves, followed by analysis and generation of new AI dance move sequences using deep learning and AI algorithms.
- Is This Dance Animator to Use?
Yes. A2E is 100% for users all around the world, encouraging more audiences to have fun with AI tech and dances.
- Can I Make a Picture Talk?
Yes! Everything is possible with A2E AI. To make a picture talk, you can head to A2E Talking Photo. More than that, A2E offers various AI tools such as a face swapper, and AI dubbing to have fun with.
- Can I share my AI-generated dance video on social media?
Absolutely! Once your dance video is generated, you can easily save it and share it on popular social media platforms like TikTok, Instagram, and Facebook. Show off your unique dance moves and go viral!
- Is commercial use allowed?
Yes, commercial use is permitted for the generated images. You may utilize these images for any legal purposes.
- Is there an API available?
Yes! To access all available APIs, please check our API documentation at https://api.a2e.ai.
- Is there a copyright on the output?
The images generated by the AI are not subject to copyright.
- Can I get higher resolution or higher quality images?
Currently, we do not offer images in higher resolutions or qualities. The available quality will improve over time.
- Is the quality of the images suitable for printing?
The quality is generally sufficient for printing smaller images. However, larger prints might appear blurry.
- What does Text to Image AI mean?
Imagine having a magical artist living inside your computer who can draw anything you describe. That’s essentially what a Text to Image AI generator is. In other words, it’s a sophisticated digital tool that turns written descriptions into visual images. You type in a description, like “a cat wearing a top hat and monocle”, and Image Generator or one of our other tools uses its vast knowledge of images and language to create a picture that matches your prompt.
Text to Image AI has a wide range of applications, from helping designers visualize concepts quickly to assisting writers in creating book covers or marketing materials. It’s particularly useful for generating unique, customized images that might be difficult, time-consuming, or expensive to produce through traditional means.
- How does AI generate an image from text?
The model has been trained by studying an extensive set of images and their descriptions, much like a student learning from countless textbooks and examples. This training allows the AI to understand the connection between words and visual elements.
- Do you have a tool to generate images of a target person?
Yes, you can use our text-to-image tool.
- Where can I use the music generated on a2e.ai?
You can use the AI music generator to generate music for Video content (YouTube), Podcasts, Games, Short films/Trailers, AI Art, Social Media, Audiobooks, Advertisements, Livestreams.
A2E.ai can be used for all your background music requirements across platforms.
- Do I own the rights to the track I downloaded from a2e.ai?
You get a non-exclusive perpetual licence for the generated and downloaded track. This licence gives you the rights to use the music for your video or audio content (podcast, talk show, audiobook) and monetise the content worry free.
However, A2E.ai remains the owner of the tracks generated and downloaded from the AI music creator.
- How is A2E AI Different from Other Audio Generation Tools?
A2E AI stands out with its advanced contextual understanding, real-time processing capabilities, and high-quality output. Compared to traditional solutions, our AI technology delivers more natural and accurate audio generation results.
- How Does A2E AI Generate Audio for Videos?
A2E AI uses advanced machine learning algorithms to analyze video content and generate corresponding audio. Our AI system understands visual context, actions, and scene elements, creating sound effects and ambient audio that perfectly match your video.
- Can It Handle Different Types and Lengths of Videos?
Yes! Our AI system is designed to process any video format, for videos between 1 and 30 seconds long. Within that range, A2E AI delivers consistently high-quality results.
- What is Nano Banana?
It’s a revolutionary AI image editing model that transforms photos using natural language prompts. This is currently the most powerful image editing model available, with exceptional consistency. It offers superior performance compared to Flux Kontext for consistent character editing and scene preservation.
- How does it work?
Simply upload an image and describe your desired edits in natural language. The AI understands complex instructions like “place the creature in a snowy mountain” or “imagine the whole face and create it”. It processes your text prompt and generates perfectly edited images.
- How is it better than Flux Kontext?
This model excels in character consistency, scene blending, and one-shot editing. Users report it “completely destroys” Flux Kontext in preserving facial features and seamlessly integrating edits with backgrounds. It also supports multi-image context, making it ideal for creating consistent AI influencers.
- Can I use it for commercial projects?
Yes! It’s perfect for creating AI UGC content, social media campaigns, and marketing materials. Many users leverage it for creating consistent AI influencers and product photography. The high-quality outputs are suitable for professional use.
- What types of edits can it handle?
The editor handles complex edits including face completion, background changes, object placement, style transfers, and character modifications. It excels at understanding contextual instructions like “place in a blizzard” or “create the whole face” while maintaining photorealistic quality.
- Do I need to upload an image to use it?
You can either generate images directly from text descriptions or upload existing images for editing.
- How fast is the generation speed?
Generation typically takes 3-5 seconds, depending on the image complexity and resolution.
- Can I edit the same image multiple times?
Yes, you can continue editing based on the initial generated result.
- What is Viggle(Actor Swap)?
face swap apps, it delivers full photo-to-video motion transfer.
- Is Viggle(Actor Swap) similar to Viggle?
Yes. Actor Swap is a Viggle-like product, offering the same photo-to-video animation concept but with more realistic motion replacement, higher-quality results, and a more affordable price.
- Can I use Viggle(Actor Swap) for social media?
Absolutely. Actor Swap videos are perfect for TikTok, Instagram, and YouTube creation.
- How does Viggle(Actor Swap) work?
- Upload a photo.
- Select a target video.
- Actor Swap replaces the actor in the video with your photo, following all motions.
- Download and share.
- How does A2E AI talking video work?
A2E uses advanced deep learning technology, combining GAN (Generative Adversarial Networks) and SyncNet synchronization detection networks to precisely analyze audio phoneme features and automatically reconstruct lip movements in videos to achieve perfect synchronization with new audio. This technology is widely used in film post-production, content creation, and corporate communications.
- What video and audio formats are supported for lip synchronization?
We support mainstream formats. Input video should be MP4 (H.264 encoding recommended), and input audio can be MP3, WAV, M4A, and other formats. Output is a high-quality MP4 video file at 720P to 1080P resolution. We recommend an input video resolution of no more than 1920×1080 for optimal processing results and speed.
- What is the accuracy of AI lip sync? How long does it take to process a video?
Our AI model has high lip synchronization accuracy and can handle various languages and dialects. Processing time depends on video length and complexity: typically a 1-minute video takes 10-20 minutes to process, with complex scenes potentially requiring longer. We continuously optimize processing speed to provide better user experience.
- What’s the difference between Free and Professional plans? How to choose the right plan?
The Free version is available on the try-free page for experiencing basic lip sync functionality, suitable for personal testing and light usage; the Professional plan offers higher quality output, faster processing speed, batch processing, priority technical support and other advanced features. For API service access, please contact us for customized solutions. For commercial use or high-frequency usage needs, we recommend the Professional plan.
- What types of videos are supported? Are there any limitations?
Currently, we mainly support single-person videos, which give the best lip synchronization results. Faces and mouth areas should be clear and visible. Typical uses include personal video content creation, online education courses, corporate training videos, product introduction videos, and social media content. Complex scenes in which multiple people speak at the same time are not currently supported.
- How is video content security guaranteed?
We value user privacy. Uploaded video files are processed on our servers and are periodically deleted after processing is complete. We recommend that users not upload videos containing sensitive information. For special security requirements, please contact us to discuss solutions.