This is an old revision of the document!
SFTP/FTPS file transfers in Microsoft Azure WebJob
If you need to export/backup data from your Microsoft Azure WebSite to a remote server or import data from the remote server to your WebSite, you can use WinSCP from Azure WebJob.
The Azure WebJob is a script or an application run on Azure WebSite server that has a read/write access to your web site. The job can be executed on demand or schedule. To implement file transfers for the WebJob, you can either run WinSCP in a scripting mode from a batch file; use WinSCP .NET assembly from a PowerShell script or develop a (console) application that uses the WinSCP .NET assembly.
Advertisement
You can also use WebJob without having an actual web site as a cloud-hosted scheduled task.
Note that WinSCP 5.5.x and older is not compatible with a restricted Azure environment. You need to use WinSCP 5.6.x beta.
- WebJob structure
- Deploying WebJob
- WebJob Environment
- Copying Session Log to Job Log
- Implementation
- Further Reading
WebJob structure
WebJob is a package that contains your job executable (e.g. a batch file or a console application) along with all supporting files (script, WinSCP executable, WinSCP .NET assembly, etc). Azure platform automatically tries to detect what executable is the main one. To ease this, you should name the main executable run.*
, e.g. run.bat
for a batch file.
See implementation sections below for examples of the package contents.
Deploying WebJob
To deploy your web job:
- Pack all the job files into a ZIP archive;
- Navigate to your web site page on Azure Management Portal;
- Switch to WebJobs tab;
- Use Add command on bottom bar;
- Give your WebJob a name; pick your ZIP archive and set up how the job is run.
Advertisement
After your WebJob is uploaded, it is extracted to /site/wwwroot/App_Data/jobs/type/name
, where type
is triggered
for Scheduled and On demand jobs and continuous
for Continuous jobs. From now on, you can manage/update the job files using FTPS.
WebJob Environment
In the WebJob environment, your web site data (as you see them over FTPS) are accessible at D:\home
. The path is stored in an environment variable HOME
. Other useful environment variables are:
Variable name | Description | Example |
---|---|---|
HOME |
Web site root directory | D:\home |
WEBJOBS_DATA_PATH |
Job data directory | D:\home\data\jobs\type\name . |
WEBJOBS_NAME |
Job name | |
WEBJOBS_PATH |
Temporary directory, where job is running | C:\DWASFiles\Sites\~1sitename\Temp\jobs\type\name\lpej4hrk.fks |
WEBJOBS_RUN_ID |
An unique ID of the job run (an UTC timestamp of the run). There’s a data folder created for every run in %WEBJOBS_DATA_PATH%\%WEBJOBS_RUN_ID% |
201409090707416329 |
WEBJOBS_TYPE |
Job type | triggered or continuous |
WEBROOT_PATH |
Web site root directory | D:\home\site\wwwroot |
WEBSITE_SITE_NAME |
Web site name |
When the WebJob is executed, its files are cloned to a temporary folder (see WEBJOBS_PATH
) and the job is run there. So it makes no sense to store any data to job’s working directory, as that is not persistent and is not accessible remotely. Use WEBROOT_PATH
to locate either job’s master directory (%WEBROOT_PATH%\App_Data\jobs\%WEBJOBS_TYPE%\%WEBJOBS_NAME%
) or WEBJOBS_DATA_PATH
to locate job’s data directory. See implementation sections below for examples.
Copying Session Log to Job Log
The standard output of the job is redirected to a log file which you can display on Azure Management Portal.
It is useful to have WinSCP session log file included into the job’s log. For that as the last step of the job, print the WinSCP session log to the standard output. See implementation sections below for examples.
To display the job log, go to WebJobs tab of your web site page on Azure Management Portal and click on a link in Logs column of the job.
Implementation
Using Scripting
If your transfer/synchronization task is simple, use WinSCP scripting. For complex tasks, see below for solutions using WinSCP .NET assembly from a PowerShell script or console application.
Advertisement
Start with learning how to automate file transfers (or synchronization) to FTP server or SFTP server.
Comparing with generic WinSCP script examples, your Azure transfer job should:
- have a wrapper batch file named
run.bat
so that Azure correctly identifies it as the main executable; - the wrapper batch file should propagate WinSCP exit code, so that Azure correctly identifies failures (any non-zero exit code is a failure to Azure, the same for WinSCP);
- enable session logging to a unique job run directory (
%WEBJOBS_DATA_PATH%\%WEBJOBS_RUN_ID%
); - locate source or destination paths of the transfer using WebJob environment variables;
- print session log to a standard output, so that it is available from Azure Management Portal.
An example script (script.txt
) that backs up the WebSite to a remote SFTP server:
option batch abort option confirm off # Connect open sftp://user:password@example.com/ -hostkey="ssh-rsa 2048 xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx" cd backup # Use a job run ID (timestamp) as a backup ID mkdir %WEBJOBS_RUN_ID% # Upload a whole web site put %WEBROOT_PATH%\* %WEBJOBS_RUN_ID%/ exit
An example wrapper batch file (run.bat
):
@echo off set LOG=%WEBJOBS_DATA_PATH%\%WEBJOBS_RUN_ID%\session.log echo Running script... winscp.com /script=script.txt /log=%LOG% set RESULT=%ERRORLEVEL% echo Result code is %RESULT% rem Dumping session log to Azure log (standard output) when it exists if exist %LOG% ( echo Session log: type %LOG% ) rem Propagating WinSCP exit code to Azure exit %RESULT%
A complete WebJob package (to be deployed to Azure WebSite) consists of following files:
winscp.com
executable;winscp.exe
executable;script.txt
script;run.bat
wrapper batch file;- optionally a private key file for SSH authentication.
Advertisement
Using PowerShell
For a complex standalone transfer/synchronization tasks, you can use WinSCP .NET assembly from a PowerShell script.
Start with learning how to use WinSCP .NET assembly from PowerShell.
Comparing with generic WinSCP PowerShell examples, your Azure transfer job should:
- have a wrapper batch file named
run.bat
so that Azure correctly identifies it as the main executable (Azure WebJob cannot run PowerShell scripts directly); - the PowerShell script should propagate any error from WinSCP code to an exit code, so that Azure correctly identifies failures (any non-zero exit code is a failure to Azure);
- enable session logging to a unique job run directory (
%WEBJOBS_DATA_PATH%\%WEBJOBS_RUN_ID%
); - locate source or destination paths of the transfer using WebJob environment variables;
- print session log to a standard output, so that it is available from Azure Management Portal;
- cannot use
Write-Host
cmdlet in Azure WebJob as there is no console to write to, useWrite-Output
instead.
An example PowerShell script (backup.ps1
) that backs up the WebSite to a remote SFTP server:
try { # Load WinSCP .NET assembly Add-Type -Path "WinSCPnet.dll" # Setup session options $sessionOptions = New-Object WinSCP.SessionOptions $sessionOptions.Protocol = [WinSCP.Protocol]::Sftp $sessionOptions.HostName = "example.com" $sessionOptions.UserName = "user" $sessionOptions.Password = "mypassword" $sessionOptions.SshHostKeyFingerprint = "ssh-rsa 2048 xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx" $sessionLogPath = "$env:WEBJOBS_DATA_PATH\$env:WEBJOBS_RUN_ID\session.log" $session = New-Object WinSCP.Session try { $session.SessionLogPath = $sessionLogPath # Connect Write-Output "Connecting..." $session.Open($sessionOptions) $backupPath = "/home/user/backup/$env:WEBJOBS_RUN_ID"; $session.CreateDirectory($backupPath) Write-Output "Uploading..." $session.PutFiles("$env:WEBROOT_PATH\*", "$backupPath/*").Check() } finally { # Disconnect, clean up $session.Dispose() } $result = 0 } catch [Exception] { Write-Output $_.Exception.Message $result = 1 } # Dumping session log to Azure log (standard output) when it exists if (Test-Path $sessionLogPath) { Write-Output "Session log:" Get-Content $sessionLogPath } # Reporting result to Azure exit $result
Advertisement
An example wrapper batch file (run.bat
):
powershell.exe -ExecutionPolicy Unrestricted -File backup.ps1
A complete WebJob package (to be deployed to Azure WebSite) consists of following files:
winscp.exe
executable;winscpnet.dll
.NET assembly;backup.ps1
script;run.bat
wrapper batch file;- optionally a private key file for SSH authentication.
C# Console Application
TODO