Differences between revisions 91 and 108 (spanning 17 versions)
Revision 91 as of 2010-05-20 15:12:58
Size: 16689
Editor: FaheemMitha
Comment: Add socghop link
Revision 108 as of 2016-02-12 01:18:11
Size: 3417
Comment:
Deletions are marked like this. Additions are marked like this.
Line 2: Line 2:
= Google Summer of Code 2010 =
For the last few years, Google has offered a fantastic opportunity for students to help out Open Source software projects in the summer while getting paid for it. It's called '''Google Summer of Code™''', and it provides free software projects a great way of attracting development effort while providing software developers who are still in university with some interesting and useful experiences. Find out more about the Summer of Code (SoC) from [[http://code.google.com/summerofcode.html|their site]].
= Google Summer of Code =
Line 5: Line 4:
Mercurial is a Distributed Version Control System (DVCS), and we would like to participate in 2010's Summer of Code. In recent years (notably since the Linux kernel project started looking for a new VCS), DVCSs have rapidly started to gain traction. For open source projects, especially, the use of a distributed system can be a great enabler, in that it's much easier to keep track of your own changes and publish them back to the community in a structured way. With centralized systems, you have a central repository that only a happy few have (write) access to, and people spend a lot of time mailing around diffs. With distributed systems, everyone has their own branch, and it's very easy to publish it on the web, compare the changes to those on some other branch or merge two branches back together.

We believe distributed version control is the future, and it looks like many other technologists believe the same. There's some heavy competition between open source VCS systems: git, Bazaar-NG and Mercurial are constantly fighting over users, while many more conservative projects are still on Subversion or even CVS. Competition is fierce, stimulating fast-paced development in the three DVCS contenders. It's therefore important (and good for the competition) that projects like ours gain more developers, to keep up with the race for robustness, performance and features. We view the Summer of Code as a great opportunity to help us in this competition.

Written largely in Python (with the exceptions of a few performance-critical core modules), it is easy to develop for and allows for good code organization and clean abstractions. Moreover, it has been optimized from the start for good performance, using an append-only datastore, minimizing random disk access and smart uses of compression. Finally, it has extensive features for sharing changesets, over HTTP, SSH and in files over email and has an innovative extension (Mercurial Queues) to help building and organizing changesets into a thoroughly useful source history.
Information about Mercurial's GSoC program.
Line 13: Line 8:
== About Mercurial and GSoC ==

For the last few years, Google has offered a fantastic opportunity for students to help out Open Source software projects in the summer while getting paid for it. It's called '''Google Summer of Code™''', and it provides free software projects a great way of attracting development effort while providing software developers who are still in university with some interesting and useful experiences. Find out more about the Summer of Code (SoC) from [[http://code.google.com/summerofcode.html|their site]].

Mercurial is a popular project with many enthusiastic users and a dynamic development community. By participating in GSoC with Mercurial, you'll be exposed to exciting technology and top-notch developers.

<!> See [[SummerOfCode/Ideas2016|Ideas]] for project ideas and add yourself to [[SummerOfCode/2016|this year's status page]] to get involved.

=== Diversity ===

GSoC and open source development in general attracts a wide range of participants from around the world, but historically women have been very under-represented.
So this year, Mercurial (and other projects under the Python umbrella) are actively encouraging more women to apply. Read more about Python's [[http://wiki.python.org/moin/DiversityInPython|diversity efforts]].
Line 14: Line 22:
Here are some tips on what you might want to include in your application.
Line 16: Line 23:
 * First of all: get feedback on your proposal from the community. The MailingLists and IRC (#mercurial on irc.freenode.net) are quite responsive. Ask around and get to know some people, see if they think your project is feasible or how you should change the scope to better fit the timeline and the project. (If you are new to Open Source development, reading [[http://producingoss.com/en/communications.html#you-are-what-you-write|this part of "Producing OSS"]] may help you find the right tone.) Talking to people before applying dramatically increases your chances of approval: it shows us that you are genuinely able to communicate your ideas, allows us to gauge your expertise level and see who might be the right mentor for you. Here are some tips on how to apply:
Line 18: Line 25:
 * Tell us about yourself. We don't know you, so it helps if you give an outline of your background and what your prior experiences are (e.g. Open Source development, using Python, general software engineering experience or education). If it turns out you wouldn't be a good fit for Mercurial or your project idea, it's helpful for everyone involved if that can be determined before you get started. However, don't hesitate to apply just because this would be your first time working in Open Source or with Python; at some point this was new for each of us.  * Get acquainted! Introduce yourself on the [[MailingLists]] and talk to us on [[IRC]]. We're much more likely to accept applications from people we recognize.
Line 20: Line 27:
 * Make it clear that you've thought through your application. Don't use a project idea from this page verbatim. Instead, come up with your own proposal, or expand on a proposal from this page. Explore the code a little bit (it's Python, quite easy to read), read CategoryContributing to get a feel for how the community works. Make a clear schedule to show the intermediate milestones you'll reach while progressing towards your final goal.  * Get feedback on your proposal! We want to know all about you and your proposal before we have to choose between projects.
Line 22: Line 29:
 * Let us know why you care, what you like about DVCSs in general and Mercurial specifically, what things you think could be improved and how they could be improved. We like working on this project because this project makes our work more effective, and we hope you'll like it, too. Showing your motivation helps.  * Submit a patch! We ask applicants to pick a simple issue from the [[BugTracker]] and [[ContributingChanges|send a patch for it]]. This will help demonstrate some of the basic skills you'll need for your summer project (using our tools, reading the code, talking to people, etc.).
Line 25: Line 32:
Some tips on how you can get to project completion within the scheduled timeline.
Line 27: Line 33:
 * Working on Mercurial in the summer should be your main activity. Having a vacation for one or two weeks is fine, of course, but we want you to take the project and the time mentors put into it seriously. If you are working two other jobs this summer, that probably won't work. Realize that collaborating on software development takes time, and not just the time used to reason about and write the code. This also means that we want you to set some intermediate milestones to be able to keep track of your progress. The primary challenge of GSoC is not technical! Your goal as a GSoC student should be learning how to interact with other developers on a real-world project, and how to develop effectively in a cooperative environment.

 * Focus. Working on Mercurial in the summer should be your main activity. Realize that collaborating on software development takes time, and not just the time used to reason about and write the code. This also means that we want you to set some intermediate milestones to be able to keep track of your progress.
Line 31: Line 39:
 * We want you to work in the open, with our community. Get on the MailingLists, both to ask for help and to provide it to other users, spend time on IRC (#mercurial on irc.freenode.net) discussing your work with other developers and explaining Mercurial to all the new users coming in with questions. Set up a public repository with an MQ containing your patches against the [[CrewRepository|crew repository]] (on [[http://bitbucket.org/|Bitbucket]], for instance; see also MercurialHosting).  * Participate. Developing your own code is only part of the open source experience. You should also be getting involved with discussions on our mailing lists with users and developers, commenting on other code, responding to bug reports, editing the wiki, etc.
Line 33: Line 41:
 * We're not going to just compare what you did at the end to what you stated you'd be doing in the beginning. We want you to put effort into the project, to think about the feature you're doing, to communicate with the community and integrate your code with the project. If you end up implementing some other cool feature or fixing some annoyance, that is great as well.

== Project Ideas ==
Here are a bunch of project ideas you might like to apply for. Of course, if you have a different idea of something in Mercurial that badly needs fixing or some feature you think would make a difference, go ahead and apply with it! Some more project ideas can be found via NewFeatureDiscussions, CategoryNewFeatures and NewIdeas.

=== Lightweight copies/renames ===
(very difficult - a successful student will become an expert in Mercurial's storage format and transmission protocol)

Copies and renames currently are not too efficient. Mercurial copies the copied/renamed source file to the new initial revision of the target file in its internal history store. For renames, this is especially counter-intuitive, as renaming a large file grows the store by the file's size. It would be better if Mercurial had some way of referring to the existing revision from the new file, while preserving backwards compatbility and bounded I/O guarantees for retrieving revisions. See [[http://www.selenic.com/mercurial/bts/issue883|issue883]] for discussion. There's an mq from an old attempt at this located [[http://bitbucket.org/Murkt/hg-lwcopy-mq/|here]].

Contact: mpm, tonfa, cyanite

=== Parent Delta ===
(very difficult - a successful student will become an expert in Mercurial's storage format and transmission protocol)

Revlogs such as the manifest that have significant amounts of branching can suffer from revlog's linear delta model. Parent delta would allow storing deltas against parent revisions, greatly improving compression. Like lightweight copies, it will require extending the wire protocol to allow backwards compatibility.

Contact: mpm, tonfa

=== Better Changeset Discovery ===
(difficult - a successful student will become an expert in Mercurial's protocol and will experiment with difficult graph theory)

Mercurial currently uses a simple graph discovery protocol to discover what new changesets need to be pushed or pulled without transmitting the entire list available on either side. We suspect that a significantly better protocol exists that has fewer round trips and less data transfer, but research remains.

Some details can by found at DiscoveryPlan, you can also ask parren or tonfa on IRC.

Contact: parren, tonfa

=== Partial cloning ===
(difficult - a successful student will become familiar with most of Mercurial's core algorithms)

(existing work in progress - contact Peter Arrenbrecht, parren on irc)

Currently, it's only possible to clone one whole repository at a time. PartialClone and TrimmingHistory could help make cloning more efficient by limiting the cloning process in either of two dimensions: time or space. For time, we could maybe clone the last few [[ChangeSet|changesets]] and lazily fetch the rest as needed. For space, it would be nice if it was possible to clone just a subtree of any repositories. For these features, any number of thorny issues can arise because of current assumptions in Mercurial code. These are hard projects, but the result will be worth it to many Mercurial users (in terms of developers and in terms of projects).

Contact: parren

=== Instantaneous status on Windows, OS X ===
(difficult - a successful student will need to debug complex race conditions and master Mercurial's dirstate algorithms)

The InotifyExtension makes a huge difference to performance on moderate to large [[Repository|repositories]] on Linux (though it still has some difficult bugs). Windows and Mac OS X provide file status notification APIs, so it should be possible to port the inotify extension to one or other (or both) of these platforms, providing the same kinds of speed improvements as on Linux. (Don't try to do all of these in one project; instead, pick one platform you are comfortable with, read about the relevant APIs, then come up with a coherent proposal.)

Contact: nicdumz

=== TortoiseHg ===
TortoiseHg is a GUI front-end, similar to TortoiseCVS and TortoiseSVN. For many people, this makes interacting with Mercurial much easier. There's a lot of room for improvement. An applicant could pick one or more of:

 * add graphical UI to interesting extensions like bfiles and others
 * add a repository monitoring mechanism, to detect when a GUI application requires a refresh
 * add support to Mercurial for paramiko as an ssh agent, hooking password prompts through ui.password
 * integrate Meld, which is also written in PyGTK and just recently added support for Windows
 * tight integration of hgtk.exe into Visual Studio
 * graphical hunk splitting, as discussed in the next task

or their own ideas.

Contact: muggs

=== Interactive patch selection for commit/Mercurial Queues/record/import ===
Being able to select parts of the existing changes, with hunk or greater granularity, in an interactive way, can improve the use of commands and extensions that take changes, such as commit, MqExtension({{{Mercurial Queues}}}) and import. The RecordExtension currently allows patch hunk selection, but sometimes a better granularity is desired, as when a set of adjacent function definitions should go in different commits. This feature could be added as an {{{--interactive}}} mode for many of Mercurial's core commands.

=== Mercurial on Jython ===
Some Sun projects have taken to using Mercurial as their VCS of choice (OpenJDK, NetBeans; see ProjectsUsingMercurial for a full list). Recently, Mercurial has re-gained pure-Python implementations of the modules that are now in C. It would be nice if someone could take the effort to get this code to run on Jython, opening up Mercurial for use from within the JDK.

=== Mercurial on Py3k ===
(difficult - a successful student will have to master Mercurial's pervasive approach to character set handling and reconcile it with Py3k's)

In 2008, the Python project has finally released their 3.0 branch of Python (affectionately known as Python 3000). This is a major release of Python, with some large changes. Porting to Py3k would be an interesting effort, operating on the current edge of the software, with nice long-term viability results for Mercurial.

=== GUI integration ===
Mercurial is currently lacking features to support integration into a GUI, like TortoiseHg or Mercurial Eclipse. One example of a feature that would benefit GUI is [[http://www.nabble.com/Progress-bar-on-network-actions-tt20596124.html#a20596124|a progress indicator]] on operations that may take some time.

=== Web interface work ===
The hgwebdir repository interface is one of the successful parts of Mercurial. It allows projects to quickly setup a web interface with access to multiple repositories, and it has some decent ACL features. It's a Python WSGI application, meaning it's relatively easy to run on many platforms and behind differing web servers. It can still be improved, though: there are still some holes in the WSGI implementation (notably, it writes to stdout/stderr in some error conditions) and the access control features aren't always functional enough for complex setups.

Also, improved support for named branches and filtering (like hg log -f and the like) are often requested features.

Also, improving/updating the django application "FreeHg" would be an option, with:

 * server side clone
 * access rights, with users/groups

Contact: djc

=== Conversion tools ===
Mercurial is a relatively new entrant in the VCS market, and many projects are still using older VCSs such as CVS and SVN. While we currently have some tools to help migrate to Mercurial in the form of the ConvertExtension, these tools could certainly use more improvements. Specifically, enabling the use of Mercurial as a client for SVN or even git and/or Bazaar-NG repositories would be very nice, as it enables developers to make their own choice regarding the use of their VCS client, thereby drastically enlarging our userbase.

The HgSubversion extension already works quite well, but better integration with the existing Mercurial user interface would be nice.

Contact: durin42, pmezard

=== Performance tuning ===
(difficult - a successful student will master performance benchmarking, tuning Python algorithms, and possibly writing C extensions in Python)

There are numerous opportunities for additional performance tuning. Possibilities include:

 * faster startup (can we optimize Python load time further? can we minimize loading of unused code further?)
 * faster status (can we further improve directory walking? ignore logic? dirstate parsing?)
 * faster checkout (can we reduce the overhead of our file I/O paths?)

A project in this area would probably involve picking an important benchmark (eg checkout of a large repository) and tuning multiple areas to meet a performance goal (eg 2x overall performance increase).

== Mentors ==
A list of potential mentors.

 * [[mpm|MattMackall]] (mpm on IRC; creator of Mercurial)
 * DirkjanOchtman (djc on IRC)
 * PeterArrenbrecht (parren on IRC)
 * AugieFackler (durin42 on IRC)
 * [[tonfa|Benoit Boissinot]] (tonfa on IRC)
 * MartinGeisler (mg on IRC)
 * BrodieRao (brodie on IRC)

== Students Considering Application ==
''Go on, list your name here!''

 * [[PaulMalmsten|Paul Malmsten]]
 * [[MarkDetermann|Mark Determann]]
 * [[RafaelMartins|Rafael Martins]]
 * [[VishakhHarikumar|Vishakh Harikumar]]
 * [[Pradeepkumar]]
 * [[AlexGolec|Alexander Golec]]
 * [[DanielHalmschlager|Daniel Halmschlager]]
 * [[RenatoCunha|Renato Cunha]]

== Applications Received From ==
''Not Yet''.

== Project status ==
''Not Yet''.

See http://socghop.appspot.com/gsoc/org/home/google/gsoc2010/hg

for more up to date information.

== Other GSoc Projects related to Mercurial ==
Line 170: Line 42:
CategoryNewFeatures

~-"Google Summer of Code" is a trademark by Google Inc.-~
CategoryGsoc

Google Summer of Code

Information about Mercurial's GSoC program.

1. About Mercurial and GSoC

For the last few years, Google has offered a fantastic opportunity for students to help out Open Source software projects in the summer while getting paid for it. It's called Google Summer of Code™, and it provides free software projects a great way of attracting development effort while providing software developers who are still in university with some interesting and useful experiences. Find out more about the Summer of Code (SoC) from their site.

Mercurial is a popular project with many enthusiastic users and a dynamic development community. By participating in GSoC with Mercurial, you'll be exposed to exciting technology and top-notch developers.

<!> See Ideas for project ideas and add yourself to this year's status page to get involved.

1.1. Diversity

GSoC and open source development in general attracts a wide range of participants from around the world, but historically women have been very under-represented. So this year, Mercurial (and other projects under the Python umbrella) are actively encouraging more women to apply. Read more about Python's diversity efforts.

2. Notes on applying

Here are some tips on how to apply:

  • Get acquainted! Introduce yourself on the MailingLists and talk to us on IRC. We're much more likely to accept applications from people we recognize.

  • Get feedback on your proposal! We want to know all about you and your proposal before we have to choose between projects.
  • Submit a patch! We ask applicants to pick a simple issue from the BugTracker and send a patch for it. This will help demonstrate some of the basic skills you'll need for your summer project (using our tools, reading the code, talking to people, etc.).

3. Getting things done

The primary challenge of GSoC is not technical! Your goal as a GSoC student should be learning how to interact with other developers on a real-world project, and how to develop effectively in a cooperative environment.

  • Focus. Working on Mercurial in the summer should be your main activity. Realize that collaborating on software development takes time, and not just the time used to reason about and write the code. This also means that we want you to set some intermediate milestones to be able to keep track of your progress.
  • Communicate. In some ways, open source software development is more about communicating than about writing code. Some part of your time will be spent writing code, but a large part of the time spent will go towards explaining the code on mailing lists, asking and answering questions on IRC and in general reasoning about proposed software changes. If you can't do this, you take the risk of not understanding the goals of our project or not being able to explain why your patch is necessary.
  • Participate. Developing your own code is only part of the open source experience. You should also be getting involved with discussions on our mailing lists with users and developers, commenting on other code, responding to bug reports, editing the wiki, etc.


CategoryGsoc

SummerOfCode (last edited 2018-09-04 17:05:38 by MartinVonZweigbergk)